Transparency Toolkit collects data from a variety of open sources and makes tools to automate this process. These are some of the data sources we use and the software we created to gather data from them. For additional tools and data sources, check our GitHub.
People post all sorts of interesting information in their resumes on LinkedIn. Our LinkedIn crawler automates the process of collecting public LinkedIn profiles matching search terms or from “people also viewed” lists.
It is possible to gather unstructured data from almost any website via Google. This software automates Google searches, including support for advanced search operators (like site or inurl).
Indeed is a job listing and resume posting site. We have software for collecting these listings.