Open Net Zero logo
NASA Enterprise Directory
L o a d i n g
Owner
National Aeronautics and Space Administration - view all
Update frequencyunknown
Last updatedalmost 2 years ago
Overview

Names and contact information of NASA employees and contractors. 102,615 entries, each containing name, email, and phone.

contractorsemployees
Additional Information
KeyValue
Free text description of capture processPython: Selenium and PhantomJS for scrape, LXML for parse. Ran an exhaustive series of searches by constructing URLs. Began by searching the email field for all valid two-character combinations, followed by the wildcard '*'. If a search returned too many results to display on one page (more than 100), exhaustively appended an additional character in the next round, and so on. The process ended when searches no longer returned too many results to display on a single page. To find directory listings without email addresses, I repeated the process for last names, first names, and phone numbers. If a field included >100 identical entries, I constructed additional search loops on a case-by-case basis, all of which are included in the attached scripts. Because pages were rendered using JavaScript, I used a headless browser via Selenium and PhantomJS in Python to convert pages to static HTML. I parsed the resulting HTML files using LXML in Python, then wrote all data to a comma-delimited CSV using the package unicodecsv.
Share this Dataset
Trust Signals
Trust Framework(s)Click to view
Organisation EntryView org in directory
AssuranceIB1 Open Data (ICON)Scheme-specific:
Data Sensitivity Classunknown
Licenceunknown
Files