Dictionary: Clean Employee Data
Request access to our full documentation
This is a simplified version of our documentation. If you want to:
- Access additional data samples
- Learn more about our cleaning and enrichment process
- Explore the complete list of data sources we offer
Contact our team and get access to more information:
Clean Employee Data provides high-quality, structured workforce data that is ready for immediate use. Our data is meticulously cleaned and enriched, enabling businesses to streamline operations, enhance decision-making, and optimize workforce analysis.
By leveraging Clean Employee Data, organizations can reduce engineering overhead, gain access to additional insights, and work with optimized data formats for improved efficiency. The data is available in JSONL, Parquet, and CSV formats, ensuring faster downloads and seamless integration.
With flexible retrieval options—including flat file downloads and API access—businesses in sales tech, HR intelligence, and investment sectors can efficiently access the workforce insights they need.
Clean Employee Data is derived from our Base Employee Data.
The data points are separated into collections to visualize the data better.
Data point | Processing | Description | Data type |
---|---|---|---|
member_last_updated | Cleaned | Date the record was last updated | String |
member_is_deleted | Raw | Indicates whether the profile was accessible: 1 - deleted or private 0 - publicly available | Integer |
Data point | Processing | Description | Data type |
---|---|---|---|
member_id | Raw | Identification key in our database | Integer |
member_websites_professional_network | Raw | Professional network profile URL | String |
member_picture_url | Raw | Profile picture URL | String |
member_full_name | Cleaned | Full name | String |
member_name_first | Raw | First name | String |
member_name_middle | Enriched | Middle name | String |
member_name_last | Enriched | Last name | String |
member_shorthand_names | Raw | A list of all historical employee shorthand names | Array of strings |
member_follower_count | Raw | Number of profile followers | Integer |
Data point | Processing | Description | Data type |
---|---|---|---|
member_skills | Enriched | List of employees' skills | Array of strings |
Data point | Processing | Description | Data type |
---|---|---|---|
member_description | Raw | Job position description | String |
company_id | Enriched | Identification key for the company associated with the employee's experience | Integer |
member_job_title | Cleaned | Current job position title | String |
is_decision_maker | Enriched | Indicates whether the employee is a decision-maker based on member_job_title 1 - Employee is marked as a decision-maker in the current role 0 - Employee is not marked as a decision-maker in the current role | Integer |
member_job_description | Raw | Current job position description | String |
member_headline | Raw | Job title found in the profile headline | String |
member_generated_headline | Raw | A user-written headline that can be found in web search, also viewed and other publicly available spaces. It serves the same purpose as the title but is derived from a different source, potentially providing more accurate and up-to-date profile information. This field should be used in place title as it reflects the latest user activity. | String |
total_experience_duration | Enriched | Summed up experience (displayed as years and months) | String |
total_experience_duration_months | Enriched | Summed up employee experience (displayed as months) | Integer |
The member_experience table is mapped with our historical data due to professional network hiding the work experience on certain employees' profiles.
Data point | Processing | Description | Data type |
---|---|---|---|
member_experience | - | Employee's work experience | Array of objects |
company_id | Raw | Workplace (company) identifier in our database | Integer |
date_from | Cleaned | Employment start date | String (date) |
date_from_year | Cleaned | Employment start year | Integer |
date_from_month | Cleaned | Employment start month | Integer |
date_to | Cleaned | Employment end date | String (date) |
date_to_year | Cleaned | Employment end year | Integer |
date_to_month | Cleaned | Employment end month | Integer |
company_url | Raw | Employee's workplace URL on professional network | String |
company_name | Raw | Employer company | |
title | Raw | Job title | String |
department | Enriched | Department the employee works in | String |
management_level | Enriched | Employee's management level | String |
description | Cleaned | Job description | String |
order_in_profile | Raw | Record order as seen on the employee's profile | Integer |
duration | Enriched | Employment duration | String (date) |
duration_months | Cleaned | Employment duration in months | Integer |
location | Cleaned | Job/workplace location | String |
Data point | Processing | Description | Data type |
---|---|---|---|
member_department | Enriched | Departments derived from the member_job_title | String |
member_management_level | Enriched | Management levels identified from the member_job_title | String |
is_working | Enriched | Represents if the employee is currently working 0 - the employee is currently not working 1 - the employee is currently working | Boolean |
Data point | Processing | Description | Data type |
---|---|---|---|
member_education | | Employee's education | Array of objects |
major | Cleaned | Field of study | String |
title | Cleaned | Educational institution | String |
date_to | Cleaned | Graduation date | Integer |
date_from | Cleaned | Enrolment date | Integer |
institution_url | Cleaned | Institution's profile URL | String |
description | Cleaned | Education description | String |
activities_and_societies | Cleaned | Details about activities and societies | String |
Data point | Description | Data type |
---|---|---|
is_hidden | Marks if the employee profile has a hidden education/experience collection. 0 - education/experience information was available at the time of profile scraping. 1 - education/experience information was not available at the time of profile scraping | Boolean |
Data point | Processing | Description | Data type |
---|---|---|---|
member_location_raw_address | Cleaned | Raw address of the employee's location | String |
member_location_country | Cleaned | Country of the employee's location | String |
member_location_regions | Cleaned | Geographical regions within the employee's country | Array of strings |
Data point | Processing | Description | Data type |
---|---|---|---|
member_recommendations | Cleaned | Employee recommendations | Array of objects |
recommendation | Cleaned | Recommendation text | String |
referee_name | Raw | Referee's name | String |
referee_url | Raw | Referee's profile URL | String |
recommendations_count | Cleaned | Number of received recommendations | Integer |
connections_count | Raw | Number of employee's connections | Integer |
Data point | Processing | Description | Data type |
---|---|---|---|
member_languages | | Employee's language knowledge | Array of objects |
language | Cleaned | Language | String |
proficiency | Cleaned | Language proficiency | String |
order_in_profile | Raw | Record order in the section | Integer |
Data point | Processing | Description | Data type |
---|---|---|---|
member_certifications | | Employee's certifications | Array of objects |
title | Cleaned | Language | String |
issuer | Cleaned | Language proficiency | String |
credential_id | Cleaned | Record order in the section | Integer |
certificate_url | Cleaned | Certificate URL | String |
date_from | Cleaned | Issue date | String |
date_to | Cleaned | Expiration date | String |
issuer_url | Cleaned | Issuer profile URL | String |
order_in_profile | Raw | Section record order | Integer |
date_from_year | Cleaned | Issue year | Integer |
date_from_month | Cleaned | Issue month | Integer |
date_to_year | Cleaned | Expiration year | Integer |
date_to_month | Cleaned | Expiration month | Integer |
Data point | Processing | Description | Data type |
---|---|---|---|
member_courses | | Attended courses | Array of objects |
organizer | Cleaned | Course organizer | String |
title | Cleaned | Course title | String |
order_in_profile | Raw | Record order in the section | Integer |
Data point | Processing | Description | Data type |
---|---|---|---|
member_awards | | Held awards | Array of objects |
title | Cleaned | Award | String |
issuer | Cleaned | Award issuer | String |
description | Cleaned | Award description | String |
date | Cleaned | Issue date | String |
order_in_profile | Raw | Section record order | Integer |
date_year | Cleaned | Issue year | Integer |
date_month | Cleaned | Issue month | Integer |
Data point | Processing | Description | Data type |
---|---|---|---|
member_activity | | Interaction with posts on professional network | Array of objects |
activity_url | Raw | Post URL | String |
title | Cleaned | Post title | String |
action | Cleaned | Interaction type | String |
order_in_profile | Raw | Section record order | Integer |