Employee Data
Clean Employee Data

Dictionary: Clean Employee Data

38min

Request access to our full documentation

This is a simplified version of our documentation. If you want to:

  • Access additional data samples
  • Learn more about our cleaning and enrichment process
  • Explore the complete list of data sources we offer

Contact our team and get access to more information:

Clean Employee Data provides high-quality, structured workforce data that is ready for immediate use. Our data is meticulously cleaned and enriched, enabling businesses to streamline operations, enhance decision-making, and optimize workforce analysis.

By leveraging Clean Employee Data, organizations can reduce engineering overhead, gain access to additional insights, and work with optimized data formats for improved efficiency. The data is available in JSONL, Parquet, and CSV formats, ensuring faster downloads and seamless integration.

With flexible retrieval options—including flat file downloads and API access—businesses in sales tech, HR intelligence, and investment sectors can efficiently access the workforce insights they need.

Clean Employee Data is derived from our Base Employee Data.

Overview

The data points are separated into collections to visualize the data better.

Data collections

Metadata

Data point

Processing

Description

Data type

member_last_updated

Cleaned

Date the record was last updated

String

member_is_deleted

Raw

Indicates whether the profile was accessible:

1 - deleted or private

0 - publicly available

Integer

Meta data



Identifiers

Data point

Processing

Description

Data type

member_id

Raw

Identification key in our database

Integer

member_websites_professional_network

Raw

Professional network profile URL

String

member_picture_url

Raw

Profile picture URL

String

member_full_name

Cleaned

Full name

String

member_name_first

Raw

First name

String

member_name_middle

Enriched

Middle name

String

member_name_last

Enriched

Last name

String

member_shorthand_names

Raw

A list of all historical employee shorthand names

Array of strings

member_follower_count

Raw

Number of profile followers

Integer

Identifiers



Skills

Data point

Processing

Description

Data type

member_skills

Enriched

List of employees' skills

Array of strings

Skills



Experience

Data point

Processing

Description

Data type

member_description

Raw

Job position description

String

company_id

Enriched

Identification key for the company associated with the employee's experience

Integer

member_job_title

Cleaned

Current job position title

String

is_decision_maker

Enriched

Indicates whether the employee is a decision-maker based on member_job_title

1 - Employee is marked as a decision-maker in the current role

0 - Employee is not marked as a decision-maker in the current role

Integer

member_job_description

Raw

Current job position description

String

member_headline

Raw

Job title found in the profile headline

String

member_generated_headline

Raw

A user-written headline that can be found in web search, also viewed and other publicly available spaces.

It serves the same purpose as the title but is derived from a different source, potentially providing more accurate and up-to-date profile information.

This field should be used in place title as it reflects the latest user activity.

String

total_experience_duration

Enriched

Summed up experience (displayed as years and months)

String

total_experience_duration_months

Enriched

Summed up employee experience (displayed as months)

Integer

Experience


The member_experience table is mapped with our historical data due to professional network hiding the work experience on certain employees' profiles.

Data point

Processing

Description

Data type

member_experience

-

Employee's work experience

Array of objects

company_id

Raw

Workplace (company) identifier in our database

Integer

date_from

Cleaned

Employment start date

String (date)

date_from_year

Cleaned

Employment start year

Integer

date_from_month

Cleaned

Employment start month

Integer

date_to

Cleaned

Employment end date

String (date)

date_to_year

Cleaned

Employment end year

Integer

date_to_month

Cleaned

Employment end month

Integer

company_url

Raw

Employee's workplace URL on professional network

String

company_name

Raw

Employer company



title

Raw

Job title

String

department

Enriched

Department the employee works in

String

management_level

Enriched

Employee's management level

String

description

Cleaned

Job description

String

order_in_profile

Raw

Record order as seen on the employee's profile

Integer

duration

Enriched

Employment duration

String (date)

duration_months

Cleaned

Employment duration in months

Integer

location

Cleaned

Job/workplace location

String

Experience



Data point

Processing

Description

Data type

member_department

Enriched

Departments derived from the member_job_title

String

member_management_level

Enriched

Management levels identified from the member_job_title

String

is_working

Enriched

Represents if the employee is currently working

0 - the employee is currently not working

1 - the employee is currently working

Boolean

Experience



Education

Data point

Processing

Description

Data type

member_education



Employee's education

Array of objects

major

Cleaned

Field of study

String

title

Cleaned

Educational institution

String

date_to

Cleaned

Graduation date

Integer

date_from

Cleaned

Enrolment date

Integer

institution_url

Cleaned

Institution's profile URL

String

description

Cleaned

Education description

String

activities_and_societies

Cleaned

Details about activities and societies

String

Education



Hidden collections

Data point

Description

Data type

is_hidden

Marks if the employee profile has a hidden education/experience collection. 0 - education/experience information was available at the time of profile scraping.

1 - education/experience information was not available at the time of profile scraping

Boolean

is_hidden + experience
is_hidden + education



Location

Data point

Processing

Description

Data type

member_location_raw_address

Cleaned

Raw address of the employee's location

String

member_location_country

Cleaned

Country of the employee's location

String

member_location_regions

Cleaned

Geographical regions within the employee's country

Array of strings

Location



Recommendations and connections

Data point

Processing

Description

Data type

member_recommendations

Cleaned

Employee recommendations

Array of objects

recommendation

Cleaned

Recommendation text

String

referee_name

Raw

Referee's name

String

referee_url

Raw

Referee's profile URL

String

recommendations_count

Cleaned

Number of received recommendations

Integer

connections_count

Raw

Number of employee's connections

Integer

Recommendations and connections



Languages

Data point

Processing

Description

Data type

member_languages



Employee's language knowledge

Array of objects

language

Cleaned

Language

String

proficiency

Cleaned

Language proficiency

String

order_in_profile

Raw

Record order in the section

Integer

Languages



Certifications

Data point

Processing

Description

Data type

member_certifications



Employee's certifications

Array of objects

title

Cleaned

Language

String

issuer

Cleaned

Language proficiency

String

credential_id

Cleaned

Record order in the section

Integer

certificate_url

Cleaned

Certificate URL

String

date_from

Cleaned

Issue date

String

date_to

Cleaned

Expiration date

String

issuer_url

Cleaned

Issuer profile URL

String

order_in_profile

Raw

Section record order

Integer

date_from_year

Cleaned

Issue year

Integer

date_from_month

Cleaned

Issue month

Integer

date_to_year

Cleaned

Expiration year

Integer

date_to_month

Cleaned

Expiration month

Integer

Certifications



Courses

Data point

Processing

Description

Data type

member_courses



Attended courses

Array of objects

organizer

Cleaned

Course organizer

String

title

Cleaned

Course title

String

order_in_profile

Raw

Record order in the section

Integer

Courses



Awards

Data point

Processing

Description

Data type

member_awards



Held awards

Array of objects

title

Cleaned

Award

String

issuer

Cleaned

Award issuer

String

description

Cleaned

Award description

String

date

Cleaned

Issue date

String

order_in_profile

Raw

Section record order

Integer

date_year

Cleaned

Issue year

Integer

date_month

Cleaned

Issue month

Integer

Awards



Activity

Data point

Processing

Description

Data type

member_activity



Interaction with posts on professional network

Array of objects

activity_url

Raw

Post URL

String

title

Cleaned

Post title

String

action

Cleaned

Interaction type

String

order_in_profile

Raw

Section record order

Integer

Activity