
Dictionary: Clean Employee Data

Request access to our full documentation

This is a simplified version of our documentation. If you want to access additional data samples, learn more about our cleaning and enrichment process, or explore the complete list of data sources we offer, contact our team to get access to more information.

Clean Employee Data provides high-quality, structured workforce data that is ready for immediate use. Our data is meticulously cleaned and enriched, enabling businesses to streamline operations, enhance decision-making, and optimize workforce analysis.

By leveraging Clean Employee Data, organizations can reduce engineering overhead, gain access to additional insights, and work with optimized data formats for improved efficiency. The data is available in JSONL, Parquet, and CSV formats, ensuring faster downloads and seamless integration. With flexible retrieval options, including flat file downloads and API access, businesses in sales tech, HR intelligence, and investment sectors can efficiently access the workforce insights they need.

Clean Employee Data is derived from our Base Employee Data.

## Overview

The data points are separated into collections to visualize the data better:

- Metadata
- Identifiers
- Skills
- Experience
- Education
- Hidden collections
- Location
- Recommendations and connections
- Languages
- Certifications
- Courses
- Awards
- Activity
- Organizations
- Patents
- Publications
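As a minimal sketch of working with the JSONL format mentioned above, the snippet below reads one JSON record per line using only Python's standard library. The file name `clean_employee_sample.jsonl`, the helper `read_jsonl`, and the second record are hypothetical, and the key names are written exactly as they appear in this dictionary; the delivered files may spell them differently.

```python
import json
import tempfile
from pathlib import Path


def read_jsonl(path):
    """Yield one dict per non-empty line of a JSONL file."""
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:
                yield json.loads(line)


# Hypothetical two-record file, written here only so the sketch is runnable.
sample_path = Path(tempfile.gettempdir()) / "clean_employee_sample.jsonl"
sample_path.write_text(
    '{"member id": 4290, "member is deleted": 0}\n'
    '{"member id": 4291, "member is deleted": 1}\n',
    encoding="utf-8",
)

records = list(read_jsonl(sample_path))

# Keep only profiles flagged as publicly available (value 0).
available = [r for r in records if r.get("member is deleted") == 0]
print(len(records), len(available))
```

Reading line by line keeps memory use flat for large files, which is the usual reason to prefer JSONL over a single JSON array.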
## Metadata

| Data point | Processing | Description | Data type |
|---|---|---|---|
| `member last updated` | Cleaned | Date the record was last updated | String |
| `member is deleted` | Raw | Indicates whether the profile was accessible: 1 – deleted or private; 0 – publicly available | Integer |

### Data sample

    "member last updated": "2023 07 29",
    "member is deleted": 0

### Cleaning actions

| Data point | Cleaning action |
|---|---|
| `member last updated` | Value is converted to the `yyyy mm dd` format |

## Identifiers

| Data point | Processing | Description | Data type |
|---|---|---|---|
| `member id` | Raw | Identification key in our database | Integer |
| `member websites professional network` | Raw | Professional network profile URL | String |
| `member picture url` | Raw | Profile picture URL | String |
| `member full name` | Cleaned | Full name | String |
| `member name first` | Raw | First name | String |
| `member name middle` | Enriched | Middle name | String |
| `member name last` | Enriched | Last name | String |
| `member shorthand names` | Raw | A list of all historical employee shorthand names | Array of strings |
| `member follower count` | Raw | Number of profile followers | Integer |

### Data sample

    "member id": 4290,
    "member full name": "john leonardo doe",
    "member name first": "john",
    "member name middle": "leonardo",
    "member name last": "doe",
    "member websites professional network": "https //www professional network com/in/john leonardo doe",
    "member picture url": "https //static lnk com/aero v1/sc/h/9c8pery4andzj6ohjkjp54ma2",
    "member shorthand names": [
        "john lenoardo doe"
    ],
    "member follower count": 445,

### Cleaning actions

| Data point | Cleaning action |
|---|---|
| `member full name` | Special characters and emojis are removed; any words that follow a comma or appear in parentheses are removed; titles (preceding or following the name) are removed |
| `member name middle` | Parsed from `member full name` |
| `member name last` | Parsed from `member full name` |

## Skills

| Data point | Processing | Description | Data type |
|---|---|---|---|
| `member skills` | Enriched | List of employees' skills | Array of strings |

### Data sample

    "member skills": [
        "creative",
        "design",
        "electronics",
        "photography",
        "programming"
    ]

### Enriching action

| Data point | Enriching action |
|---|---|
| `member skills` | Enriched with our ML model from different description fields |

## Experience

| Data point | Processing | Description | Data type |
|---|---|---|---|
| `member description` | Raw | Job position description | String |
| `company id` | Enriched | Identification key for the company associated with the employee's experience | Integer |
| `member job title` | Cleaned | Current job position title | String |
| `is decision maker` | Enriched | Indicates whether the employee is a decision maker based on `member job title`: 1 – employee is marked as a decision maker in the current role; 0 – employee is not marked as a decision maker in the current role | Integer |
| `member job description` | Raw | Current job position description | String |
| `member headline` | Raw | Job title found in the profile headline | String |
| `member generated headline` | Raw | A user-written headline that can be found in web search, "Also viewed", and other publicly available spaces. It serves the same purpose as the title but is derived from a different source, potentially providing more accurate and up-to-date profile information. This field should be used in place of the title, as it reflects the latest user activity | String |
| `total experience duration` | Enriched | Summed-up experience (displayed as years and months) | String |
| `total experience duration months` | Enriched | Summed-up employee experience (displayed as months) | Integer |

### Data sample

    "member description": "results driven professional with extensive experience in supervisory roles, business analysis, project management, and financial analysis skilled in managing enterprise wide implementations of healthcare information systems, with expertise in gathering and defining client data requirements ",
    "company id": 1111111,
    "member job title": "senior consultant",
    "is decision maker": 1,
    "member job description": "senior business analyst @ company123",
    "member headline": "healthcare consultant",
    "member generated headline": "healthcare consultant at company 123",
    "total experience duration": "2 years 4 months",
    "total experience duration months": 28,

### Cleaning and enriching actions

| Data point | Cleaning/enriching action |
|---|---|
| `company id` | Company ID taken from an active experience record in `member experience` |
| `member job title` | Special characters are removed |
| `total experience duration` | Values converted to readable text |
| `total experience duration months` | Field aggregated from duration values |

The `member experience` table is mapped with our historical data because the professional network hides the work experience on certain employees' profiles.

| Data point | Processing | Description | Data type |
|---|---|---|---|
| `member experience` | | Employee's work experience | Array of objects |
| `company id` | Raw | Workplace (company) identifier in our database | Integer |
| `date from` | Cleaned | Employment start date | String (date) |
| `date from year` | Cleaned | Employment start year | Integer |
| `date from month` | Cleaned | Employment start month | Integer |
| `date to` | Cleaned | Employment end date | String (date) |
| `date to year` | Cleaned | Employment end year | Integer |
| `date to month` | Cleaned | Employment end month | Integer |
| `company url` | Raw | Employee's workplace URL on the professional network | String |
| `company name` | Raw | Employer company | String |
| `title` | Raw | Job title | String |
| `department` | Enriched | Department the employee works in | String |
| `management level` | Enriched | Employee's management level | String |
| `description` | Cleaned | Job description | String |
| `order in profile` | Raw | Record order as seen on the employee's profile | Integer |
| `duration` | Enriched | Employment duration | String (date) |
| `duration months` | Cleaned | Employment duration in months | Integer |
| `location` | Cleaned | Job/workplace location | String |

### Data sample

    "member experience": [
        {
            "company id": 1774347,
            "date from": "2015 10 01",
            "date from year": 2015,
            "date from month": 10,
            "date to": "2016 09 01",
            "date to year": 2016,
            "date to month": 9,
            "company name": "company123, ltd ",
            "company url": "https //www professional network com/company/company123",
            "title": "senior analyst",
            "description": "financialconsulting for a leading manufacturing organizations ",
            "order in profile": 5,
            "duration": "1 year",
            "duration months": 12,
            "department": "project management",
            "management level": "senior",
            "location": "jacksonville, florida area"
        }
    ],

### Cleaning and enriching actions

| Data point | Cleaning/enriching action |
|---|---|
| `date from` | Value is converted to the `yyyy mm dd` format |
| `date from year`, `date from month` | Year and month values are extracted from the `date from` value and converted to integers |
| `date to` | Value is converted to the `yyyy mm dd` format |
| `date to year`, `date to month` | Year and month values are extracted from the `date to` value and converted to integers |
| `department` | Enriched with our ML model from the `title` value |
| `management level` | Enriched with our ML model from the `member job title` value |
| `description` | Values `["none"; "unknown"; "nan"; "nan"; "na"; "null"; "null"; "null"; " "; " "]` are replaced with the value `None`; the value is replaced with `None` if the description is shorter than 3 characters; text styling tags are removed; multiple spaces are replaced with single ones |
| `duration` | Derived from the `date from` and `date to` values |
| `duration months` | Duration converted into a numerical value |
| `location` | Values `["none"; "unknown"; "nan"; "nan"; "na"; "null"; "null"; "null"; " "; " "]` are replaced with the value `None` |

| Data point | Processing | Description | Data type |
|---|---|---|---|
| `member department` | Enriched | Departments derived from the `member job title` | String |
| `member management level` | Enriched | Management levels identified from the `member job title` | String |
| `is working` | Enriched | Represents whether the employee is currently working: 0 – the employee is currently not working; 1 – the employee is currently working | Boolean |

### Data sample

    "member department": "project management",
    "member management level": "senior",
    "is working": 1

### Enriching actions

| Data point | Cleaning/enriching action |
|---|---|
| `member department` | Enriched with our ML model from the `member job title` value |
| `member subdepartment` | Enriched with our ML model from the `member job title` value |
| `member management level` | Enriched with our ML model from the `member job title` value |
| `is working` | Based on the `date to` and `date from` values of the employee's experience |

## Education

| Data point | Processing | Description | Data type |
|---|---|---|---|
| `member education` | | Employee's education | Array of objects |
| `major` | Cleaned | Field of study | String |
| `title` | Cleaned | Educational institution | String |
| `date to` | Cleaned | Graduation date | String |
| `date from` | Cleaned | Enrolment date | String |
| `institution url` | Cleaned | Institution's profile URL | String |
| `description` | Cleaned | Education description | String |
| `activities and societies` | Cleaned | Details about activities and societies | String |

### Data sample

    "member education": [
        {
            "major": "associate's degree, business administration and management",
            "title": "business college",
            "date to": "2017",
            "date from": "2015",
            "institution url": "https //www professional network com/school/business college",
            "description": "attended business college from 2015 to 2017",
            "activities and societies": "activities and societies phi theta kappa"
        }
    ],

### Cleaning actions

| Data point | Cleaning action |
|---|---|
| `title` | Values `["none"; "unknown"; "nan"; "nan"; "na"; "null"; "null"; "null"; " "; " "]` are replaced with the value `None`; values are capitalized |
| `major` | Values `["none"; "unknown"; "nan"; "nan"; "na"; "null"; "null"; "null"; " "; " "]` are replaced with the value `None` |
| `date from` | Value is converted to the `yyyy` format |
| `date to` | Value is converted to the `yyyy` format |
| `institution url` | Values `["none"; "unknown"; "nan"; "nan"; "na"; "null"; "null"; "null"; " "; " "]` are replaced with the value `None` |
| `description` | Values `["none"; "unknown"; "nan"; "nan"; "na"; "null"; "null"; "null"; " "; " "]` are replaced with the value `None`; text styling tags are removed; multiple spaces are replaced with single ones |
| `activities and societies` | Values `["none"; "unknown"; "nan"; "nan"; "na"; "null"; "null"; "null"; " "; " "]` are replaced with the value `None`; text styling tags are removed; multiple spaces are replaced with single ones |

## Hidden collections

| Data point | Description | Data type |
|---|---|---|
| `is hidden` | Marks whether the employee profile has a hidden education/experience collection: 0 – education/experience information was available at the time of profile scraping; 1 – education/experience information was not available at the time of profile scraping | Number (integer) |

### Data sample

`is hidden` + experience:

    "is hidden": 0,
    "member experience": [
        {
            "company id": 23124977,
            "date from": "2020 02 01",
            "date to": "2020 09 01"
        },
        {
            "company id": 3140930,
            "date from": "2023 06 01",
            "date to": null
        }
    ]

`is hidden` + education:

    "member education": [
        {
            "title": "harvard law school",
            "major": null,
            "date from": null,
            "date to": null
        }
    ],
    "is working": 1,

## Location

| Data point | Processing | Description | Data type |
|---|---|---|---|
| `member location raw address` | Cleaned | Raw address of the employee's location | String |
| `member location country` | Cleaned | Country of the employee's location | String |
| `member location regions` | Cleaned | Geographical regions within the employee's country | String |

### Data sample

    "member location raw address": "nashville metropolitan area united states",
    "member location country": "united states",
    "member location regions": "northern america",

### Cleaning actions

| Data point | Cleaning action |
|---|---|
| `member location raw address` | Values `["none"; "unknown"; "nan"; "nan"; "na"; "null"; "null"; "null"; " "; " "]` are replaced with the value `None`; trailing special characters are trimmed; the value is set to `None` if it is shorter than three characters; the value of `member location country` is added at the end of the string |
| `member location country` | Values `["none"; "unknown"; "nan"; "nan"; "na"; "null"; "null"; "null"; " "; " "]` are replaced with the value `None` |

## Recommendations and connections

| Data point | Processing | Description | Data type |
|---|---|---|---|
| `member recommendations` | Cleaned | Employee recommendations | Array of objects |
| `recommendation` | Cleaned | Recommendation text | String |
| `referee name` | Raw | Referee's name | String |
| `referee url` | Raw | Referee's profile URL | String |
| `member recommendations count` | Cleaned | Number of received recommendations | Integer |
| `member connections count` | Raw | Number of employee's connections | Integer |

### Data sample

    "member recommendations": [
        {
            "recommendation": "“john was a great asset in collaborating the tasks in different departments to produce the same goal he was great at providing advice and asking questions to avoid even a tiny error during the process great to work with him!”",
            "referee name": "marry doe",
            "referee url": "www professional network com/in/marry doe",
            "order in profile": 1
        }
    ],
    "member recommendations count": 1,
    "member connections count": 15535,

### Cleaning actions

| Data point | Cleaning action |
|---|---|
| `member recommendations` | Deleted rows are filtered out |
| `recommendation` | Values `["none"; "unknown"; "nan"; "nan"; "na"; "null"; "null"; "null"; " "; " "]` are replaced with the value `None`; the value is set to `None` if it is shorter than three characters; text styling tags are removed; multiple spaces are replaced with single ones; empty recommendations are filtered out |
| `member recommendations count` | Values `["none"; "unknown"; "nan"; "nan"; "na"; "null"; "null"; "null"; " "; " "]` are replaced with the value `None`; `None` values are replaced with 0 and converted to an integer |

## Languages

| Data point | Processing | Description | Data type |
|---|---|---|---|
| `member languages` | | Employee's language knowledge | Array of objects |
| `language` | Cleaned | Language | String |
| `proficiency` | Cleaned | Language proficiency | String |
| `order in profile` | Raw | Record order in the section | Integer |

### Data sample

    "member languages": [
        {
            "language": "english",
            "proficiency": "intermediate",
            "order in profile": 1
        }
    ],

### Cleaning actions

| Data point | Cleaning action |
|---|---|
| `language` | Values `["none"; "unknown"; "nan"; "nan"; "na"; "null"; "null"; "null"; " "; " "]` are replaced with the value `None` |
| `proficiency` | Values `["none"; "unknown"; "nan"; "nan"; "na"; "null"; "null"; "null"; " "; " "]` are replaced with the value `None` |

## Certifications

| Data point | Processing | Description | Data type |
|---|---|---|---|
| `member certifications` | | Employee's certifications | Array of objects |
| `title` | Cleaned | Certification title | String |
| `issuer` | Cleaned | Certification issuer | String |
| `credential id` | Cleaned | Credential ID | String |
| `certificate url` | Cleaned | Certificate URL | String |
| `date from` | Cleaned | Issue date | String |
| `date to` | Cleaned | Expiration date | String |
| `issuer url` | Cleaned | Issuer profile URL | String |
| `order in profile` | Raw | Section record order | Integer |
| `date from year` | Cleaned | Issue year | Integer |
| `date from month` | Cleaned | Issue month | Integer |
| `date to year` | Cleaned | Expiration year | Integer |
| `date to month` | Cleaned | Expiration month | Integer |

### Data sample

    "member certifications": [
        {
            "title": "data analysis certification b4",
            "issuer": "data school123",
            "credential id": "1345",
            "certificate url": "http //data analysis certification school123 com/verify?trk=public profile certification title",
            "date from": "2021 06 01",
            "date to": "2024 06 01",
            "issuer url": "https //www professional network com/company/data school 123",
            "order in profile": 1,
            "date from year": 2021,
            "date from month": 6,
            "date to year": 2024,
            "date to month": 6
        }
    ],

### Cleaning actions

| Data point | Cleaning action |
|---|---|
| `title` | Values `["none"; "unknown"; "nan"; "nan"; "na"; "null"; "null"; "null"; " "; " "]` are replaced with the value `None` |
| `issuer` | Values `["none"; "unknown"; "nan"; "nan"; "na"; "null"; "null"; "null"; " "; " "]` are replaced with the value `None` |
| `date from` | Value is converted to the `yyyy mm dd` format |
| `date to` | Value is converted to the `yyyy mm dd` format |
| `issuer url` | Values `["none"; "unknown"; "nan"; "nan"; "na"; "null"; "null"; "null"; " "; " "]` are replaced with the value `None` |
| `date from year`, `date to year` | Year value from the date is converted to an integer |
| `date from month`, `date to month` | Month value from the date is converted to an integer |

## Courses

| Data point | Processing | Description | Data type |
|---|---|---|---|
| `member courses` | | Attended courses | Array of objects |
| `organizer` | Cleaned | Course organizer | String |
| `title` | Cleaned | Course title | String |
| `order in profile` | Raw | Record order in the section | Integer |

### Data sample

    "member courses": [
        {
            "organizer": "it academy",
            "title": "microsoft certified excel expert",
            "order in profile": 1
        }
    ],
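The text-cleaning rules that recur in the cleaning actions (placeholder values replaced with none, styling tags removed, repeated spaces collapsed, very short values dropped) can be sketched as follows. This is an illustration under stated assumptions, not the actual pipeline: `clean_text` is a hypothetical helper, placeholder matching is assumed to be case-insensitive, and the tag pattern is only a guess at what "text styling tags" covers.

```python
import re

# Placeholder strings listed in the cleaning actions. Values are compared
# after stripping and lowercasing, so blank entries reduce to "".
PLACEHOLDERS = {"none", "unknown", "nan", "na", "null", ""}

TAG_RE = re.compile(r"<[^>]+>")      # assumed shape of a styling tag
SPACES_RE = re.compile(r"\s{2,}")    # runs of two or more whitespace characters


def clean_text(value, min_length=None):
    """Return a cleaned string, or None when the value carries no information."""
    if value is None:
        return None
    text = TAG_RE.sub("", str(value))
    text = SPACES_RE.sub(" ", text).strip()
    if text.lower() in PLACEHOLDERS:
        return None
    if min_length is not None and len(text) < min_length:
        return None
    return text


print(clean_text("NaN"))
print(clean_text("<b>led  a   team</b>"))
print(clean_text("ok", min_length=3))
```

The optional `min_length` argument mirrors the "shorter than three characters" rule, which the dictionary applies only to some fields.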
### Cleaning actions

| Data point | Cleaning action |
|---|---|
| `organizer` | Values `["none"; "unknown"; "nan"; "nan"; "na"; "null"; "null"; "null"; " "; " "]` are replaced with the value `None` |
| `title` | Values `["none"; "unknown"; "nan"; "nan"; "na"; "null"; "null"; "null"; " "; " "]` are replaced with the value `None` |

## Awards

| Data point | Processing | Description | Data type |
|---|---|---|---|
| `member awards` | | Held awards | Array of objects |
| `title` | Cleaned | Award | String |
| `issuer` | Cleaned | Award issuer | String |
| `description` | Cleaned | Award description | String |
| `date` | Cleaned | Issue date | String |
| `order in profile` | Raw | Section record order | Integer |
| `date year` | Cleaned | Issue year | Integer |
| `date month` | Cleaned | Issue month | Integer |
| `date day` | Cleaned | Issue day | Integer |

### Data sample

    "member awards": [
        {
            "title": "certified in inventory management",
            "issuer": "school of operations management",
            "description": "certification in production and inventory management",
            "date": "2001 01 01",
            "order in profile": 5,
            "date year": 2001,
            "date month": 1,
            "date day": 1
        }
    ],

### Cleaning actions

| Data point | Cleaning action |
|---|---|
| `title` | Values `["none"; "unknown"; "nan"; "nan"; "na"; "null"; "null"; "null"; " "; " "]` are replaced with the value `None`; values are capitalized |
| `issuer` | Values `["none"; "unknown"; "nan"; "nan"; "na"; "null"; "null"; "null"; " "; " "]` are replaced with the value `None` |
| `date` | Value is converted to the `yyyy mm dd` format |
| `date year` | Year value from the date is converted to an integer |
| `date month` | Month value from the date is converted to an integer |

## Activity

| Data point | Processing | Description | Data type |
|---|---|---|---|
| `member activity` | | Interaction with posts on the professional network | Array of objects |
| `activity url` | Raw | Post URL | String |
| `title` | Cleaned | Post title | String |
| `action` | Cleaned | Interaction type | String |
| `order in profile` | Raw | Section record order | Integer |

### Data sample

    "member activity": [
        {
            "activity url": "https //www professional network com/posts/company123 incorporated healthcare laborproductivity activity 7161365554581172224 xupz",
            "title": "company 123 is excited to introduce our team spotlight featuring john doe! @health systems, ltd #healthcare #laborproductivity",
            "action": "liked by",
            "order in profile": 1
        }
    ],

### Cleaning actions

| Data point | Cleaning action |
|---|---|
| `title` | Values `["none"; "unknown"; "nan"; "nan"; "na"; "null"; "null"; "null"; " "; " "]` are replaced with the value `None`; text styling tags are removed; multiple spaces are replaced with single ones |

## Organizations

| Data point | Description | Data type |
|---|---|---|
| `member organizations` | Memberships in organizations | Array of structs |
| `organization` | Organization title | String |
| `position` | Position in the organization | String |
| `description` | Description of the activity/experience in the organization | String |
| `date from` | Membership start date | String |
| `date from year` | Membership start year | Integer |
| `date from month` | Membership start month | Integer |
| `date to` | Membership end date | String |
| `date to year` | Membership end year | Integer |
| `date to month` | Membership end month | Integer |
| `order in profile` | The exact position of the organization in the profile | Integer |

### Data sample

    "member public profile id": "123456789",
    "member organizations": [
        {
            "organization": "example organization",
            "position": "lead software engineer",
            "description": "led a team of developers providing great services ",
            "date from": "2019 06",
            "date from year": 2019,
            "date from month": 6,
            "date to": "2023 09",
            "date to year": 2023,
            "date to month": 9,
            "order in profile": 1
        }
    ],

## Patents

The `full name`, `profile url`, and first `order in profile` rows describe the objects inside `inventors`.

| Data point | Description | Data type |
|---|---|---|
| `member patents` | Authored patents | Array of structs |
| `title` | Patent title | String |
| `status` | Patent status | String |
| `inventors` | Inventors of the patent | Array of structs |
| `full name` | Full name of the inventor | String |
| `profile url` | Profile URL | String |
| `order in profile` | Order in profile | Integer |
| `date` | Patent filing date | String |
| `date year` | Filing year | Integer |
| `date month` | Filing month | Integer |
| `date day` | Filing day | Integer |
| `patent url` | Patent URL | String |
| `description` | Patent description | String |
| `patent or application number` | Patent or application number | String |
| `order in profile` | The exact position of the patent in the profile | Integer |
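The derived `date year`, `date month`, and `date day` fields can be illustrated with a short sketch. It assumes the date string holds its numeric parts in year, month, day order, separated by any non-digit characters, and that partial dates simply omit the trailing parts. `split_date` is a hypothetical helper, not part of any dataset tooling.

```python
import re


def split_date(value):
    """Return (year, month, day) as integers; missing parts become None."""
    if not value:
        return (None, None, None)
    parts = [int(p) for p in re.findall(r"\d+", value)][:3]
    parts += [None] * (3 - len(parts))
    return tuple(parts)


print(split_date("2001 01 01"))   # full date
print(split_date("2019 06"))      # year and month only
print(split_date(None))           # missing date
```

Splitting on digit runs rather than a fixed separator keeps the sketch independent of how the delivered files punctuate dates.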
### Data sample

    "member patents": [
        {
            "title": "data synchronization system",
            "status": "granted",
            "inventors": [
                {
                    "full name": "john doe",
                    "profile url": "https //www professional network com/profile/johndoe",
                    "order in profile": 1
                },
                {
                    "full name": "jane smith",
                    "profile url": "https //www professional network com/profile/janesmith",
                    "order in profile": 2
                }
            ],
            "date": "2022 01 01",
            "date year": 2022,
            "date month": 1,
            "date day": 1,
            "patent url": "https //wwww patents example com/us1234567",
            "description": "a method for efficient synchronization of distributed systems in real time environments ",
            "patent or application number": "us1234567b2",
            "order in profile": 1
        }
    ]

## Publications

The `full name`, `profile url`, and first `order in profile` rows describe the objects inside `authors`.

| Data point | Description | Data type |
|---|---|---|
| `member publications` | Authored publications | Array of structs |
| `title` | Publication title | String |
| `publisher` | Publisher name | String |
| `date` | Publication release date | String |
| `date year` | Release year | Integer |
| `date month` | Release month | Integer |
| `date day` | Release day | Integer |
| `description` | Publication description | String |
| `authors` | Authors of the publication | Array of structs |
| `full name` | Full name of the author | String |
| `profile url` | Profile URL | String |
| `order in profile` | Order in the profile | Integer |
| `publication url` | Publication website URL | String |
| `order in profile` | The exact position of the publication in the profile | Integer |

### Data sample

    "member publications": [
        {
            "title": "microservices architecture in cloud environments",
            "publisher": "journal of software systems",
            "date": "2024 08 01",
            "date year": 2024,
            "date month": 8,
            "date day": 1,
            "description": "an in depth analysis of architectural patterns and scalability challenges in cloud native microservices ",
            "authors": [
                {
                    "full name": "john doe",
                    "profile url": "https //www professional network com/profile/johndoe",
                    "order in profile": 1
                }
            ],
            "publication url": "https //www publications example com/microservices architecture",
            "order in profile": 1
        }
    ]
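As a closing illustration, the derived experience fields described earlier (`duration months`, the readable duration text, and `is working`) can be approximated from `date from` and `date to`. This is a sketch under assumptions, not the documented method: months are counted inclusively because that reproduces the Experience sample (start 2015 10, end 2016 09, 12 months), an open-ended record is measured up to a caller-supplied `as_of` date, and `is working` is taken to mean that at least one record has a start date but no end date. All function names are hypothetical.

```python
import re
from datetime import date


def year_month(value):
    """Extract (year, month) from a date string such as '2015 10 01'."""
    year, month = [int(p) for p in re.findall(r"\d+", value)][:2]
    return year, month


def duration_months(date_from, date_to, as_of):
    """Months between two dates, counting both the first and the last month."""
    start = year_month(date_from)
    end = year_month(date_to) if date_to else (as_of.year, as_of.month)
    return (end[0] - start[0]) * 12 + (end[1] - start[1]) + 1


def readable(months):
    """Render a month count as text, e.g. 28 -> '2 years 4 months'."""
    years, rest = divmod(months, 12)
    parts = []
    if years:
        parts.append(f"{years} year" + ("s" if years != 1 else ""))
    if rest:
        parts.append(f"{rest} month" + ("s" if rest != 1 else ""))
    return " ".join(parts) or "0 months"


def is_working(experience):
    """Return 1 if any record has a start date but no end date, else 0."""
    return int(any(e.get("date from") and not e.get("date to") for e in experience))


months = duration_months("2015 10 01", "2016 09 01", as_of=date(2024, 1, 1))
print(months, readable(months))
```

Passing `as_of` explicitly, rather than reading the system clock, keeps the calculation reproducible for a given data snapshot.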