Data Dictionary
Contains explanations and examples of all data fields available in the Glassdoor Companies dataset.
All personal/company information mentioned within this context is entirely fictional and is solely intended for illustrative purposes.
The data points in the example snippets have been rearranged for better grouping. To see where a specific data point stands, check the full data sample here.
Data point | Description | Data type |
---|---|---|
meta | Contains metadata about the record | Object |
created_at_date | Date when the record was initially scraped | Array of numbers (integer) |
created_at_timestamp | Unix timestamp for when the record was first scraped | Float |
updated_at_date | Date when the record was last updated | Array of numbers (integer) |
updated_at_timestamp | Unix timestamp for when the record was last updated | Float |
version_id | Identifier for the dataset version | String |
source | Source from which the record was obtained | String |
object | Data object type | String |
is_deleted | Indicates if the record is available on Glassdoor | Boolean |
See a snippet of the dataset for reference:
Data point | Description | Data type |
---|---|---|
doc | Dataset starting point | Object |
source_id | Record identifier on Glassdoor | String |
main_source_id | Indicates the primary company profile when there are multiple profiles for different company branches | String |
id | Record identifier in our database | String |
See a snippet of the dataset for reference:
Data point | Description | Data type |
---|---|---|
url | Profile URL | String |
image | Logo URL | String |
description | Description Note: contains control characters | String |
mission | Mission statement | String |
See a snippet of the dataset for reference:
Data point | Description | Data type |
---|---|---|
location | Headquarters location | String |
name | Name | String |
type | Type | String |
founded | Founding year | Integer |
industry | Associated industry | String |
employee_count | Size | String |
See a snippet of the dataset for reference:
Data point | Description | Data type |
---|---|---|
ceo | Full name of the company's CEO | String |
ceo_approval_count | Approval rate of the CEO | Float |
See a snippet of the dataset for reference:
Data point | Description | Data type |
---|---|---|
revenue | Revenue | String |
See a snippet of the dataset for reference:
Data point | Description | Data type |
---|---|---|
website | Website URL | String |
contact_info | Social media profile URLs | Object |
Twitter profile URL | String | |
Facebook page URL | String |
See a snippet of the dataset for reference:
Data point | Description | Data type |
---|---|---|
job_count | Number of active job postings | Integer |
salary_count | Number of company salary posts | Integer |
benefit_count | Number of employee comments/ratings on the company benefits | Integer |
review_count | Number of employee reviews | Integer |
interview_count | Number of employee comments on job interviews | Integer |
See a snippet of the dataset for reference:
Data point | Description | Data type |
---|---|---|
rating | Ratings on Glassdoor | Object |
aggregated | Overall rating | Float |
See a snippet of the dataset for reference:
Data point | Description | Data type |
---|---|---|
star_distribution | Count of each star rating given in the reviews | Object |
1 - 5 | Number of reviews that gave one to five stars to the company | Integer |
percentage_distribution | Percentage distribution of each star rating in the reviews | Object |
1 - 5 | Percentage distribution by the total number of stars given in the reviews | Float |
See a snippet of the dataset for reference:
Data point | Description | Data type |
---|---|---|
rating | Detailed ratings Note: optional ratings that might not be visible in the data. | Object |
culture_values | Average rating for the company's culture and values | Float |
career_opportunities | Average rating for the career opportunities in the company | Float |
compensation_benefits | Average rating for the compensation and benefits in the company | Float |
senior_management | Average rating for the company's senior management | Float |
work_life_balance | Average rating for the work-life balance in the company | Float |
diversity_inclusion | Average rating for diversity and inclusion in the company | Float |
biz_outlook | Percentage of employee reviews indicating a positive business outlook | Float |
ceo | Percentage of employee reviews that approve of CEO | Float |
recommend | Percentage of employee reviews that recommend to a friend | Float |
See a snippet of the dataset for reference: