Data Dictionary: Clean Company API
Data dictionary for data retrieved using Clean Company API endpoints in Database API.
This data dictionary shows all available data points, explains their values, and provides data samples from the Clean Company API data.
The data provided in the samples is strictly intended for illustrative purposes, allowing you to visualize its appearance and format.
Data point | Processing | Description | Data type |
---|---|---|---|
last_updated | Cleaned | Record update date | String (date) |
professional_network_source_id | Raw | Record identification key assigned by Professional Network | String |
created_at | Cleaned | Time and date when we created the company record | String (date) |
Data point | Processing | Description | Data type |
---|---|---|---|
id | Raw | Company ID in our database | Number (integer) |
name | Cleaned | Company name | String |
logo | Cleaned | BASE64 encoded JPEG image of the company's logo | String |
ticker | Cleaned | Company's stock ticker | String |
exchange | Cleaned | Company's stock exchange | String |
Data point | Processing | Description | Data type |
---|---|---|---|
industry | Cleaned | Industry the company operates in | String |
type | Cleaned | Company type | String |
founded | Cleaned | Company founding year | String |
size_range | Cleaned | Company size range | String |
size_employees_count | Enriched | The number of employees working in the company | Number (integer) |
followers | Cleaned | The number of company followers | Number (integer) |
description | Cleaned | Company description | String |
specialities | Raw | Company specialties | Array of strings |
metadata_title | Enriched | Company title parsed from additional sources | String |
metadata_description | Enriched | Company description parsed from additional sources | String |
enriched_summary | Enriched | LLM enriched company summary | String |
enriched_category | Enriched | Company category assigned with LLM | String |
enriched_keywords | Enriched | LLM enriched company keywords | Array of strings |
enriched_b2b | Enriched | Marks if the company offers B2B products/services enriched with the help of LLM 1 – B2B company 0 – not B2B company | Number (double) |
Data point | Processing | Description | Data type |
---|---|---|---|
pricing_available | Enriched | Marks if the company service pricing is available online | Boolean |
free_trial_available | Enriched | Marks if the company offers a free trial of their services | Boolean |
demo_available | Enriched | Marks if the company offers a demo | Boolean |
is_downloadable | Enriched | Marks if the company offers a downloadable file/service | Boolean |
mobile_apps_exist | Enriched | Marks if the company has mobile apps for their service | Boolean |
online_reviews_exist | Enriched | Marks if the company has any online reviews | Boolean |
api_docs_exist | Enriched | Marks if the company has API docs published | Boolean |
Data point | Processing | Description | Data type |
---|---|---|---|
phone_numbers | Enriched | Publicly available company phone number | Array of strings |
emails | Enriched | Publicly available company email address | Array of strings |
Data point | Processing | Description | Data type |
---|---|---|---|
websites_main_original | Raw | Company website URL | String |
websites_main | Cleaned | Cleaned and resolved company website URL | String |
websites_resolved | Enriched | Resolved company website URL | String |
websites_facebook | Enriched | Company Facebook URL | String |
websites_twitter | Enriched | Company Twitter URL | String |
websites_professional_network | Raw | Company professional network URL | String |
websites_professional_network_canonical | Raw | Canonical professional network profile URL | String |
social_discord_urls | Enriched | Company discord profile/channel | Array of strings |
social_facebook_urls | Enriched | Company Facebook page | Array of strings |
social_instagram_urls | Enriched | Company Instagram page | Array of strings |
social_professional_network_urls | Enriched | Company professional network profile | Array of strings |
social_pinterest_urls | Enriched | Company Pinterest page | Array of strings |
social_tiktok_urls | Enriched | Company TikTok profile | Array of strings |
social_twitter_urls | Enriched | Company Twitter profile | Array of strings |
social_x_urls | Enriched | Company X profile | Array of strings |
social_youtube_urls | Enriched | Company YouTube channel/profile | Array of strings |
social_github_urls | Enriched | Company Github page/profile | Array of strings |
social_reddit_urls | Enriched | Company Reddit profile | Array of strings |
Data point | Processing | Description | Data type |
---|---|---|---|
location_hq_country | Cleaned | Company headquarters country | String |
location_hq_raw_address | Cleaned | Detailed company location | String |
location_hq_regions | Enriched | Geographical region(s) the company is associated with based on the company_location_hq_country value. | String |
locations_full | Raw | Full company location information | Array of objects |
location_adress | Raw | Full location of the company HQ | String |
is_primary | Raw | Marks if the listed location is the primary | Boolean |
Data point | Processing | Description | Data type |
---|---|---|---|
funding_rounds | | Information on company funding (rounds) | Array of objects |
last_round_investors_count | Cleaned | The number of investors that participated in the last funding round | Number (integer) |
total_rounds_count | Cleaned | Total number of funding rounds | Number (integer) |
last_round_type | Cleaned | Last funding round type | String |
last_round_date | Cleaned | Last funding round date | String |
last_round_money_raised | Cleaned | Amount of money raised during the last funding round | number (integer) |
financial_website_url | Raw | Last funding round financial website URL | String |
Data point | Processing | Description | Data type |
---|---|---|---|
technologies | - | Data type changed from array of strings to array of structs | Array of structs |
technology | Enriched | Technology name | String |
first_verified_at | Cleaned | Date this technology was first assigned to the company | String (date) |
last_verified_at | Cleaned | Date this technology was last assigned to the company | String (date) |
Data point | Processing | Description | Data type |
---|---|---|---|
expired_domain | Enriched | Marks if the company_websites_main_originalURL redirects to a domain dealer | Number (integer) |
unique_domain | Enriched | Marks if only this company has the right to have this unique domain, e.g., company_websites_main: https://ibm.com | Number (integer) |
unique_website | Enriched | Marks if only this company has a unique website but not necessarily a unique domain, e.g., company_websites_main: https://ibm.com/generation | Number (integer) |
Data point | Processing | Description | Data type |
---|---|---|---|
updates | | Company posts and related details | Array of objects |
urn | Raw | String-based identifier | String |
followers | Raw | Number of followers | String |
date | Raw | Post publish date (e.g., 1 month ago) | String |
description | Raw | Published text Note: may contain control characters | String |
reactions_count | Raw | Number of reactions on the post | Integer |
comments_count | Raw | Number of comments on the post | Integer |
reshared_post_author | Raw | Reshared post author | String |
reshared_post_author_url | Raw | Author's profile URL | String |
reshared_post_author_headline | Raw | Author's headline | String |
reshared_post_description | Raw | Reshared post text | String |
reshared_post_followers | Raw | The number of followers of the reshared post author | Integer |
reshared_post_date | Raw | Date the reshared post was published (e.g., 1 month ago) | String |