# Dictionary: Clean Company Data

{% hint style="success" %}

#### Request access to our full documentation

This is a simplified version of our documentation. If you want to:

* Access data samples
* Learn more about the cleaning and enrichment actions
* Explore the complete list of data sources we offer

<a href="https://coresignal.com/contact-us/?utm_source=web&#x26;utm_medium=public-docs&#x26;utm_campaign=data-consultation" class="button primary">Contact sales</a>
{% endhint %}

Clean Company Data provides high-quality, structured business data ready for immediate use. Our data is meticulously cleaned and enriched, allowing organizations to streamline their workflows and confidently make data-driven decisions. By leveraging Clean Company Data, businesses can reduce engineering overhead, gain access to additional insights, and work with optimized data formats for improved efficiency.

With multiple retrieval options, including flat file downloads in JSONL, Parquet, and CSV formats, as well as API access, our solution adapts to your needs, ensuring seamless integration into your existing data infrastructure.

Clean Company Data is derived from our [Base Company Data](/company-data/base-company-data.md).
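
As a minimal sketch of consuming a JSONL flat file, the snippet below reads one company record per line. The `iter_companies` helper and the two-record in-memory sample are illustrative assumptions, not part of the product; only the field names come from this dictionary.

```python
import io
import json
from typing import Iterable, Iterator


def iter_companies(lines: Iterable[str]) -> Iterator[dict]:
    """Yield one company record per non-empty JSONL line."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)


# Hypothetical two-record sample standing in for a downloaded JSONL file.
sample = io.StringIO(
    '{"company_id": 7811468, "company_name": "Example Company"}\n'
    '{"company_id": 7811469, "company_name": null}\n'
)
records = list(iter_companies(sample))
```

With a real download, an open file handle (for example `open(path, encoding="utf-8")`, where `path` is wherever you saved the file) can be passed in place of `sample`.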

{% hint style="info" %}
The data fields are grouped into collections to make the data easier to navigate. The sample data is for illustrative purposes only and shows how the data looks and how it is formatted.
{% endhint %}

{% tabs %}
{% tab title="Data fields per category" %}

1. [Metadata](#metadata)
2. [Identifiers](#identifiers)
3. [Firmographics](#firmographics)
4. [Product and services overview](#product-and-services-overview)
5. [Contact information](#contact-information)
6. [Social media and websites](#social-media-and-websites)
7. [Location](#location)
8. [Funding information](#funding-information)
9. [Technologies](#technologies)
10. [Supporting fields](#supporting-fields)
11. [Company updates](#company-updates)
{% endtab %}
{% endtabs %}

## Metadata

| Data field                       | Processing | Description                                                | Data type     |
| -------------------------------- | ---------- | ---------------------------------------------------------- | ------------- |
| `company_last_updated`           | Cleaned    | Record update date                                         | String (date) |
| `company_created_at`             | Cleaned    | Record creation date                                       | String (date) |
| `professional_network_source_id` | Raw        | Record identification key assigned by professional network | String        |

{% code title="Metadata" %}

```json
"company_created_at": "2023-12-06",
"company_last_updated": "2024-12-06",
"professional_network_source_id": "60191",
```

{% endcode %}

<details>

<summary>Cleaning actions</summary>

| Data field             | Cleaning action                                |
| ---------------------- | ---------------------------------------------- |
| `company_last_updated` | Value is converted to the *yyyy-mm-dd* format. |
| `company_created_at`   | Value is converted to the *yyyy-mm-dd* format. |

</details>
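
The date conversion above can be sketched as follows. The raw input format is not documented here, so this example assumes the source value is an ISO 8601 date or timestamp; `to_iso_date` is a hypothetical helper name.

```python
from datetime import datetime


def to_iso_date(raw: str) -> str:
    """Convert an ISO 8601 date or timestamp string to yyyy-mm-dd.

    Assumption: the raw value is parseable by datetime.fromisoformat.
    """
    return datetime.fromisoformat(raw).date().isoformat()


company_last_updated = to_iso_date("2024-12-06T10:15:30")
company_created_at = to_iso_date("2023-12-06")
```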

***

## Identifiers

| Data field                              | Processing | Description                                             | Data type        |
| --------------------------------------- | ---------- | ------------------------------------------------------- | ---------------- |
| `company_id`                            | Raw        | Company ID in our database                              | Number (integer) |
| `company_hash`                          | Raw        | Company profile URL processed by the MD5 algorithm      | String           |
| `company_canonical_shorthand_name_hash` | Raw        | Canonical shorthand name processed by the MD5 algorithm | String           |
| `company_name`                          | Cleaned    | Company name                                            | String           |
| `company_logo`                          | Cleaned    | Base64-encoded JPEG image of the company's logo         | String           |
| `company_ticker`                        | Cleaned    | Company's stock ticker                                  | String           |
| `company_exchange`                      | Cleaned    | Company's stock exchange                                | String           |

{% code title="Identifiers" %}

```json
    "company_id": 7811468,
    "company_hash": "8ef8d364df382df483f47fe3e56dc4cd",
    "company_canonical_shorthand_name_hash": "8631ca96b6f656040bf3326deeb38df6",
    "company_name": "Example Company",
    "company_logo": "/9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAAMCAgMCAgMDAwMEAwMEBQgFBQQEBQoHBwYIDAoMDAsKCwsNDhIQDQ4RDgsLEBYQERMUFRUVDA8XGBYUGBIUFRT/2wBDAQMEBAUEBQkFBQkUDQsNFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBQUFBT/wAARCAAjACMDASIAAhEBAxEB/8QAHwAAAQUBAQEBAQEAAAAAAAAAAAECAwQFBgcICQoL/8QAtRAAAgEDAwIEAwUFBAQAAAF9AQIDAAQRBRIhMUEGE1FhByJxFDKBkaEII0KxwRVS0fAkM2JyggkKFhcYGRolJicoKSo0NTY3ODk6Q0RFRkdISUpTVFVWV1hZWmNkZWZnaGlqc3R1dnd4eXqDhIWGh4iJipKTlJWWl5iZmqKjpKWmp6ipqrKztLW2t7i5usLDxMXGx8jJytLT1NXW19jZ2uHi4+Tl5ufo6erx8vP09fb3+Pn6/8QAHwEAAwEBAQEBAQEBAQAAAAAAAAECAwQFBgcICQoL/8QAtREAAgECBAQDBAcFBAQAAQJ3AAECAxEEBSExBhJBUQdhcRMiMoEIFEKRobHBCSMzUvAVYnLRChYkNOEl8RcYGRomJygpKjU2Nzg5OkNERUZHSElKU1RVVldYWVpjZGVmZ2hpanN0dXZ3eHl6goOEhYaHiImKkpOUlZaXmJmaoqOkpaanqKmqsrO0tba3uLm6wsPExcbHyMnK0tPU1dbX2Nna4uPk5ebn6Onq8vP09fb3+Pn6/9oADAMBAAIRAxEAPwD9U6K+K7P9rfx1cfFG7smj00aNHrBsltBbHd5Qm8v/AFm7O7HOfXtivc9N/aHsLLRr2612yuI3t/EFzoSGxj83zGjDOr7cgj5ByBnkHHHTtqYOtTtdXv2Pr8dwrmeAUHOKlzJP3Xd6/wBdLnsNFeX+EP2i/CXjjxNa6DpbXr6jcPNGqSQBQpiUNICd3YMp4zncMU/R/wBoHw3rj3S21vqWLc7Sz26gMd8CYHz5+9cR9cd/SsXQqp2cWeNPKcfTk4zoyTST26NtL72n9zPTaK5Tw78QYPFGlR6jYaTqr2zySxAyW6o26ORo3BBbPDIworNwknZnDPD1acnCas1o0fL3wn8TfCbwHqHie68ez6bY+JLfxJdzWzXsEjzIgkyhAUHo24jj39K0vgvN4g+JPg3xHq/hKO1Nu3jG/njkvoQfPtHgAOzeOHbeVz0GSD3r6ivvB2g6ndPc3ei6ddXD/emmtI3dvqSuTWjY2FtptrHbWlvFa28YwkUKBEX6AcCu+eLjK7Sd3bd3XyPtcZxHQr06kqdOTqT5b80lKMVG+kY8qdn5s+cbP4VePrO+iu49H0W2kjdQRarbqSm5/M2NsBVjG6oD/s88YJZqHwQ8U/2VZ/YNI0z7X5UInivEtzGMQxpIoABxkocEE9EPbj6YorL63O97I8NZ5ilLmSX3P/M838FaV4x0fw1a2dzBbQzxtJvX7QgyTIx3YWPABznHbPPNFekUVzOd3eyPJniHUm5uKu9f61CiiiszkCiiigAooooA/9k=",
    "company_ticker": "EXMP",
    "company_exchange": "NYSE",
```

{% endcode %}
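
Because `company_logo` is a Base64-encoded JPEG, it has to be decoded before it can be displayed or saved. The sketch below shows one way to do that; `decode_logo` is a hypothetical helper, and the input is only the first eight characters of the sample value above, used for brevity.

```python
import base64

# JPEG files start with these three bytes.
JPEG_MAGIC = b"\xff\xd8\xff"


def decode_logo(company_logo: str) -> bytes:
    """Decode a Base64 logo string and check that it looks like a JPEG."""
    data = base64.b64decode(company_logo)
    if not data.startswith(JPEG_MAGIC):
        raise ValueError("decoded logo does not look like a JPEG")
    return data


# Truncated prefix of the sample value; a full value decodes to a complete image.
prefix = decode_logo("/9j/4AAQ")
```

The returned bytes from a full value can be written to a `.jpg` file or passed to an image library.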

<details>

<summary>Cleaning and enriching actions</summary>

| Data field       | Cleaning/enriching action                                                                                            |
| ---------------- | -------------------------------------------------------------------------------------------------------------------- |
| `company_name`   | Values *\["None"; "Unknown"; "NaN"; "nan"; "na"; "null"; "Null"; "NULL"; "-"; "--"]* are replaced with value `None`. |
| `company_logo`   | Image is resized to 50x50px.                                                                                         |
| `company_ticker` | Values *\["None"; "Unknown"; "NaN"; "nan"; "na"; "null"; "Null"; "NULL"; "-"; "--"]* are replaced with value `None`. |

</details>
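
The placeholder replacement described above can be sketched as an exact-match lookup. This is an illustration under assumptions rather than the actual pipeline: `clean_text` is a hypothetical name, and matching is assumed to be case-sensitive and limited to the literals listed in the table.

```python
from typing import Optional

# Placeholder literals listed in the cleaning actions table.
PLACEHOLDERS = {
    "None", "Unknown", "NaN", "nan", "na",
    "null", "Null", "NULL", "-", "--",
}


def clean_text(value: Optional[str]) -> Optional[str]:
    """Return None for placeholder values, otherwise the value unchanged."""
    if value is None or value in PLACEHOLDERS:
        return None
    return value


company_ticker = clean_text("EXMP")
company_name = clean_text("Unknown")
```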

***

## Firmographics

| Data field                              | Processing | Description                                                                                                                                                | Data type        |
| --------------------------------------- | ---------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------- | ---------------- |
| `company_industry`                      | Cleaned    | Company's industry                                                                                                                                         | String           |
| `company_type`                          | Cleaned    | Company type                                                                                                                                               | String           |
| `company_founded`                       | Cleaned    | Company's founding year                                                                                                                                    | String           |
| `company_size_range`                    | Cleaned    | Company size range                                                                                                                                         | String           |
| `company_size_employees_count`          | Enriched   | The number of employees working in the company                                                                                                             | Number (integer) |
| `company_size_employees_count_inferred` | Enriched   | Estimated number of employees, calculated based on inferred employee data                                                                                  | Number (integer) |
| `company_followers`                     | Cleaned    | The number of company followers                                                                                                                            | Number (integer) |
| `company_description`                   | Cleaned    | Company description                                                                                                                                        | String           |
| `company_specialities`                  | Raw        | Company specialties                                                                                                                                        | String           |
| `metadata_title`                        | Enriched   | Company title parsed from additional sources                                                                                                               | String           |
| `metadata_description`                  | Enriched   | Company description parsed from additional sources                                                                                                         | String           |
| `company_enriched_summary`              | Enriched   | LLM enriched company summary                                                                                                                               | String           |
| `company_enriched_category`             | Enriched   | Company category assigned with LLM                                                                                                                         | String           |
| `company_enriched_keywords`             | Enriched   | LLM enriched company keywords                                                                                                                              | Array of strings |
| `company_enriched_b2b`                  | Enriched   | <p>Marks whether the company offers B2B products/services, as determined with the help of an LLM<br><code>1</code> – B2B company<br><code>0</code> – not a B2B company</p> | Number (integer) |

{% code title="Firmographics" %}

```json
    "company_type": "Partnership",
    "company_founded": "2010",
    "company_followers": 0,
    "company_size_range": "1-10 employees",
    "company_size_employees_count": 2,
    "company_size_employees_count_inferred": 2,
    "company_industry": "Advertising Services",
    "company_description": "We help SMEs grow their businesses through effective online marketing strategies. ",
    "company_specialities": "Email Marketing, Web Sites, Search Engine Optimisation, Inbound Marketing, Social media Marketing",
    "company_enriched_summary": "Company1 is a premier web design and digital marketing agency based in London, UK. Specializing in custom, responsive websites, they provide professional design services, training, easy content management, and ongoing support.",
    "company_enriched_keywords": [
        "website design",
        "digital marketing",
        "professional",
        "custom responsive websites",
        "training"
    ],
    "company_enriched_b2b": 1.0,
    "company_enriched_category": "Web Design",
    "metadata_title": "Marketing, London,Cost Effective Web Design",
    "metadata_description": null
```

{% endcode %}

<details>

<summary>Cleaning and enriching actions</summary>

| Data field                     | Cleaning/enriching action                                                                                                                                                                                                                                                                                                                           |
| ------------------------------ | --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| `company_industry`             | Values *\["None"; "Unknown"; "NaN"; "nan"; "na"; "null"; "Null"; "NULL"; "-"; "--"]* are replaced with value `None`.                                                                                                                                                                                                                                |
| `company_type`                 | Values *\["None"; "Unknown"; "NaN"; "nan"; "na"; "null"; "Null"; "NULL"; "-"; "--"]* are replaced with value `None`.                                                                                                                                                                                                                                |
| `company_founded`              | <ul><li>Values <em>\["None"; "Unknown"; "NaN"; "nan"; "na"; "null"; "Null"; "NULL"; "-"; "--"]</em> are replaced with value <code>None</code>;</li><li>Values are replaced with <code>None</code> if the year is not between 500 and the current year.</li></ul>                                                                                    |
| `company_followers`            | <ul><li>Values <em>\["None"; "Unknown"; "NaN"; "nan"; "na"; "null"; "Null"; "NULL"; "-"; "--"]</em> are replaced with value <code>0</code>;</li><li>Every value is converted to an integer.</li></ul>                                                                                                                                               |
| `company_size_range`           | <p>Inconsistent and overlapping values are normalized:</p><ul><li>"1 employee" → "Myself Only";</li><li>"2-10 employees" → "1-10 employees";</li><li>"501-1,000 employees" → "501-1000 employees";</li><li>"1,001-5,000 employees" → "1001-5000 employees".</li></ul> |
| `company_size_employees_count` | When `company_size_employees_count` is `0`, we check if we have any scraped profiles of employees working at this company. If yes, then we count how many employees are associated with it and change the value to that number. This can occur in cases when the public profile does not show some of the employees.                                |
| `company_description`          | <ul><li>Values <em>\["None"; "Unknown"; "NaN"; "nan"; "na"; "null"; "Null"; "NULL"; "-"; "--"]</em> are replaced with value <code>None</code>;</li><li>Value is replaced with <code>None</code> if the description is shorter than 3 characters;</li><li>Text styling tags are removed;</li><li>Multiple spaces are replaced with single ones.</li></ul> |

</details>
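
A few of the firmographic rules above can be sketched in code. This is an illustration, not the actual pipeline: the function names are hypothetical, the year bounds are assumed to be inclusive, styling tags are assumed to be HTML-like, the size-range mapping is assumed to run from the first value to the second, and the order of the description steps is a guess.

```python
import re
from datetime import date
from typing import Optional

# Assumed direction: inconsistent value -> normalized value.
SIZE_RANGE_FIXES = {
    "1 employee": "Myself Only",
    "2-10 employees": "1-10 employees",
    "501-1,000 employees": "501-1000 employees",
    "1,001-5,000 employees": "1001-5000 employees",
}


def clean_founded(value: Optional[str]) -> Optional[str]:
    """Keep the year only if it lies between 500 and the current year."""
    try:
        year = int(value)
    except (TypeError, ValueError):
        return None
    return value if 500 <= year <= date.today().year else None


def clean_size_range(value: Optional[str]) -> Optional[str]:
    """Map known inconsistent size ranges to their normalized form."""
    return SIZE_RANGE_FIXES.get(value, value)


def clean_description(value: Optional[str]) -> Optional[str]:
    """Strip tags, collapse repeated spaces, drop very short descriptions."""
    if value is None:
        return None
    text = re.sub(r"<[^>]+>", "", value)  # assumption: tags are HTML-like
    text = re.sub(r" {2,}", " ", text)
    return text if len(text) >= 3 else None
```

Placeholder values such as `"Unknown"` fall out of `clean_founded` naturally because they cannot be parsed as a year.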

***

## Product and services overview

| Data field             | Processing | Description                                                | Data type |
| ---------------------- | ---------- | ---------------------------------------------------------- | --------- |
| `pricing_available`    | Enriched   | Marks if the company service pricing is available online   | Boolean   |
| `free_trial_available` | Enriched   | Marks if the company offers a free trial of their services | Boolean   |
| `demo_available`       | Enriched   | Marks if the company offers a demo                         | Boolean   |
| `is_downloadable`      | Enriched   | Marks if the company offers a downloadable file/service    | Boolean   |
| `mobile_apps_exist`    | Enriched   | Marks if the company has mobile apps for their service     | Boolean   |
| `online_reviews_exist` | Enriched   | Marks if the company has any online reviews                | Boolean   |
| `api_docs_exist`       | Enriched   | Marks if the company has API docs published                | Boolean   |

{% code title="Product and services overview" %}

```json
    "pricing_available": true,
    "free_trial_available": false,
    "demo_available": false,
    "is_downloadable": false,
    "mobile_apps_exist": false,
    "online_reviews_exist": false,
    "api_docs_exist": false,
```

{% endcode %}

<details>

<summary>Enriching actions</summary>

| Data field                                                                                                                                                                                                                                                       | Enriching action                                     |
| ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | ---------------------------------------------------- |
| <p><code>pricing\_available</code>,<br><code>free\_trial\_available</code>,<br><code>demo\_available</code>,<br><code>is\_downloadable</code>,<br><code>mobile\_apps\_exist</code>,<br><code>online\_reviews\_exist</code>,<br><code>api\_docs\_exist</code></p> | Information taken from the official company website. |

</details>

***

## Contact information

| Data field              | Processing | Description                                | Data type        |
| ----------------------- | ---------- | ------------------------------------------ | ---------------- |
| `company_phone_numbers` | Enriched   | Publicly available company phone numbers   | Array of strings |
| `company_emails`        | Enriched   | Publicly available company email addresses | Array of strings |

{% code title="Contact information" %}

```json
    "company_phone_numbers": [
        "0000 000 000"
    ],
    "company_emails": [
        "info@company123.com"
    ],
```

{% endcode %}

<details>

<summary>Enriching actions</summary>

| Data field                                                                   | Enriching action                                     |
| ---------------------------------------------------------------------------- | ---------------------------------------------------- |
| <p><code>company\_phone\_numbers</code>,<br><code>company\_emails</code></p> | Information taken from the official company website. |

</details>

***

## Social media and websites

| Data field                                        | Processing | Description                                                                                                                | Data type        |
| ------------------------------------------------- | ---------- | -------------------------------------------------------------------------------------------------------------------------- | ---------------- |
| `company_websites_main_original`                  | Raw        | Company website                                                                                                            | String           |
| `company_websites_main`                           | Enriched   | Cleaned and resolved website URL                                                                                           | String           |
| `company_websites_facebook`                       | Enriched   | Facebook profile URL                                                                                                       | String           |
| `company_websites_twitter`                        | Enriched   | Twitter profile URL                                                                                                        | String           |
| `company_websites_professional_network`           | Raw        | Professional network URL where the company was first discovered. It can be outdated if the company has changed its profile | String           |
| `company_websites_professional_network_canonical` | Raw        | The current official professional network URL for the company, reflecting the most recent updates                          | String           |
| `company_social_discord_urls`                     | Enriched   | Discord channel URL                                                                                                        | Array of strings |
| `company_social_facebook_urls`                    | Enriched   | Facebook profile URL                                                                                                       | Array of strings |
| `company_social_instagram_urls`                   | Enriched   | Instagram profile URL                                                                                                      | Array of strings |
| `company_social_professional_network_urls`        | Enriched   | Company professional network profile URL                                                                                   | Array of strings |
| `company_social_pinterest_urls`                   | Enriched   | Pinterest profile URL                                                                                                      | Array of strings |
| `company_social_tiktok_urls`                      | Enriched   | TikTok profile URL                                                                                                         | Array of strings |
| `company_social_twitter_urls`                     | Enriched   | Twitter profile URL                                                                                                        | Array of strings |
| `company_social_x_urls`                           | Enriched   | X profile URL                                                                                                              | Array of strings |
| `company_social_youtube_urls`                     | Enriched   | YouTube channel/profile URL                                                                                                | Array of strings |
| `company_social_github_urls`                      | Enriched   | GitHub page/profile URL                                                                                                    | Array of strings |
| `company_social_reddit_urls`                      | Enriched   | Reddit profile URL                                                                                                         | Array of strings |

{% tabs %}
{% tab title="Social media and websites" %}
{% code title="Social media and websites" %}

```json
 "company_websites_main_original": "http://www.example-company.com",
 "company_websites_main": "https://example-company.com",
 "company_websites_facebook": "https://www.facebook.com/example-company",
 "company_websites_twitter": "https://www.twitter.com/example-company",
 "company_websites_professional_network": "https://www.professional_network.com/company/example-company-international-limited",
 "company_websites_professional_network_canonical": "https://www.professional_network.com/company/example-company-international-limited",
```

{% endcode %}
{% endtab %}

{% tab title="Company social links" %}
{% code title="Company social links" %}

```json
"company_social_discord_urls": [
    "https://discord.gg/example-company"
],
"company_social_facebook_urls": [
    "https://www.facebook.com/example-company"
],
"company_social_instagram_urls": [
    "https://www.instagram.com/example_company"
],
"company_social_professional_network_urls": [
    "https://www.professional_network.com/company/example-company"
],
"company_social_pinterest_urls": [
    "https://www.pinterest.com/example_company"
],
"company_social_tiktok_urls": [
    "https://www.tiktok.com/@example_company"
],
"company_social_twitter_urls": [
    "https://twitter.com/example_company"
],
"company_social_x_urls": [
    "https://x.com/example_company"
],
"company_social_youtube_urls": [
    "https://www.youtube.com/c/example-company"
],
"company_social_github_urls": [
    "https://github.com/example-company"
],
"company_social_reddit_urls": [
    "https://www.reddit.com/user/example_company"
]
```

{% endcode %}
{% endtab %}
{% endtabs %}

<details>

<summary>Cleaning and enriching actions</summary>

| Data field                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   | Cleaning/enriching action                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         |
| ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| `company_websites_main` | <ul><li>Every <code>company\_websites\_main\_original</code> URL is resolved;</li><li>Each URL we collect is parsed, its parameters are removed, and the result is added to the <code>company\_websites\_main</code> column. The resulting URL format is <code>\<protocol>://\<domain>.\<tld>/\<path></code>;</li><li>Each <code>\<domain>.\<tld>/\<path></code> can belong to only one company. If multiple companies have the same URL, we assign it to the company that has the highest number of employees;</li><li>Expired domains are removed;</li><li>Additional enrichment actions are completed.</li></ul> |
| `company_websites_twitter` | If \<domain> (from `company_websites_main`) == `twitter`, we move the URL value to `company_websites_twitter`. |
| `company_websites_facebook`                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  | If \<domain> (from `company_websites_main`) == `facebook`, we move the URL value to `company_websites_facebook`.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |
| `company_websites_professional_network`                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      | If \<domain> (from `company_websites_main`) == `professional_network`, we move the URL value to `company_websites_professional_network`.                                                                                                                                                                                                                                                                                                                                                                                                                                                          |
| <p><code>company\_social\_discord\_urls</code>,<br><code>company\_social\_facebook\_urls</code>,<br><code>company\_social\_instagram\_urls</code>,<br><code>company\_social\_professional\_network\_urls</code>,<br><code>company\_social\_pinterest\_urls</code>,<br><code>company\_social\_tiktok\_urls</code>,<br><code>company\_social\_twitter\_urls</code>,<br><code>company\_social\_x\_urls</code>,<br><code>company\_social\_youtube\_urls</code>,<br><code>company\_social\_github\_urls</code>,<br><code>company\_social\_reddit\_urls</code></p> | URLs taken from the official company website.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |

</details>

***

## Location

| Data field                          | Processing | Description                                                                                             | Data type        |
| ----------------------------------- | ---------- | ------------------------------------------------------------------------------------------------------- | ---------------- |
| `company_location_hq_raw_address`   | Cleaned    | Detailed company location                                                                               | String           |
| `company_location_hq_country`       | Cleaned    | Headquarters country                                                                                    | String           |
| `company_location_hq_country_iso_2` | Enriched   | ISO 2-letter country code for company headquarters location                                             | String           |
| `company_location_hq_country_iso_3` | Enriched   | ISO 3-letter country code for company headquarters location                                             | String           |
| `company_location_hq_state`         | Enriched   | Company headquarters state                                                                              | String           |
| `company_location_hq_city`          | Enriched   | Company headquarters city                                                                               | String           |
| `company_location_hq_regions`       | Enriched   | Geographical region(s) the company is associated with based on the `company_location_hq_country` value. | String           |
| **`company_locations_full`**        | Raw        | Full company location information                                                                       | Array of objects |
| `location_address`                  | Raw        | Company HQ location                                                                                     | String           |
| `is_primary`                        | Raw        | Indicates whether the listed location is the primary one                                                | Boolean          |
| `city`                              | Enriched   | Location city                                                                                           | String           |
| `state`                             | Enriched   | Location state                                                                                          | String           |
| `country_code`                      | Enriched   | Country code                                                                                            | String           |
| `country`                           | Enriched   | Country                                                                                                 | String           |
| `country_iso_2`                     | Enriched   | ISO 2-letter code of the location country                                                               | String           |
| `country_iso_3`                     | Enriched   | ISO 3-letter code of the location country                                                               | String           |
| **`regions`**                       | Enriched   | Regions list                                                                                            | Struct           |
| `region`                            | Enriched   | Region                                                                                                  | String           |

{% code title="Locations" %}

```json
"company_location_hq_raw_address": "Los Angeles, CA, United States",
"company_location_hq_country": "United States",
"company_location_hq_country_iso_2": "US",
"company_location_hq_country_iso_3": "USA",
"company_location_hq_state": "CA",
"company_location_hq_city": "Exampleville",
"company_location_hq_regions": "[Northern America, Northern America, AMER]",
"company_locations_full": [
   {
      "location_address": "Sample St; Exampleville, CA, USA",
      "is_primary": true,
      "city": "Exampleville",
      "state": "CA",
      "country": "United States",
      "country_iso_2": "US",
      "country_iso_3": "USA",
      "regions": [
        {
            "region": "Northern America"
        }
      ]
   }
],
```

{% endcode %}
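
As a minimal sketch of how the nested `company_locations_full` structure might be read, the snippet below picks the location flagged with `is_primary` and flattens its `regions` list. The helper names `primary_location` and `region_names` are hypothetical and not part of any Coresignal library; the record is a trimmed copy of the sample above.

```python
from typing import Any, Optional


def primary_location(record: dict[str, Any]) -> Optional[dict[str, Any]]:
    """Return the first location flagged as primary, or None if there is none."""
    for location in record.get("company_locations_full") or []:
        if location.get("is_primary"):
            return location
    return None


def region_names(location: dict[str, Any]) -> list[str]:
    """Flatten the list of {"region": ...} objects into plain strings."""
    return [
        item["region"]
        for item in location.get("regions") or []
        if item.get("region")
    ]


# Trimmed copy of the sample record shown above.
record = {
    "company_location_hq_country": "United States",
    "company_locations_full": [
        {
            "location_address": "Sample St; Exampleville, CA, USA",
            "is_primary": True,
            "city": "Exampleville",
            "state": "CA",
            "country": "United States",
            "country_iso_2": "US",
            "country_iso_3": "USA",
            "regions": [{"region": "Northern America"}],
        }
    ],
}

hq = primary_location(record)
hq_regions = region_names(hq) if hq else []
```

Both helpers tolerate a missing or empty `company_locations_full` value, which seems a reasonable precaution for records without location data.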

<details>

<summary>Cleaning actions</summary>

| Data field                | Cleaning action                                                                                                                                                                                                                                                                                                         |
| ------------------------- | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| `location_hq_country`     | Values *\["None"; "Unknown"; "NaN"; "nan"; "na"; "null"; "Null"; "NULL"; "-"; "--"]* are replaced with value `None`.                                                                                                                                                                                                    |
| `location_hq_raw_address` | <ul><li>Values <em>\["None"; "Unknown"; "NaN"; "nan"; "na"; "null"; "Null"; "NULL"; "-"; "--"]</em> are replaced with value <code>None</code>;</li><li>Special trailing characters trimmed; </li><li>Value <code>company\_location\_hq\_country</code> added to the end of the string (separated by a comma).</li></ul> |

</details>
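
The location cleaning actions listed above can be approximated with the sketch below. It is an illustration only, not the actual pipeline: the function names are hypothetical, and "special trailing characters" is assumed here to mean trailing punctuation and whitespace.

```python
import string
from typing import Optional

# Placeholder values listed in the cleaning actions table.
PLACEHOLDERS = {"None", "Unknown", "NaN", "nan", "na", "null", "Null", "NULL", "-", "--"}

# Assumption: "special trailing characters" = trailing punctuation and whitespace.
TRAILING_CHARS = string.punctuation + string.whitespace


def clean_country(value: Optional[str]) -> Optional[str]:
    """Replace placeholder values with None; otherwise return the stripped value."""
    if value is None or value.strip() in PLACEHOLDERS:
        return None
    return value.strip()


def clean_raw_address(value: Optional[str], country: Optional[str]) -> Optional[str]:
    """Null out placeholders, trim trailing characters, then append the country."""
    if value is None or value.strip() in PLACEHOLDERS:
        return None
    trimmed = value.strip().rstrip(TRAILING_CHARS)
    if not trimmed:
        return None
    # The country is appended to the end of the string, separated by a comma.
    return f"{trimmed}, {country}" if country else trimmed


country = clean_country("United States")
address = clean_raw_address("Los Angeles, CA, ", country)
```

With these inputs, `address` becomes `"Los Angeles, CA, United States"`, which matches the shape of `company_location_hq_raw_address` in the sample.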

***

## Funding information

| Data field                   | Processing | Description                                                         | Data type        |
| ---------------------------- | ---------- | ------------------------------------------------------------------- | ---------------- |
| `company_funding_rounds`     |            | Funding round details                                               | Array of objects |
| `last_round_investors_count` | Cleaned    | The number of investors that participated in the last funding round | Number (integer) |
| `total_rounds_count`         | Cleaned    | Total number of funding rounds                                      | Number (integer) |
| `last_round_type`            | Cleaned    | Last funding round type                                             | String           |
| `last_round_date`            | Cleaned    | Last funding round date                                             | String           |
| `last_round_money_raised`    | Cleaned    | Total funds raised                                                  | Number (integer) |
| `financial_website_url`      | Raw        | Financial website URL of the last funding round                     | String           |

{% code title="Funding information" %}

```json
 "company_funding_rounds": [
        {
            "last_round_investors_count": 5,
            "total_rounds_count": 3,
            "last_round_type": "Series A",
            "last_round_date": "2020-11-10",
            "last_round_money_raised": 15600000,
            "financial_website_url": "https://www.financial_website.com/funding_round/example-company-series-a--f1687fe3"
        }
    ]
```

{% endcode %}

<details>

<summary>Cleaning actions</summary>

| Data field                   | Cleaning action                                                                                                                                                                                                                                     |
| ---------------------------- | --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| `company_funding_rounds`     | <ul><li>Duplicate data fields filtered out;</li><li>Removed empty/irrelevant data fields.</li></ul>                                                                                                                                                 |
| `last_round_investors_count` | <ul><li>Values <em>\["None"; "Unknown"; "NaN"; "nan"; "na"; "null"; "Null"; "NULL"; "-"; "--"]</em> are replaced with value <code>0</code>;</li><li>Every value is converted to an integer.</li></ul>                                               |
| `total_rounds_count`         | <ul><li>Values <em>\["None"; "Unknown"; "NaN"; "nan"; "na"; "null"; "Null"; "NULL"; "-"; "--"]</em> are replaced with value <code>0</code>;</li><li>Every value is converted to an integer.</li></ul>                                               |
| `last_round_type`            | Values *\["None"; "Unknown"; "NaN"; "nan"; "na"; "null"; "Null"; "NULL"; "-"; "--"]* are replaced with value `0`.                                                                                                                                   |
| `last_round_date`            | Value is converted to the *yyyy-mm-dd* format.                                                                                                                                                                                                      |
| `last_round_money_raised`    | <ul><li>Values <em>\["None"; "Unknown"; "NaN"; "nan"; "na"; "null"; "Null"; "NULL"; "-"; "--"]</em> are replaced with value <code>0</code>;</li><li>Every value is converted to an integer (integer value is parsed from the text value).</li></ul> |

</details>
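
The numeric and date cleaning actions above could look roughly like the following sketch. It rests on assumptions that the table does not state: the helper names are hypothetical, the accepted input date formats in `INPUT_DATE_FORMATS` are guesses, and abbreviated amounts such as "15.6M" are not handled.

```python
import re
from datetime import datetime
from typing import Any, Optional

# Placeholder values listed in the cleaning actions table.
PLACEHOLDERS = {"None", "Unknown", "NaN", "nan", "na", "null", "Null", "NULL", "-", "--"}

# Assumption: these input formats are illustrative; the real source formats are not documented here.
INPUT_DATE_FORMATS = ("%Y-%m-%d", "%b %d, %Y", "%d/%m/%Y")


def to_int(value: Any) -> int:
    """Map placeholders to 0 and parse the first integer found in a text value."""
    if value is None:
        return 0
    if isinstance(value, int):
        return int(value)
    text = str(value).strip()
    if text in PLACEHOLDERS:
        return 0
    # First run of digits, allowing thousands separators such as "15,600,000".
    match = re.search(r"\d[\d,]*", text)
    return int(match.group().replace(",", "")) if match else 0


def to_iso_date(value: Optional[str]) -> Optional[str]:
    """Convert a date string to yyyy-mm-dd, or return None if no format matches."""
    if not value:
        return None
    for fmt in INPUT_DATE_FORMATS:
        try:
            return datetime.strptime(value.strip(), fmt).strftime("%Y-%m-%d")
        except ValueError:
            continue
    return None


def clean_funding_round(raw: dict[str, Any]) -> dict[str, Any]:
    """Apply the integer and date cleaning to one funding round object."""
    return {
        "last_round_investors_count": to_int(raw.get("last_round_investors_count")),
        "total_rounds_count": to_int(raw.get("total_rounds_count")),
        "last_round_date": to_iso_date(raw.get("last_round_date")),
        "last_round_money_raised": to_int(raw.get("last_round_money_raised")),
    }


cleaned = clean_funding_round({
    "last_round_investors_count": "5",
    "total_rounds_count": "Unknown",
    "last_round_date": "Nov 10, 2020",
    "last_round_money_raised": "US$ 15,600,000",
})
```

The sketch leaves `last_round_type` and `financial_website_url` out, since they are not converted to integers or dates.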

***

## Technologies

| Data field             | Processing | Description                                                                                            | Data type        |
| ---------------------- | ---------- | ------------------------------------------------------------------------------------------------------ | ---------------- |
| `company_technologies` | Enriched   | Technologies used by the company                                                                       | Array of structs |
| `technology`           | Enriched   | Technology name                                                                                        | String           |
| `first_verified_at`    | Enriched   | <p>Date this technology was first assigned to the company.<br>Date format: <code>YYYY-MM-DD</code></p> | String (date)    |
| `last_verified_at`     | Enriched   | <p>Date this technology was last assigned to the company.<br>Date format: <code>YYYY-MM-DD</code></p>  | String (date)    |

{% code title="Technologies" %}

```json
"company_technologies": [
    {
      "technology": "React",
      "first_verified_at": "2022-03-15",
      "last_verified_at": "2025-02-15"
    }
  ]
```

{% endcode %}

<details>

<summary>Enriching actions</summary>

| Data field             | Enriching action                                |
| ---------------------- | ----------------------------------------------- |
| `company_technologies` | Enriched by our ML model from multiple sources. |

</details>
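
Because `first_verified_at` and `last_verified_at` use the `YYYY-MM-DD` format, they can be parsed directly with the Python standard library. The sketch below, with a hypothetical helper name, lists the technologies last verified on or after a chosen cutoff date; the record is a copy of the sample above.

```python
from datetime import date
from typing import Any


def technologies_verified_since(record: dict[str, Any], cutoff: date) -> list[str]:
    """Return names of technologies whose last_verified_at is on or after cutoff."""
    names = []
    for entry in record.get("company_technologies") or []:
        last_seen = entry.get("last_verified_at")
        if last_seen and date.fromisoformat(last_seen) >= cutoff:
            names.append(entry["technology"])
    return names


# Copy of the sample record shown above.
record = {
    "company_technologies": [
        {
            "technology": "React",
            "first_verified_at": "2022-03-15",
            "last_verified_at": "2025-02-15",
        }
    ]
}

recent = technologies_verified_since(record, date(2025, 1, 1))
```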

***

## Supporting fields

| Data field         | Processing | Description                                                                                                                                          | Data type |
| ------------------ | ---------- | ---------------------------------------------------------------------------------------------------------------------------------------------------- | --------- |
| `expired_domain`   | Enriched   | <p>Indicates that the <code>company\_websites\_main\_original</code><br>URL redirects to a domain dealer</p>                                         | Integer   |
| `unique_subdomain` | Enriched   | Indicates that only the company in this record owns the subdomain                                                                                    | Integer   |
| `unique_domain`    | Enriched   | Indicates that this domain belongs only to this company, e.g., `company_websites_main: https://ibm.com`                                              | Integer   |
| `unique_website`   | Enriched   | Indicates that this company has a unique website, but not necessarily a unique domain, e.g., `company_websites_main: https://ibm.com/generation`     | Integer   |

{% code title="Supporting fields" %}

```json
    "expired_domain": 0,
    "unique_domain": 1,
    "unique_subdomain": 1,
    "unique_website": 0,
```

{% endcode %}
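
One possible way to use the supporting fields is as record filters. The sketch below assumes the integer flags are `1` for true and `0` for false, as the sample suggests; the helper names and the filter rule itself are hypothetical.

```python
from typing import Any

FLAG_FIELDS = ("expired_domain", "unique_subdomain", "unique_domain", "unique_website")


def website_flags(record: dict[str, Any]) -> dict[str, bool]:
    """Convert the 0/1 supporting fields to booleans (assumption: 1 means true)."""
    return {name: record.get(name) == 1 for name in FLAG_FIELDS}


def has_live_unique_domain(record: dict[str, Any]) -> bool:
    """Keep records whose domain is unique to the company and not expired."""
    flags = website_flags(record)
    return flags["unique_domain"] and not flags["expired_domain"]


# Copy of the sample values shown above.
record = {
    "expired_domain": 0,
    "unique_domain": 1,
    "unique_subdomain": 1,
    "unique_website": 0,
}

flags = website_flags(record)
keep = has_live_unique_domain(record)
```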

***

## Company updates

| Data field                      | Processing | Description                                                                       | Data type        |
| ------------------------------- | ---------- | --------------------------------------------------------------------------------- | ---------------- |
| `company_updates`               |            | Company posts and related details                                                 | Array of objects |
| `urn`                           | Raw        | String-based identifier                                                           | String           |
| `followers`                     | Raw        | Number of followers                                                               | String           |
| `date`                          | Raw        | <p>Post publish date<br>(e.g., 1 month ago)</p>                                   | String           |
| `description`                   | Raw        | <p>Published text</p><p><strong>Note:</strong> may contain control characters</p> | String           |
| `reactions_count`               | Raw        | Number of reactions on the post                                                   | Integer          |
| `comments_count`                | Raw        | Number of comments on the post                                                    | Integer          |
| `reshared_post_author`          | Raw        | Reshared post author                                                              | String           |
| `reshared_post_author_url`      | Raw        | Author's profile URL                                                              | String           |
| `reshared_post_author_headline` | Raw        | Author's headline                                                                 | String           |
| `reshared_post_description`     | Raw        | Reshared post text                                                                | String           |
| `reshared_post_followers`       | Raw        | The number of followers of the reshared post author                               | Integer          |
| `reshared_post_date`            | Raw        | <p>Date the reshared post was published<br>(e.g., 1 month ago)</p>                | String           |

{% code title="Company updates" %}

```json
"company_updates": [
      {
        "urn": "urn:pn:activity:6991335602751201281",
        "followers": 1371,
        "date": "1mo",
        "description": "Example description",
        "reactions_count": 22,
        "comments_count": 2,
        "reshared_post_author": "John Doe",
        "reshared_post_author_url": "https://www.professional_network.com/in/john-doe",
        "reshared_post_author_headline": "Co-Founder at Example Company, TEDx & Keynote Speaker",
        "reshared_post_description": "Example description",
        "reshared_post_followers": 45,
        "reshared_post_date": "1mo"
      }
  ]
```

{% endcode %}
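
Since `description` may contain control characters, it can help to sanitize the text before displaying or indexing it. The following is a minimal sketch using only the Python standard library; the function name is hypothetical, and keeping newlines and tabs is a design choice rather than a documented rule.

```python
import unicodedata


def strip_control_characters(text: str, keep: str = "\n\t") -> str:
    """Remove Unicode control characters (category Cc), except those listed in keep."""
    return "".join(
        ch for ch in text
        if ch in keep or unicodedata.category(ch) != "Cc"
    )


raw_description = "Example\x00 descrip\x07tion\n"
clean_description = strip_control_characters(raw_description)
```

The same helper could be applied to `reshared_post_description`, which is also raw text.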

