# Data Dictionary: Chrome Store Companies

Documentation contains explanations and examples for all data fields available in the **Chrome Store** **Companies** dataset.

{% hint style="info" %}
All personal/company information mentioned within this context is entirely fictional and is solely intended for illustrative purposes.
{% endhint %}

{% tabs %}
{% tab title="Data fields per category" %}

1. [Metadata](#metadata)
2. [Product information](#product-information)
3. [Reviews](#reviews)
4. [Related extensions](#related-extensions)
   {% endtab %}
   {% endtabs %}

{% hint style="info" %}
The data fields in the example snippets have been rearranged for better grouping. To see where a specific data field stands, check the full data sample [here](https://docs.coresignal.com/additional-sources/chrome-store/chrome-store-companies/data-sample).
{% endhint %}

***

## Metadata

| Data field             | Description                                              | Data type        |
| ---------------------- | -------------------------------------------------------- | ---------------- |
| `meta`                 | Contains metadata about the record                       | Object           |
| `created_at_date`      | Date when the record was initially scraped               | Array of numbers |
| `created_at_timestamp` | Unix timestamp for when the record was first scraped     | Number           |
| `updated_at_date`      | Date when the record was last updated                    | Array of numbers |
| `updated_at_timestamp` | Unix timestamp for when the record was last updated      | Number           |
| `version_id`           | Identifier for the dataset version                       | String           |
| `source`               | Source from which the record was obtained                | String           |
| `object`               | Data object type                                         | String           |
| `is_deleted`           | Indicates if the record is available on the Chrome Store | Boolean          |

**See a snippet of the dataset for reference:**

{% code title="Metadata" %}

```json
	{
		"_meta": {
			"created_at_date": [
				2023,
				10,
				13
			],
			"created_at_timestamp": 1697178153.75913,
			"updated_at_date": [
				2024,
				1,
				12
			],
			"updated_at_timestamp": 1705059501.547118,
			"version_id": "aff195c9",
			"source": "chromestore",
			"object": "company",
			"is_deleted": false
		},
```

{% endcode %}

## Product information

### Product metadata

| Data field  | Description                               | Data type |
| ----------- | ----------------------------------------- | --------- |
| `id`        | Record identifier in our database         | String    |
| `source_id` | Record identifier on Chrome Web Store     | String    |
| `url`       | URL to the extension page on Chrome Store | String    |

**See a snippet of the dataset for reference:**

{% code title="Metadata" %}

```json
"id": "chromestore_company_chrome-extension/ooablmjjcdbdjhhjkaffpbjnanonjgnm",
"source_id": "chrome-extension/ooablmjjcdbdjhhjkaffpbjnanonjgnm",
"url": "https://chrome.google.com/webstore/detail/chrome-extension/ooablmjjcdbdjhhjkaffpbjnanonjgnm",
```

{% endcode %}

### Profile details

| Data field    | Description                            | Data type        |
| ------------- | -------------------------------------- | ---------------- |
| `name`        | Product name                           | String           |
| `website_url` | Product website (outside Chrome Store) | String           |
| `logo_url`    | Logo URL                               | String           |
| `category`    | Software category on Chrome Store      | Array of strings |

**See a snippet of the dataset for reference:**

{% code title="Product information" %}

```json
"name": "chrome-extension",
"website_url": "https://chrome.google.com/webstore/category/ext/1-communication",
"logo_url": "https://lh3.googleusercontent.com/p1HIJPf9Y8KacaXX0z9rJOBWvnp454555ffYK3EWlFWQjF--l-pGX-SWGyducI3A5we_y_srptH875NdyS3tKGIeYpynpx6E=w128-h128-e344365-rj-sc0x00ffffff",
"category": [
            "Social & Communication"
        ],
```

{% endcode %}

### Product overview

| Data field        | Description                    | Data type |
| ----------------- | ------------------------------ | --------- |
| `overview`        | Extention overview             | Object    |
| `main_image`      | Logo URL                       | String    |
| `description`     | Description                    | String    |
| `version`         | Version available for download | String    |
| `last_updated`    | Last update date               | String    |
| `size`            | Extension size                 | String    |
| `languages_count` | Supported language count       | Integer   |

**See a snippet of the dataset for reference:**

{% code title="Product overview" %}

```json
"overview": {
            "main_image": "https: //lh3.googleusercontent.com/p1HIJPf9Y8KacaXX0z9rJOBWvnp454555ffYK3EWlFWQjF--l-pGX-SWGyducI3A5we_y_srptH875NdyS3tKGIeYpynpx6E=w128-h128-e344365-rj-sc0x00ffffff",
            "description": "Overview\nStop seeing HN stories on offending topics\nSource Available on github\n\nTired of seeing political stories in the HN list? Not interested in the latest JavaScript framework? Had enough of the Rust lovefest?\nRead more\nRead less",
            "version": "0.4",
            "last_updated": "July 19, 2017",
            "size": "7.31KiB",
            "languages_count": 5
        },
```

{% endcode %}

### Product stats

| Data field     | Description                                             | Data type |
| -------------- | ------------------------------------------------------- | --------- |
| `rating`       | Overview of privacy practices employed by the developer | Float     |
| `review_count` | Total number of reviews                                 | Integer   |
| `users_count`  | Total number of users                                   | Integer   |

**See a snippet of the dataset for reference:**

{% code title="Product information" %}

```json
    "rating": 4.9,
    "review_count": 16,
    "users_count": 30,
```

{% endcode %}

### Product privacy practices

| Data field               | Description                                             | Data type |
| ------------------------ | ------------------------------------------------------- | --------- |
| `privacy_practices_text` | Overview of privacy practices employed by the developer | String    |

**See a snippet of the dataset for reference:**

{% code title="Privacy practises" %}

```json
"privacy_practices_text": "Privacy practices\nThe developer has disclosed that it will not collect or use your data\nTo learn more, see the developer's privacy policy\nThis developer declares that your data is:\nNot being sold to third parties, outside of the \n approved use cases \nNot being used or transferred for purposes that are unrelated to the item's core functionality\nNot being used or transferred to determine creditworthiness or for lending purposes",
```

{% endcode %}

## Reviews

| Data field | Description               | Data type        |
| ---------- | ------------------------- | ---------------- |
| `reviews`  | List of extension reviews | Array of objects |
| `author`   | Review author             | String           |
| `date`     | Review date               | String           |
| `rating`   | Rating in the review      | Float            |
| `text`     | Review content            | String           |

**See a snippet of the dataset for reference:**

{% code title="Review sources" %}

```json
			"reviews": [
				{
					"author": "John Doe",
					"date": "Aug 1, 2023",
					"rating": 4,
					"text": "It's a good extension, but at the moment its not working or loading at all. I'd like to have easy access to watch the new en girls but it just keeps loading forever."
				}
			],
```

{% endcode %}

## Related extensions

| Data field           | Description                | Data type        |
| -------------------- | -------------------------- | ---------------- |
| `related_extensions` | List of related extensions | Array of objects |
| `name`               | Similar extension name     | String           |
| `review_count`       | Total number of reviews    | Integer          |
| `rating`             | Overall rating             | Float            |
| `url`                | Extension page URL         | String           |

**See a snippet of the dataset for reference:**

{% code title="Related extensions" %}

```json
"related_extensions": [
            {
                "name": "social-extension",
                "review_count": 260,
                "rating": 4.3,
                "url": "https://chrome.google.com/webstore/detail/social-extension/odhjcgnffdddlbagjllfbilicalpigimhdclldffd"
            }
        ]
```

{% endcode %}
