Docker Hub Repositories Data Dictionary
Contains explanations and examples for all data fields available in the Docker Hub Repositories dataset. All personal/company information mentioned in this data dictionary is fictional and is solely intended for illustrative purposes.
Data points in the example snippets are rearranged for better grouping. To see where a specific data point stands, check the full data sample below:
Data point | Description | Data type |
meta | Contains information about the record | |
created_at_date | The date when we first scraped the record | array of numbers |
created_at_timestamp | The date we first scraped the record (Unix time) | number |
updated_at_date | The date when we last scraped the record | array of numbers |
updated_at_timestamp | The date when we last scraped the record (Unix time) | number |
version_id | Dataset version ID | string |
source | The record source | string |
object | The data object/entity | string |
See a snippet of the dataset for reference:
A null value means that the information was not available on Docker Hub.
Data point | Description | Data type |
doc | Start of the dataset: contains first set of information points about the company | object |
source_id | Unique identifier of the record on Docker Hub | string |
id | Unique identifier of the record in our database | string |
See a snippet of the dataset for reference:
Data point | Description | Data type |
last_updated | Timestamp when the repository was last updated | string |
url | Repository information | string |
See a snippet of the dataset for reference:
Data point | Description | Data type |
publisher | Publisher's name | string |
hub_user | User tied with the repository | string |
namespace | Repository developer | string |
See a snippet of the dataset for reference:
Data point | Description | Data type |
name | Repository name | string |
See a snippet of the dataset for reference:
Data point | Description | Data type |
repository_type | Denotes repository type | string |
is_automated | Denotes if the repository automatically updates the version of the Docker image | boolean |
status | Repository status | number |
See a snippet of the dataset for reference:
Data type | Description | Data type |
star_count | Marks the number of stars the repository has | number |
pull_count | Marks the number of downloads of the repository | number |
See a snippet of the dataset for reference:
Data point | Description | Data type |
description | Introduction to the repository | string |
full_description | Repository description Note: contains control characters | string |
See a snippet of the dataset for reference:
Data type | Description | Data type |
permissions | Marks the permissions granted within the repository | object |
read | Marks if a Docker Hub user can see the repository | boolean |
write | Marks if a Docker Hub user can write in the repository post | boolean |
admin | Marks if a Docker Hub user has admin privileges within the repository | boolean |
See a snippet of the dataset for reference: