GitHub Users
Data Dictionary
This dictionary contains explanations and examples of all data fields available in the GitHub Users dataset.

All personal/company information mentioned within this context is entirely fictional and is solely intended for illustrative purposes.

Sections:

- Metadata
- Developer's profile details
- Developer's repositories
- Starred repositories
- Developer's subscriptions

The data points in the example snippets have been rearranged for better grouping. To see where a specific data point stands, check the full data sample here: https://docs.coresignal.com/source-documentation/github-users-data-sample

Metadata

Record metadata

| Data point | Description | Data type |
| --- | --- | --- |
| _meta | Contains information about the record | Object |
| created_at_date | The date when we first scraped the record | Array of numbers (integers) |
| created_at_timestamp | The date when we first scraped the record (Unix time) | Float |
| updated_at_date | The date when we last scraped the record | Array of numbers (integers) |
| updated_at_timestamp | The date when we last scraped the record (Unix time) | Float |
| version_id | Dataset version ID | String |
| source | The record source | String |
| object | The data object/entity | String |
| is_deleted | Marks whether the user is still available on GitHub | Boolean |

See a snippet of the dataset for reference (Metadata):

    "_meta": {
        "source": "github",
        "object": "user",
        "created_at_date": [
            2021,
            10,
            24
        ],
        "created_at_timestamp": 1635086276.668947,
        "updated_at_date": [
            2024,
            5,
            1
        ],
        "updated_at_timestamp": 1714523931.292325,
        "version_id": "e0f2c272"
    },

User profile metadata

| Data point | Description | Data type |
| --- | --- | --- |
| doc | Dataset starting point | Object |
| source_id | User profile identifier on GitHub | String |
| id | Record identifier in our database | String |
| site_admin | Indicates if the user is the site admin | Boolean |
| type | Record type | String |

See snippets of the dataset for reference (User profile metadata):

    "doc": {
        "source_id": 3487953,
        "id": "github_people_3487953",
        "site_admin": false,
        "type": "user",

| Data point | Description | Data type |
| --- | --- | --- |
| events_url | GitHub REST API response | String |
| node_id | Identification key assigned by GitHub REST API | String |

See snippets of the dataset for reference (User profile metadata):

    "events_url": "https://api.github.com/users/dev/events{/privacy}",
    "node_id": "mdq6vxnlcjm0odc5ntm=",

Developer's profile details

| Data point | Description | Data type |
| --- | --- | --- |
| image | Developer's avatar/logo | String |
| bio | Developer's bio. Note: may contain control characters | String |
| url | Developer's GitHub profile URL | String |
| location | Developer's location | String |
| username | Developer's username | String |
| name | Developer's name. Note: not necessarily the same as the username | String |

See a snippet of the dataset for reference (Developer profile details):

    "image": "https://avatars.githubusercontent.com/u/34879666553?v=4",
    "bio": "Example bio",
    "url": "https://github.com/dev",
    "location": "United States",
    "username": "dev",
    "name": "John Doe",

Public contact information

| Data point | Description | Data type |
| --- | --- | --- |
| contact_info | Publicly accessible contact information | Object |
| blog | Developer's blog | String |
| twitter | Developer's Twitter handle | String |

See a snippet of the dataset for reference (Contact information):

    "contact_info": {
        "blog": "https://example-blog.be",
        "twitter": "example_handle"
    },

Developer affiliation

| Data point | Description | Data type |
| --- | --- | --- |
| company | Company affiliated with the developer | String |
| hireable | Marks if the developer is open for hire | Boolean/null |

See snippets of the dataset for reference (Developer affiliation):

    "company": "Development Company",
    "hireable": true,

Organizations

| Data point | Description | Data type |
| --- | --- | --- |
| organization | Organizations the developer is connected to | Array of objects |
| description | Organization description. Note: may contain control characters | String |
| source_id | Organization identification key on GitHub | String |
| username | Organization name | String |
| node_id | Organization identifier assigned by GitHub REST API | String |
| url | Information on the organization returned by the GitHub REST API | String |

See a snippet of the dataset for reference (Organization):

    "organization": [
        {
            "description": "Example organization description",
            "source_id": 70442962,
            "username": "it-organization",
            "node_id": "mdeyok9yz2fuaxphdglvbjcwndqyotyy",
            "url": "https://api.github.com/orgs/it-organization"
        }
    ],

Following

| Data point | Description | Data type |
| --- | --- | --- |
| follower_count | Developer's follower count | Integer |
| following_count | Number of people the developer follows | Integer |

See a snippet of the dataset for reference (Following):

    "follower_count": 14,
    "following_count": 28,

| Data point | Description | Data type |
| --- | --- | --- |
| followed_by | Developer's followers | Array of objects |
| username | Follower's username | String |
| source_id | User identifier on GitHub | Integer |
| url | Follower's GitHub profile URL | String |

See a snippet of the dataset for reference (Followers):

    "followed_by": [
        {
            "username": "data-science-dev",
            "source_id": 11396652,
            "url": "https://github.com/data-science-dev"
        }
    ],

| Data point | Description | Data type |
| --- | --- | --- |
| is_following | Followed users | Array of objects |
| username | Followee's username | String |
| source_id | User identifier on GitHub | Integer |
| url | Followee's GitHub profile URL | String |

See a snippet of the dataset for reference (Following):

    "is_following": [
        {
            "username": "developer",
            "source_id": 630670,
            "url": "https://github.com/developer"
        }
    ]

Gists and repos

| Data point | Description | Data type |
| --- | --- | --- |
| public_gist_count | Number of public gists by the developer | Integer |
| public_repo_count | Number of the developer's public repositories | Integer |

See a snippet of the dataset for reference (Gists and repos):

    "public_gist_count": 0,
    "public_repo_count": 2,

Developer's repositories

| Data point | Description | Data type |
| --- | --- | --- |
| repo | Developer's public repositories | Array of objects |
| disabled | Indicates if the repository was disabled at the time of the last scrape | Boolean |
| archived | Indicates if the repository is archived and no longer accessible | Boolean |
| created_at | Timestamp when the repository was created, in ISO 8601 format | String (date) |
| default_branch | Default branch title | String |
| description | Repository description. Note: may contain control characters | String |
| fork | Marks if the repository in a record is a copy of another repository | Boolean |
| fork_count | Number of repository copies | Integer |
| forked_from | Link to the original repository | String |
| has_downloads | Indicates if other users have downloaded the repository | Boolean |
| has_issues | Marks if the repository has the issues section enabled | Boolean |
| has_pages | Marks if the repository has the pages section enabled | Boolean |
| has_projects | Marks if the repository has the projects section enabled | Boolean |
| has_wiki | Marks if the repository has the wiki section enabled | Boolean |
| website | Project website | String |
| url | Repository URL | String |
| source_id | Repository identifier on GitHub | Integer |

See a snippet of the dataset for reference (Developer's repositories):

    "repo": [
        {
            "disabled": false,
            "archived": false,
            "created_at": "2020-09-25T14:00:48Z",
            "default_branch": "master",
            "description": "Fork of example repo",
            "fork": false,
            "fork_count": 0,
            "forked_from": "https://www.github.com/example-repo",
            "has_downloads": true,
            "has_issues": true,
            "has_pages": false,
            "has_projects": true,
            "has_wiki": true,
            "website": "www.example-repo-website.com",
            "url": "https://github.com/dev/example-repo-2",
            "source_id": 298588957,

| Data point | Description | Data type |
| --- | --- | --- |
| open_issues_count | Number of open issues in the repository | Integer |
| pushed_at | Timestamp when the repository was published, in ISO 8601 format | String (date) |
| size | Repository size | Integer |
| stargazer_count | Number of people who have starred the repository | Integer |
| updated_at | Timestamp when the repository was updated, in ISO 8601 format | String |
| watcher_count | Number of people who are following the repository updates | Integer |
| topics | Repository topics | Array of strings |

See a snippet of the dataset for reference (Developer's repositories):

    "open_issues_count": 47,
    "pushed_at": "2022-11-01T18:21:42Z",
    "size": 15938,
    "stargazer_count": 7249,
    "updated_at": "2021-05-18T03:32:01Z",
    "watcher_count": 7249,
    "topics": [
        "v2-ui",
        "x-ui",
        "xray",
        "xray-core",
        "xray-panel"
    ]

Languages in the repo

| Data point | Description | Data type |
| --- | --- | --- |
| language | Main programming language in the repository | String |
| languages_distribution | The distribution of languages in the repository (percentage) | Object |

See a snippet of the dataset for reference (Programming languages):

    "language": "javascript",
    "languages_distribution": {
        "javascript": 58.2,
        "vue": 37.9,
        "scss": 3.0,
        "html": 0.9
    },

Repository details

| Data point | Description | Data type |
| --- | --- | --- |
| repo_name | Repository title | String |
| repo_owner | Repository owner | String |
| name | Repository name | String |
| node_id | Repository identifier assigned by GitHub REST API | String |

See a snippet of the dataset for reference (Repository details):

    "repo_name": "example-repo-2",
    "repo_owner": "dev",
    "name": "example-repo-2",
    "node_id": "mdewoljlcg9zaxrvcnkyotg1odg5ntc=",

Used licenses

| Data point | Description | Data type |
| --- | --- | --- |
| license | Open source licenses the repository uses | Object |
| key | GitHub URL identifying the license | String |
| name | License name | String |
| spdx_id | SPDX license ID | String |
| url | URL redirecting to GitHub info on licensing | String |
| node_id | License identifier assigned by GitHub REST API | String |

See a snippet of the dataset for reference (License):

    "license": {
        "key": "example-tool",
        "name": "example rool license 2.0",
        "spdx_id": "example-tool-2.0",
        "url": "https://api.github.com/licenses/example-tool-2.0",
        "node_id": "mdc6tgljzw5zzti="
    },

Repository owner

| Data point | Description | Data type |
| --- | --- | --- |
| owner | Repository owner | Object |
| image | Developer's logo/avatar | String |
| url | Developer's profile | String |
| source_id | Developer's identifier on GitHub | Integer |
| username | Developer's username | String |
| node_id | Developer's identifier assigned by GitHub REST API | String |
| site_admin | Marks if the user is the site admin | Boolean |
| type | User's profile type | String |

See a snippet of the dataset for reference (Repository owner):

    "owner": {
        "image": "https://avatars.githubusercontent.com/u/3487953?v=4",
        "url": "https://github.com/dev",
        "source_id": 3487953,
        "username": "dev",
        "node_id": "mdq6vxnlcjm0odc5ntm=",
        "site_admin": false,
        "type": "user"
    },

Starred repositories

| Data point | Description | Data type |
| --- | --- | --- |
| starred | Repositories the developer starred | Array of objects |
| disabled | Indicates if the repository was disabled at the time of the last scrape | Boolean |
| archived | Shows if the repository is archived and no longer accessible | Boolean |
| created_at | Timestamp when the repository was created, in ISO 8601 format | String |
| default_branch | Default branch title | String |
| description | Repository description. Note: may contain control characters | String |
| fork | Marks if the repository in a record is a copy of another repository | Boolean |
| fork_count | Number of repository copies | Integer |
| forked_from | Link to the original repository | String |
| has_downloads | Indicates if other users have downloaded the repository | Boolean |
| has_issues | Marks if the repository has the issues section enabled | Boolean |
| has_pages | Marks if the repository has the pages section enabled | Boolean |
| has_projects | Marks if the repository has the projects section enabled | Boolean |
| has_wiki | Marks if the repository has the wiki section enabled | Boolean |
| website | Project website | String |
| url | Repository URL | String |
| source_id | Repository identifier on GitHub | Integer |

See a snippet of the dataset for reference (Starred repositories):

    "starred": [
        {
            "disabled": false,
            "archived": false,
            "created_at": "2019-11-15T18:32:42Z",
            "default_branch": "main",
            "description": "Example tool for API",
            "fork": true,
            "fork_count": 3309,
            "forked_from": "https://www.github.com/example-tool-for-api",
            "has_downloads": true,
            "has_issues": true,
            "has_pages": false,
            "has_projects": false,
            "has_wiki": false,
            "website": "https://example-api-tool.dev",
            "url": "https://github.com/example-api-tool",
            "source_id": 221981891,
            "language": "typescript",

| Data point | Description | Data type |
| --- | --- | --- |
| open_issues_count | Number of open issues in the repository | Integer |
| pushed_at | Timestamp when the repository was published, in ISO 8601 format | String (date) |
| size | Repository size | Integer |
| stargazer_count | Number of people who have starred the repository | Integer |
| updated_at | Timestamp when the repository was updated, in ISO 8601 format | String (date) |
| watcher_count | Number of people who are following the repository updates | Integer |
| topics | Repository topics | Array of strings |

See a snippet of the dataset for reference (Starred repositories):

            "open_issues_count": 0,
            "pushed_at": "2020-09-25T14:03:40Z",
            "size": 232,
            "stargazer_count": 0,
            "updated_at": "2020-09-25T14:00:48Z",
            "watcher_count": 0,
            "topics": [
                "android",
                "android-apps"
            ]
        }

Languages in the repo

| Data point | Description | Data type |
| --- | --- | --- |
| language | Main programming language in the repository | String |
| languages_distribution | Languages and their distribution in the repository by percentage | Object |

See a snippet of the dataset for reference (Programming languages):

    "language": "python",
    "languages_distribution": {
        "python": 95.3,
        "starlark": 4.7
    },

Repository details

| Data point | Description | Data type |
| --- | --- | --- |
| repo_name | Repository title | String |
| repo_owner | Repository owner | String |
| name | Repository name | String |
| node_id | Repository identifier assigned by GitHub REST API | String |

See a snippet of the dataset for reference (Repository details):

    "repo_name": "example-repo-2",
    "repo_owner": "dev",
    "name": "example-repo-2",
    "node_id": "mdewoljlcg9zaxrvcnkyotg1odg5ntc=",

Used licenses

| Data point | Description | Data type |
| --- | --- | --- |
| license | Open source licenses the repository uses | Object |
| key | GitHub URL identifying the license | String |
| name | License name | String |
| spdx_id | SPDX license ID | String |
| url | URL redirecting to GitHub info on licensing | String |
| node_id | License identifier assigned by GitHub REST API | String |

See a snippet of the dataset for reference (License):

    "license": {
        "key": "example-tool",
        "name": "example rool license 2.0",
        "spdx_id": "example-tool-2.0",
        "url": "https://api.github.com/licenses/example-tool-2.0",
        "node_id": "mdc6tgljzw5zzti="
    },

Repository owner

| Data point | Description | Data type |
| --- | --- | --- |
| owner | Repository owner | Object |
| image | Developer's logo/avatar | String |
| url | Developer's profile | String |
| source_id | Developer's identifier on GitHub | Integer |
| username | Developer's username | String |
| node_id | Developer's identifier assigned by GitHub REST API | String |
| site_admin | Marks if the user is the site admin | Boolean |
| type | User's profile type | String |

See a snippet of the dataset for reference (Developer information):

    "owner": {
        "image": "https://avatars.githubusercontent.com/u/61577764722?v=4",
        "url": "https://github.com/big-it-company",
        "source_id": 6154722,
        "username": "big-it-company",
        "node_id": "mdeyok9yz2fuaxphdglvbjyxntq3mji=",
        "site_admin": false,
        "type": "organization"
    },

Developer's subscriptions

| Data point | Description | Data type |
| --- | --- | --- |
| subscription | Repositories the developer is subscribed to | Array of objects |
| disabled | Indicates if the repository was disabled at the time of the last scrape | Boolean |
| archived | Indicates if the repository is archived and no longer accessible | Boolean |
| created_at | Timestamp when the repository was created, in ISO 8601 format | String (date) |
| default_branch | Default branch title | String |
| description | Repository description. Note: may contain control characters | String |
| fork | Marks if the repository in a record is a copy of another repository | Boolean |
| fork_count | Number of repository copies | Integer |
| forked_from | Link to the original repository | String |
| has_downloads | Indicates if other users have downloaded the repository | Boolean |
| has_issues | Marks if the repository has the issues section enabled | Boolean |
| has_pages | Marks if the repository has the pages section enabled | Boolean |
| has_projects | Marks if the repository has the projects section enabled | Boolean |
| has_wiki | Marks if the repository has the wiki section enabled | Boolean |
| website | Project website | String |
| url | Repository URL | String |
| source_id | Repository identifier on GitHub | Integer |

See a snippet of the dataset for reference (Developer's subscriptions):

    "subscription": [
        {
            "disabled": false,
            "archived": false,
            "created_at": "2019-05-29T08:01:32Z",
            "default_branch": "main",
            "description": "Example tool for data science",
            "fork": false,
            "fork_count": 263,
            "forked_from": null,
            "has_downloads": true,
            "has_issues": true,
            "has_pages": true,
            "has_projects": true,
            "has_wiki": true,
            "website": "https://github.io/developer/example-data-tool/",
            "url": "https://github.com/developer/example-data-tool",
            "source_id": 189179081,

| Data point | Description | Data type |
| --- | --- | --- |
| open_issues_count | Number of open issues in the repository | Integer |
| pushed_at | Timestamp when the repository was published, in ISO 8601 format | String (date) |
| size | Repository size | Integer |
| stargazer_count | Number of people who have starred the repository | Integer |
| updated_at | Timestamp when the repository was updated, in ISO 8601 format | String (date) |
| watcher_count | Number of people who are following the repository updates | Integer |
| topics | Repository topics | Array of strings |

See a snippet of the dataset for reference (Developer's subscriptions):

    "open_issues_count": 47,
    "pushed_at": "2022-11-01T18:21:42Z",
    "size": 15938,
    "stargazer_count": 7249,
    "updated_at": "2021-05-18T03:32:01Z",
    "watcher_count": 7249,
    "topics": [
        "v2-ui",
        "x-ui",
        "xray",
        "xray-core",
        "xray-panel"
    ]

Languages in the repo

| Data point | Description | Data type |
| --- | --- | --- |
| language | Main programming language in the repository | String |
| languages_distribution | Languages and their distribution in the repository by percentage | Object |

See a snippet of the dataset for reference (Language):

    "language": "javascript",
    "languages_distribution": {
        "javascript": 99.9,
        "html": 0.1,
        "css": 0.0
    },

Repository details

| Data point | Description | Data type |
| --- | --- | --- |
| repo_name | Repository title | String |
| repo_owner | Repository owner | String |
| name | Repository name | String |
| node_id | Repository identifier assigned by GitHub REST API | String |

See a snippet of the dataset for reference (Repository details):

    "repo_name": "example-data-tool",
    "repo_owner": "developer",
    "name": "example-data-tool",
    "node_id": "mdewoljlcg9zaxrvcnkxodkxnzkwode=",

Used licenses

| Data point | Description | Data type |
| --- | --- | --- |
| license | Open source licenses the repository uses | Object |
| key | GitHub URL identifying the license | String |
| name | License name | String |
| spdx_id | SPDX license ID | String |
| url | URL redirecting to GitHub info on licensing | String |
| node_id | License identifier assigned by GitHub REST API | String |

See a snippet of the dataset for reference (License):

    "license": {
        "key": "example-tool",
        "name": "example rool license 2.0",
        "spdx_id": "example-tool-2.0",
        "url": "https://api.github.com/licenses/example-tool-2.0",
        "node_id": "mdc6tgljzw5zzti="
    },

Repository owner

| Data point | Description | Data type |
| --- | --- | --- |
| owner | Repository owner | Object |
| image | Developer's logo/avatar | String |
| url | Developer's profile | String |
| source_id | Developer's identifier on GitHub | Integer |
| username | Developer's username | String |
| node_id | Developer's identifier assigned by GitHub REST API | String |
| site_admin | Marks if the user is the site admin | Boolean |
| type | User's profile type | String |

See a snippet of the dataset for reference (Owner):

    "owner": {
        "image": "https://avatars.githubusercontent.com/u/61577764722?v=4",
        "url": "https://github.com/big-it-company",
        "source_id": 6154722,
        "username": "big-it-company",
        "node_id": "mdeyok9yz2fuaxphdglvbjyxntq3mji=",
        "site_admin": false,
        "type": "organization"
    },
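Working with the date and language fields

The dictionary describes three date representations (an array of integers, a Unix-time float, and an ISO 8601 string) and a percentage-based language distribution. The sketch below shows one possible way to normalize them in Python. It is only an illustration: it assumes the field names are written with underscores (for example `created_at_date` and `languages_distribution`), that Unix timestamps are in UTC, and the helper names (`date_from_array`, `datetime_from_unix`, `parse_iso8601`, `dominant_language`) are hypothetical and not part of the dataset or any Coresignal tooling.

```python
from datetime import date, datetime, timezone

# Sample values copied from the snippets in this dictionary.
# Assumption: keys use underscores, e.g. "created_at_date".
meta = {
    "created_at_date": [2021, 10, 24],
    "created_at_timestamp": 1635086276.668947,
}
repo = {
    "created_at": "2020-09-25T14:00:48Z",
    "language": "javascript",
    "languages_distribution": {
        "javascript": 58.2,
        "vue": 37.9,
        "scss": 3.0,
        "html": 0.9,
    },
}


def date_from_array(parts):
    """Turn a [year, month, day] array into a date object."""
    year, month, day = parts
    return date(year, month, day)


def datetime_from_unix(timestamp):
    """Turn a Unix-time float into a timezone-aware datetime (UTC assumed)."""
    return datetime.fromtimestamp(timestamp, tz=timezone.utc)


def parse_iso8601(value):
    """Parse an ISO 8601 string such as '2020-09-25T14:00:48Z'."""
    # Older Python versions do not accept the trailing 'Z' in fromisoformat,
    # so it is replaced with an explicit UTC offset first.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def dominant_language(repository):
    """Return the language with the largest share, falling back to 'language'."""
    distribution = repository.get("languages_distribution") or {}
    if not distribution:
        return repository.get("language")
    return max(distribution, key=distribution.get)


first_scraped = date_from_array(meta["created_at_date"])
first_scraped_at = datetime_from_unix(meta["created_at_timestamp"])
repo_created_at = parse_iso8601(repo["created_at"])
top_language = dominant_language(repo)
```

With the sample values above, the date array and the Unix timestamp resolve to the same calendar day, which can serve as a quick consistency check when loading records.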