Formats and downloads

What are the data download options?

You can download data records by either calling our APIs directly or simply using our self-service platform.

If you collect records one by one, you will get them in JSON format. If you use the bulk download option, you can select JSONL or CSV format.

The enrichment tool allows you to upload and get data in CSV format.


What are the differences between JSON and JSONL files?

JSON (JavaScript Object Notation) files: JSON files store data as a single, hierarchical structure, typically as an object or an array within curly or simple brackets. Data in JSON files is one complete data structure.

JSONL (JSON Lines) files: JSONL files store data as individual JSON objects, each on a new line. Each line is a valid JSON object, but the file as a whole is not a valid JSON object because the objects are not enclosed within a larger array or object.

JSONL format is ideal for handling large datasets as it allows for incremental processing of data, which is useful in machine learning datasets, or when working with large volumes of records.

Please note that if you collect records one by one, you will get them in JSON format. For easier data processing, if you use the bulk download option, you can download data in JSONL or CSV format.


What is the difference between JSONL and CSV format in bulk downloads?

If you’re downloading data in bulk using the self-service platform, you can choose between JSONL and CSV formats, but the structure of these data files will differ.

JSONL files contain full historical data, while CSV files contain the most recent data. This difference affects collections of specific data fields.

Employee data: experience, education, and certifications. CSV files contain one most recent record from each collection, while JSONL files contain all data about experience, education, and certifications.

For example, if an employee is now working at Tesla and used to work at Google, the CSV file will only show Tesla.

Company data: locations, specialties, funding rounds. CSV files contain one most recent record from each collection, while JSONL files contain all data about locations, specialties, and funding rounds.

Job posting data: industries list, functions, and functions lists. CSV files contain one most recent record from each collection, while JSONL files contain full industries lists, functions, and functions lists.

To get a better understanding of these differences, download a data sample.


How do I open JSONL files?

This file format is designed for efficient processing of both large-scale and smaller datasets.

JSONL files can be opened using programming languages like Python, with JSONL-compatible libraries, or with text editors, such as Visual Studio Code or Sublime Text.


Didn't find what you were looking for?

Last updated

Was this helpful?