Overview

Product Comparison

6min

Not sure which product to choose? Compare the differences:

  1. Record count. We offer millions of records going back to 2016.
  2. File formats and size. We offer data in JSONL, Parquet, and CSV formats.
  3. Processing levels. We offer Multi-source, Clean, and Base data.
  4. Delivery formats. We provide data as datasets (flat files) or data API.

1. Record count

Coresignal's products have various processing levels.

Data source

Record count

Scraping since

Base Company Data

73M+

July 2016

Clean Company Data

40M+

August 2023

Multi-source Company Data

39M+

July 2016

Base Employee Data

773M+

July 2016

Clean Employee Data

723M+

October 2023

Multi-source Employee Data

725M+

July 2016

Base Job Posting Data

348M+

August 2020

2. File formats and size

Coresignal provides data in JSONL, Parquet, and CSV formats. Each format will result in a differently sized file.

JSONL/Parquet or CSV?

  • Data in JSONL/Parquet format is sorted by country. In the JSONL/Parquet format delivery, data is organized by country, with each country having its own folder. Inside each folder, you'll find multiple .json.gz or .gz.parquet files. A single JSONL file can contain up to 100,000 records.
  • Data in CSV format delivery is sorted by entity type. In the CSV format delivery, data is organized by entity type, following a schema-based structure. Each folder represents a different entity, and related information is stored separately, requiring joins to establish connections between datasets. A single CSV file can contain up to 10 million records.


3. Processing levels

Coresignal provides three types of processing levels: Multi-source, Clean, and Base data.

Multi-source, Clean, or Base?

  • Multi-source datasets contain cleaned and enriched data combining information from multiple sources.
  • Clean datasets are derived from our Base data and cleaned to ensure the best quality.
  • Base datasets freshly scraped and structured/updated for easier use.


4. Delivery formats

Coresignal delivers data as datasets (flat files) or data APIs. Each method has it's benefits – please check the information below.

Dataset or data API?

  • Datasets. Good choice if you need a large dataset for a large-scale project or seek to use years of historical data. You can choose to get regular daily, weekly or monthly updates or one-time delivery.
  • Data APIs. You can set up data API and access fresh records on demand. Good choice if you want to integrate data into your product, easily enrich data, and need more flexibility accessing records.

Incremental or full delivery?

Incremental delivery includes records that were updated that particular month, while full delivery will include all available records.