Bulk Collect

Overview

Discover the Bulk Collect (Bulk API) capabilities and explore potential uses for efficiently retrieving company data in batches. Find all Bulk Collect related information in the following topic:

Endpoints

Bulk Collect (Bulk API) is an extension of Clean Company API and includes four POST and two GET endpoints.

Collect company data in bulk using an company ID list, or Elasticsearch DSL schema that is already used in the Clean Comapny API.

Request type
Endpoint

POST

/v2/data_requests/company_clean/ids

POST

/v2/data_requests/company_clean/es_dsl

POST

/v2/data_requests/company_clean/shorthand_names

POST

/v2/data_requests/company_clean/urls

GET

/v2/data_requests/{data_request_id}/files

GET

/v2/data_requests/{data_request_id}/files/{file_name}

Limiting returned record count

Include the parameter "limit": int to control the number of records returned by your queries in /v2/data_requests/company_clean/es_dsl endpoints.

Request example to retrieve five records

curl -X 'POST' \
  'https://api.coresignal.com/cdapi/v2/data_requests/company_clean/es_dsl' \
  -H 'accept: application/json' \
  -H 'apikey: {API Key}' \
  -H 'Content-Type: application/json' \
  -d '{
  "webhook_url": "{optional_webhook_url}",
  "limit": 5,
  "es_dsl_query": '{
   "query":{
      "bool":{
         "must":[
            {
               "query_string":{
                  "query":"2023",
                  "default_field":"founded",
                  "default_operator":"and"
               }
            }
         ]
      }
   }
}'

Credits

Your credits for Clean Company API will also apply to Bulk Collect data collection requests.

Learn about the credits in Bulk Collect usage in the general Bulk Collect topic.

Rate limits

Bulk Collect endpoints have limited requests allowed per second. Learn about credits in Bulk Collect usage in the main Clean Company API topic.

Webhooks

POST endpoints allow you to add webhooks and get notified when your data request is ready.

Keep in mind that webhook_url is optional.

Elasticsearch DSL template
{
  "webhook_url": "{optional_webhook_url}",
  "es_dsl_query": {}
}

Last updated

Was this helpful?