Scrape any page.
Get clean data.

Five output formats, smart routing, native action macros, and an async pattern that handles batches of any size. One endpoint family for everything from a single URL to a hundred million.

Five output formats

Markdown for LLMs, HTML for fidelity, plain text for indexing, link arrays for crawling, and the original response for diagnostics. Choose per request. No transcoding pipeline needed.
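Switching output is a one-field change. A minimal sketch that asks for the link array instead of markdown, using the same /v1/scrape endpoint shown further down:

curl -X POST https://api.datasonar.dev/v1/scrape \
  -H "Authorization: Bearer osk_..." \
  -H "Content-Type: application/json" \
  -d '{"url": "https://example.com", "format": "links"}'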

Smart routing

The smart-scrape endpoint detects whether a page needs JavaScript and routes accordingly. Static pages skip the browser and return in milliseconds. Dynamic pages get the full stealth browser treatment automatically.
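A minimal smart-scrape sketch, assuming /v1/scrape/smart accepts the same url and format fields as /v1/scrape:

curl -X POST https://api.datasonar.dev/v1/scrape/smart \
  -H "Authorization: Bearer osk_..." \
  -H "Content-Type: application/json" \
  -d '{"url": "https://example.com", "format": "markdown"}'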

Action macros

Drive clicks, typing, waits, scrolls, and form submissions from a JSON array. No Puppeteer or Playwright code. The same description works for every site.
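A sketch of a form-submission macro. Only the click and wait action types appear in the example further down; the type action and its selector and value fields are illustrative, not a confirmed schema:

curl -X POST https://api.datasonar.dev/v1/scrape \
  -H "Authorization: Bearer osk_..." \
  -H "Content-Type: application/json" \
  -d '{
    "url": "https://example.com/search",
    "format": "markdown",
    "actions": [
      {"type": "type", "selector": "input[name=q]", "value": "quarterly revenue"},
      {"type": "click", "selector": "button[type=submit]"},
      {"type": "wait", "ms": 1500}
    ]
  }'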

Async with webhooks

Fire large batches, walk away, get a clean POST to your endpoint when each job completes. Built for production pipelines that should not hold connections open.
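A sketch of an async submission with a webhook, assuming the batch endpoint takes a urls array and a webhook_url field; both field names are illustrative rather than a confirmed contract:

curl -X POST https://api.datasonar.dev/v1/scrape/batch \
  -H "Authorization: Bearer osk_..." \
  -H "Content-Type: application/json" \
  -d '{
    "urls": ["https://example.com/a", "https://example.com/b"],
    "format": "markdown",
    "webhook_url": "https://yourapp.example/hooks/datasonar"
  }'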

Examples that work today.

Drop any of the snippets on this page into a terminal or notebook.

curl -X POST https://api.datasonar.dev/v1/scrape \
  -H "Authorization: Bearer osk_..." \
  -H "Content-Type: application/json" \
  -d '{
    "url": "https://example.com",
    "format": "markdown",
    "stealth": true,
    "actions": [
      {"type": "click", "selector": ".load-more"},
      {"type": "wait", "ms": 2000}
    ]
  }'

Scraping questions

What formats does /v1/scrape return?
Five: markdown for LLM-friendly text, html for raw page source, text for stripped plain text, links for the URL graph extracted from the page, and original for the raw response without transformation. Set the format field in the request body.
When should I use /v1/scrape vs /v1/scrape/smart?
Use smart when you do not know whether the target page needs JavaScript. It is faster on static pages and falls back to the full browser when needed. Use scrape directly when you already know the page is dynamic, or when you need to pass a custom action macro.
How big can a batch be?
Up to 100 URLs per synchronous batch call. For larger jobs use the async batch endpoint, which queues the work and posts to a webhook when each chunk completes — no upper bound beyond your monthly quota.
Does DataSonar handle JavaScript-rendered single page apps?
Yes. The default browser engine waits for network idle and DOM stability before extracting. You can also pass wait_until values for finer control and use action macros for click-and-wait patterns common to SPAs.
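A sketch that tightens the wait condition for a heavy SPA. The wait_until parameter is the one mentioned above; the "networkidle" value here is illustrative:

curl -X POST https://api.datasonar.dev/v1/scrape \
  -H "Authorization: Bearer osk_..." \
  -H "Content-Type: application/json" \
  -d '{"url": "https://app.example.com/dashboard", "format": "markdown", "wait_until": "networkidle"}'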
What about pages that require login?
Use the action macro to drive the login form, then continue with subsequent steps in the same session. For pages that need a persistent authenticated session across many requests, talk to us about enterprise options.
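A sketch of a login macro; the type action name and its value field are illustrative, not a confirmed schema, and the credentials are placeholders:

curl -X POST https://api.datasonar.dev/v1/scrape \
  -H "Authorization: Bearer osk_..." \
  -H "Content-Type: application/json" \
  -d '{
    "url": "https://example.com/login",
    "format": "markdown",
    "actions": [
      {"type": "type", "selector": "#email", "value": "user@example.com"},
      {"type": "type", "selector": "#password", "value": "..."},
      {"type": "click", "selector": "button[type=submit]"},
      {"type": "wait", "ms": 3000}
    ]
  }'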
How do I scrape paginated content?
Two patterns. For numbered pagination, send the page-2, page-3, page-N URLs as a batch. For infinite-scroll pagination, use the action macro to scroll and wait until no new items load (or the load-more button stops appearing), as in the sketch below.
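A scroll-and-wait sketch for an infinite feed. The scroll action name and its "to" field are illustrative; repeat the pair as many times as the feed needs:

curl -X POST https://api.datasonar.dev/v1/scrape \
  -H "Authorization: Bearer osk_..." \
  -H "Content-Type: application/json" \
  -d '{
    "url": "https://example.com/feed",
    "format": "markdown",
    "actions": [
      {"type": "scroll", "to": "bottom"},
      {"type": "wait", "ms": 1500},
      {"type": "scroll", "to": "bottom"},
      {"type": "wait", "ms": 1500}
    ]
  }'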
Is there a request timeout limit?
Each request has a per-call timeout parameter you can set up to 120 seconds. The default is 30 seconds for /v1/scrape and 120 seconds for /v1/scrape/batch.
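A sketch that raises the per-call ceiling for a slow page, assuming the parameter is named timeout and takes seconds:

curl -X POST https://api.datasonar.dev/v1/scrape \
  -H "Authorization: Bearer osk_..." \
  -H "Content-Type: application/json" \
  -d '{"url": "https://example.com/slow-report", "format": "markdown", "timeout": 120}'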

Start pulling clean data in minutes.

1,000 requests free every month. No credit card required.