New: Try the Public Octivas Playground — test search, scrape & crawl live. No sign up needed. Try it now

Use Case

AI model training data

Collect and clean web text for pre-training, fine-tuning, and evaluation — with crawl and scrape controls that fit your stack.

Perfect for

Foundation & post-training teams

Sample diverse domains with controlled crawl breadth and depth.

RLHF & eval builders

Pair prompts with fresh web evidence for reward models and judges.

Applied research labs

Build domain-specific corpora without operating undifferentiated browser infra.

Data compliance partners

Respect robots and site policies while documenting what was collected.

How it works

Search → crawl → scrape → export to your training jobs.

Discover seed URLs

Use /search to find candidate pages and domains for your dataset spec.

Search API docs

Crawl at scale

Use /crawl with include paths for sites and sections you are licensed to use.

Crawl API docs

Normalize text

Use /scrape for markdown and metadata to strip boilerplate before chunking.

Scrape API docs

Ship to training

Push raw or cleaned text to object storage and your preprocessing jobs — track provenance per URL.

Start free, scale as you grow

1,000 free API credits every month. No credit card required.

Free
Free/month

For new creators

  • 1,000 API credits /month
  • No credit card required
  • Email support
Hobby
$19/month

Built for enthusiasts

MonthlyYearly
  • 4,000 API credits / month
  • Higher rate limits
  • Email support
Standard
$89/month

For professionals

MonthlyYearly
  • 100,000 API credits / month
  • More concurrent requests
  • Email support

Enterprise

Power at your pace with custom solutions

Custom credits
Custom
Contact sales
  • Scrape unlimited pages
  • Custom concurrent requests
  • Dedicated support & SLA
  • Bulk discounts

Ready to feed your training pipeline?

Sign up in seconds. Your first 1,000 API credits are on us — no credit card required.