New: Try the Public Octivas Playground — test search, scrape & crawl live. No sign up needed. Try it now
Use Case
AI model training data
Collect and clean web text for pre-training, fine-tuning, and evaluation — with crawl and scrape controls that fit your stack.
Perfect for
Foundation & post-training teams
Sample diverse domains with controlled crawl breadth and depth.
RLHF & eval builders
Pair prompts with fresh web evidence for reward models and judges.
Applied research labs
Build domain-specific corpora without operating undifferentiated browser infra.
Data compliance partners
Respect robots and site policies while documenting what was collected.
How it works
Search → crawl → scrape → export to your training jobs.
Discover seed URLs
Use /search to find candidate pages and domains for your dataset spec.
Search API docsCrawl at scale
Use /crawl with include paths for sites and sections you are licensed to use.
Crawl API docsNormalize text
Use /scrape for markdown and metadata to strip boilerplate before chunking.
Scrape API docsShip to training
Push raw or cleaned text to object storage and your preprocessing jobs — track provenance per URL.
Discover seed URLs
Use /search to find candidate pages and domains for your dataset spec.
Search API docsCrawl at scale
Use /crawl with include paths for sites and sections you are licensed to use.
Crawl API docsNormalize text
Use /scrape for markdown and metadata to strip boilerplate before chunking.
Scrape API docsShip to training
Push raw or cleaned text to object storage and your preprocessing jobs — track provenance per URL.
Start free, scale as you grow
1,000 free API credits every month. No credit card required.
For new creators
- 1,000 API credits /month
- No credit card required
- Email support
Built for enthusiasts
- 4,000 API credits / month
- Higher rate limits
- Email support
For professionals
- 100,000 API credits / month
- More concurrent requests
- Email support
Enterprise
Power at your pace with custom solutions
- Scrape unlimited pages
- Custom concurrent requests
- Dedicated support & SLA
- Bulk discounts
Ready to feed your training pipeline?
Sign up in seconds. Your first 1,000 API credits are on us — no credit card required.