Release v2.1
Notes
We’re excited to unveil Retrieve, a powerful new capability that seamlessly bridges web crawling and indexing. With Retrieve, every crawled resource can now be automatically added to an index, unlocking instant compatibility with Retrieval-Augmented Generation (RAG) workflows.
This means you can now crawl, index, and query your data - all within WDS! Whether you need to search across a single domain or your entire crawled dataset, WDS has you covered. You can access it effortlessly via REST API or through MCP Tools bringing unified data navigation, intelligent search, and deep analysis into one integrated ecosystem.
Together, these features make Web Data Source (WDS) a complete solution for exploring, searching, and leveraging your organization’s knowledge with both public and private LLMs.
For technical details, explore the documentation for the components below to learn how everything fits together.
Added
- Retrieval
- New APIs
- New parameters
- Convert parameter has been added for all scrape-based functions
- New MCP tools
- UpsertJobConfig
- StartExistingJob
- CrawlMdrConfigSetMaxDepth
- JobFetch
- CrawlAllMdr
- GetJobsInfo
- GetJobConfig
- JobConfigRetrievalConfigSetEnrollInIndex
- JobConfigRetrievalConfigSetMaxTokensPerChunk
- JobConfigRetrievalConfigUpsertContentScope
- JobConfigRetrievalConfigSetWaitForEnrollment
- JobConfigRetrievalConfigSetForce
- New MCP propmpts
Modified
- Helm Chart:
- WDS MCP tools registration
- All WDS MCP tools are now registering with the
Wdsprefix in their names
- All WDS MCP tools are now registering with the
Removed
Nothing
Known Issues
| Issue | Mitigation |
|---|---|
| Core Services use the libtarry Grpc.Core of version 2.46.6 (the latest one) that has the following vulnerability GHSA-9hxf-ppjv-w6rq | WDS doesn’t expose gRPC endpoints so none of them is available outside an internal network that is used for inter-services communication. This issue is going to be fixed (GitHub issue CVE-2023-4785: Backport #33656 to 1.46.x to fix gRPC Core) and then a new WDS version will be released with the issue fix |