Solidstack Service

Single-container bundle of all core services for quick evaluation and small workloads.
Simplifies setup and reduces resource footprint, but lacks horizontal scaling and isolates less than multi-service deployments.

This service is not open-source but its image is publicly accessible on DockerHub

Configuration

The following environment variables are used to configure this service:

Name Description
MONGOMONGODB_CONNECTION_STRING Required MongoDB connection string
MONGODB_DATABASE_NAME Optional. Sets or overrides the database name.
JOB_TYPES Required Available job types for job configs
GLOBAL_EXCEPTION_RESPONSE_DELAY_MS Optional. If a server error occurred, how much time should be waited before responding with an HTTP error? This makes sense because some clients can’t wait before repeating requests on their side, so this is done on the server side. Default value is 1 second
EXTERNAL_IP_ADDRESS_CONFIGS Required A coma-separated list of external IP getter services
MIN_LOG_LEVEL Optional. Minimal log level. Default value is Information
FEATURE_FLAG_RETRIEVAL_ENABLED Optional. Turns on/off the Retrieve feature. If set to true, only MongoDB Atlas can be used as the system DB, and EMBEDDING_SERVICE_URL must be set. Default value is false
SEARCH_MODE Optional. Search mode. Supported modes: FullText, Vector, FullTextAndVector. Default: FullText
EMBEDDING_SERVICE_URL Optional. Embedding service URL. Required if FEATURE_FLAG_RETRIEVAL_ENABLED is set to true and SEARCH_MODE is Vector or FullTextAndVector
EMBEDDING_SERVICE_API_KEY Optional. Embedding service API key if applicable
EMBEDDING_SERVICE_REQUEST_TEMPLATE Optional. Request body to send to an embedding service. Default: { 'model': 'embeddinggemma', 'input': null }
EMBEDDING_SERVICE_CONTENT_JSON_PATH Optional. Json path to replace content in the template. Default: $.input
EMBEDDING_SERVICE_RESULT_JSON_PATH Optional. Json path to get a results array from the embedding service response. Default: $.embeddings
EMBEDDING_VECTORS_LENGTH Optional. Embedding vector lenght. Default 768 (the default value for the embeddinggemma model)

JobTypes

Supported job execution environments.
Default and allowed values can be restricted in Dapi or Solidstack services.
Ensure the Crawler/Solidstack service is configured to handle the selected job type.

Name Description
internet Crawl data from internet sources via request gateways (Proxy addresses, Host IP addresses, etc.)
intranet Crawl data from intranet sources with no limits

Please rotate your device to landscape mode

This documentation is specifically designed with a wider layout to provide a better reading experience for code examples, tables, and diagrams.
Rotating your device horizontally ensures you can see everything clearly without excessive scrolling or resizing.

Return to Web Data Source Home