Run Recipe

Once a recipe is created and saved, you need to publish it to assign it a version number. A Run button is enabled after the recipe has been published.

Recipe "run" process starts the data ingestion pipeline, executing tasks to connect to data source, pre-process the data and create a knowledge base with vector embedding as per the configurations in the recipe elements.

Configurations related to the prompt and output elements are not relevant for the recipe run.

Following processing metrics are displayed on the recipe dashboard, highlighting each task in the data ingestion process.

  • X-axis: Processing tasks

    • OCR (Optical Character Recognition): Extraction of text from images or scanned documents.

    • PII (Personally Identifiable Information): Identification and handling of sensitive personal data such as names, addresses, or social security numbers.

    • Chunking: Division of text into smaller, meaningful parts or "chunks" for analysis or processing.

    • Embeddings: Conversion of text data into numerical format for machine learning algorithms by mapping words or phrases to vectors in a high-dimensional space

  • Y-axis: Count of processed items

If errors occur during the recipe run, error messages are displayed in the recipe panel and can also be visualized as error counts in the dashboard.

You can also review the summary of the run, including a list of connectors with embedded items and chunks.

Last updated