assistance-engine

Commit Graph

Author	SHA1	Message	Date
acano	ed25f15542	feat: Enhance Elasticsearch ingestion process with metadata export - Added output path parameter to elasticsearch_ingestion command for exporting processed documents. - Implemented ElasticHandshakeWithMetadata class to preserve chunk metadata during ingestion. - Updated process_documents function to include extra metadata for each chunk. - Modified ingest_documents function to return Elasticsearch response for each chunk. - Introduced export_documents function to save processed documents as JSON files.	2026-03-12 12:26:47 +01:00
acano	46a6344c45	Add docstrings to elasticsearch_ingestion and ingest_documents functions for improved documentation	2026-03-12 09:53:56 +01:00
acano	189e404d21	Refactor Elasticsearch ingestion and document processing functions for improved clarity and functionality	2026-03-12 09:50:30 +01:00
acano	5f21544e0b	Refactor Elasticsearch ingestion pipeline and add MBPP generation script - Updated `elasticsearch_ingestion.py` to streamline document processing and ingestion into Elasticsearch. - Introduced `generate_mbap.py` for generating benchmark problems in AVAP language from a provided LRM. - Created `prompts.py` to define prompts for converting Python problems to AVAP. - Enhanced chunk processing in `chunk.py` to support markdown and AVAP documents. - Added `OllamaEmbeddings` class in `embeddings.py` for handling embeddings with Ollama model. - Updated dependencies in `uv.lock` to include new packages and versions.	2026-03-11 17:17:44 +01:00
acano	bf3c7f36d8	feat(chunk): enhance file reading and processing logic - Updated `read_files` function to return a list of dictionaries containing 'content' and 'title' keys. - Added logic to handle concatenation of file contents and improved handling of file prefixes. - Introduced `get_chunk_docs` function to chunk document contents using `SemanticChunker`. - Added `convert_chunks_to_document` function to convert chunked content into `Document` objects. - Integrated logging for chunking process. - Updated dependencies in `uv.lock` to include `chonkie` and other related packages.	2026-03-10 14:36:09 +01:00
acano	6d856ba691	Add chunk.py for processing and replacing JavaScript references with Avap - Implemented `replace_javascript_with_avap` function to handle text replacement. - Created `read_concat_files` function to read and concatenate files with a specified prefix, replacing JavaScript markers. - Added functionality to read files from a specified directory and process their contents.	2026-03-09 13:21:18 +01:00

6 Commits