Commit Graph

24 Commits

Author SHA1 Message Date
acano bf3c7f36d8 feat(chunk): enhance file reading and processing logic
- Updated `read_files` function to return a list of dictionaries containing 'content' and 'title' keys.
- Added logic to handle concatenation of file contents and improved handling of file prefixes.
- Introduced `get_chunk_docs` function to chunk document contents using `SemanticChunker`.
- Added `convert_chunks_to_document` function to convert chunked content into `Document` objects.
- Integrated logging for chunking process.
- Updated dependencies in `uv.lock` to include `chonkie` and other related packages.
2026-03-10 14:36:09 +01:00
acano 6d856ba691 Add chunk.py for processing and replacing JavaScript references with Avap
- Implemented `replace_javascript_with_avap` function to handle text replacement.
- Created `read_concat_files` function to read and concatenate files with a specified prefix, replacing JavaScript markers.
- Added functionality to read files from a specified directory and process their contents.
2026-03-09 13:21:18 +01:00
acano a434d34676 Updated docs 2026-03-06 11:38:06 +01:00
acano d9d754bc6f Implement feature X to enhance user experience and optimize performance 2026-03-05 10:57:38 +01:00
acano 1549069f5a feat: Add Elasticsearch ingestion pipeline and document chunking functionality
- Implemented `elasticsearch_ingestion` function to handle document ingestion into Elasticsearch.
- Created `build_chunks_from_folder` function to read and clean text files, generating document chunks.
- Added logging for better traceability during the ingestion process.
- Updated `uv.lock` to include `boto3` as a new dependency.
2026-03-04 18:21:01 +01:00
acano 0538f3b5ce Refactor code structure for improved readability and maintainability 2026-03-03 17:49:27 +01:00
acano ff08d9a426 Refactor code structure for improved readability and maintainability 2026-03-03 14:38:55 +01:00
acano 5a666079a4 Refactor langgraph_agent_simple notebook execution counts and handle Langfuse client errors
- Set execution counts to null for initial cells in langgraph_agent_simple.ipynb
- Update execution counts for subsequent cells to maintain order
- Change output stream name from stdout to stderr for error handling
- Capture and log detailed error messages for failed Langfuse client authentication

Update uv.lock to manage accelerate dependency

- Remove accelerate from main dependencies
- Add accelerate to dev dependencies with version specification
- Adjust requires-dist section to reflect changes in dependency management
2026-03-02 14:07:29 +01:00
acano 48d280440c Refactor code structure for improved readability and maintainability 2026-02-27 14:45:33 +01:00
acano 10246a3046 Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev 2026-02-27 08:41:10 +01:00
acano a70cd29b67 Refactor code structure for improved readability and maintainability 2026-02-27 08:39:39 +01:00
pseco b01a76e71d evaluation on acano 2026-02-24 17:03:33 +01:00
pseco f6a907911d unstage changes 2026-02-24 16:59:57 +01:00
acano 4b5352d93c Refactor code structure for improved readability and maintainability 2026-02-24 16:24:32 +01:00
acano d4d7d9d2a1 Implement feature X to enhance user experience and optimize performance 2026-02-24 15:48:06 +01:00
acano 0d6c08e341 Add BEIR analysis notebook for CosQA and update dependencies
- Created a new Jupyter notebook for analyzing BEIR dataset with CosQA using Ollama embeddings.
- Implemented a custom embedding class to integrate LangChain's OllamaEmbeddings with BEIR.
- Added data loading and evaluation logic for the CosQA dataset.
- Updated `uv.lock` to remove unnecessary dependencies (`mteb` and `polars`) and incremented revision number.
2026-02-24 15:27:59 +01:00
acano a386982722 Add display data output for corpus conversion progress in Jupyter notebook 2026-02-24 11:46:49 +01:00
acano cb16306ffb Refactor code structure for improved readability and maintainability 2026-02-24 11:46:18 +01:00
acano 4a1236f951 Refactor code structure for improved readability and maintainability 2026-02-24 10:49:52 +01:00
acano b662b9a4fa Update langgraph_agent_simple notebook: Adjust execution counts and refine AVAP tool description
- Changed execution counts for several code cells to maintain proper order.
- Updated system message to specify the role of the agent in responding to AVAP-related queries.
- Modified user input example to inquire about reserved words in AVAP.
- Enhanced AI response to include detailed information about AVAP reserved words and provided a code example demonstrating their usage.
2026-02-20 10:05:38 +01:00
acano 0c2d0b512d Implement feature X to enhance user experience and fix bug Y in module Z 2026-02-19 17:09:42 +01:00
acano 02af67fffb created notebook for langgraph testing 2026-02-19 14:48:46 +01:00
acano ba2d2dbcaa Implement feature X to enhance user experience and fix bug Y in module Z 2026-02-18 14:51:04 +01:00
acano 0dad6b1ef5 feat: add initial implementation of Elasticsearch ingestion with chunking strategies 2026-02-18 13:52:54 +01:00