Commit Graph

32 Commits

Author SHA1 Message Date
pseco cd3922abbd modified config.py 2026-03-11 10:41:28 +01:00
pseco f5b2df94d2 Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev 2026-03-10 14:40:32 +01:00
pseco 4c56dc29c4 Add initial Jupyter notebook for document ingestion using Ollama embeddings
- Implemented code to utilize OllamaEmbeddings for embedding documents.
- Included example usage with sample text inputs.
- Demonstrated response handling from the Ollama LLM.
- Noted deprecation warning for the Ollama class in LangChain.
2026-03-10 14:40:27 +01:00
acano bf3c7f36d8 feat(chunk): enhance file reading and processing logic
- Updated `read_files` function to return a list of dictionaries containing 'content' and 'title' keys.
- Added logic to handle concatenation of file contents and improved handling of file prefixes.
- Introduced `get_chunk_docs` function to chunk document contents using `SemanticChunker`.
- Added `convert_chunks_to_document` function to convert chunked content into `Document` objects.
- Integrated logging for chunking process.
- Updated dependencies in `uv.lock` to include `chonkie` and other related packages.
2026-03-10 14:36:09 +01:00
pseco 4afba7d89d working on scrappy 2026-03-09 15:00:07 +01:00
pseco 11e6ef71b1 working on agent pseco scratches 2026-03-09 09:35:49 +01:00
pseco 183c04829c Update changelog for version 1.2.0: add new modules, refactor server integration, and enhance dependency management 2026-03-05 11:00:30 +01:00
pseco 9079674114 working on retrieve from ES 2026-03-04 13:58:38 +01:00
pseco 63c5fc976f working on Dual Index 2026-03-03 15:07:53 +01:00
pseco 9575af3ff0 working on dual index 2026-03-03 12:01:03 +01:00
pseco 8297ae204c added lark to notebook 2026-03-03 09:39:09 +01:00
pseco e01e424fac working on llm_factory 2026-02-26 18:02:46 +01:00
pseco 77751ee8ac working on langgraph agent v2 2026-02-26 11:35:00 +01:00
pseco 4a2db004c0 adding ragas to dev 2026-02-26 09:45:42 +01:00
pseco dfdf94f604 working on ragas 2026-02-25 17:28:35 +01:00
pseco 12eef38f33 count tokens files 2026-02-25 17:17:20 +01:00
pseco 71cb79985c added config and count tokens 2026-02-25 14:59:57 +01:00
pseco cdd5f45ae1 Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev 2026-02-24 16:59:38 +01:00
pseco d6ac7aa1ca working on evaluating embeddings 2026-02-24 16:53:37 +01:00
acano 4b5352d93c Refactor code structure for improved readability and maintainability 2026-02-24 16:24:32 +01:00
acano 0d6c08e341 Add BEIR analysis notebook for CosQA and update dependencies
- Created a new Jupyter notebook for analyzing BEIR dataset with CosQA using Ollama embeddings.
- Implemented a custom embedding class to integrate LangChain's OllamaEmbeddings with BEIR.
- Added data loading and evaluation logic for the CosQA dataset.
- Updated `uv.lock` to remove unnecessary dependencies (`mteb` and `polars`) and incremented revision number.
2026-02-24 15:27:59 +01:00
pseco ff438ea6c4 update makefile 2026-02-24 14:52:48 +01:00
pseco 9b6726c232 working on evaluating embeddings 2026-02-24 14:35:48 +01:00
pseco 397dc7602b working on embeddings evaluation 2026-02-24 12:09:51 +01:00
pseco bb56222013 working on ragas 2026-02-24 10:51:04 +01:00
pseco 0b75c3254c Refactor code structure for improved readability and maintainability 2026-02-19 17:09:35 +01:00
pseco 4b0be0b80b feat: add retrieval functionality and update execution counts in notebooks 2026-02-19 12:45:36 +01:00
pseco 51488b3ee6 working on ingestion 2026-02-19 11:54:31 +01:00
pseco 1a77b84921 working on retrieve 2026-02-18 16:23:03 +01:00
pseco 26603a9f45 feat: add chunking methods and ingestion process for Elasticsearch 2026-02-18 14:51:52 +01:00
pseco 2ec64b1472 working on ingestion 2026-02-18 13:52:01 +01:00
pseco 36bd3b32a6 generate working schema 2026-02-16 17:58:18 +01:00