- Added new dependencies including chonkie and markdown-it-py to requirements.txt. - Refactored the Elasticsearch ingestion script to read and concatenate documents from specified folders. - Implemented semantic chunking for documents using the chonkie library. - Removed the old elasticsearch_ingestion_from_docs.py script as its functionality has been integrated into the main ingestion pipeline. - Updated README.md to reflect new project structure and environment variables. - Added a new changelog entry for version 1.4.0 detailing recent changes and enhancements. |
||
|---|---|---|
| .. | ||
| elasticsearch_ingestion.py | ||
| es_ingestion.ipynb | ||
| evaluate_retrieve.ipynb | ||
| generate_synthethic_data.ipynb | ||
| langgraph_agent_simple.ipynb | ||
| test_chunker.ipynb | ||