Feature/indexing and sparse embeddings
Indexing and Cloud-based Sparse Embedding
Changes Overview
This PR introduces the following improvements and adjustments related to data indexing and embedding handling:
- Add support for indexing data directly to cloud
- Remove local sparse embedding logic
Detailed Changes
1. Cloud Integration
- Implemented functionality to index data directly into vector database
- Established connection handling and necessary API calls for uploading vectors
2. Remove Local Sparse Embedding
- Deleted local sparse embedding logic and related files/modules
- Transitioned embedding responsibilities fully to cloud services
Testing
-
Verified successful data indexing to cloud -
Tested cloud embedding integration with sample data -
Ran full pipeline to validate end-to-end indexing
Cleanup Verification
- Ensured all removed embedding code is unused and unreferenced
- Verified new cloud indexing logic works as intended
- Checked logs and cloud responses for consistency and errors