Skip to content

Building a Custom Vector Search Engine with Weaviate : The project discusses the architecture of Weaviate, an open-source vector database and provides a tutorial implementation of a custom vector search engine using Weaviate Cloud Service(WCS).

Notifications You must be signed in to change notification settings

CS-Edwards/vector_search_engine

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Weaviate Vector Database Project

Weaviate Overview and Semantic Search Demo

By: Candace Edwards Email: cedward2@hawaii.edu

This is the final project for ICS691B: Big Data Analytics, Fall 2022. Submitted: 12/15/22

In this project I’ve explored the relevant architecture of the Weaviate core node, implemented a vector database using Weaviate and applied three dimensionality reduction algorithms to provide 2D and 3D visualization of the vector data.

The data for this project is gathered by scraping the UH Manoa spring ‘23 registration page and course catalog page. Ultimately, we use course information such as the course name, course description, department and instructor to build our database objects and vectorize our data.

Deliverables

  • The final project note book is: `CEdwards_ICS691B_Final_Weaviate.ipynb`
  • Python code for webscrape directory/file: `code/data_util.py`
  • Data for dataframe directory/file: `data/data_df.csv`
  • Vector embeddings numpy directory/file: `data/embeddings.npy`

Outline:

I. Project Overview
II. Weaviate Overview
----Architecture Overview
----Vectors and Vector Index
------kNN vs ANN
------HNSW
III. Vector Database Demonstration
----Data and Vectors
----Weaviate Database Instance
----Semantic Search Queries
IV. Visualizations
----PCA
----t-SNE
----UMAP
V. Conclusion

Misc:

Draft/Sandbox: working_draft.ipynb

About

Building a Custom Vector Search Engine with Weaviate : The project discusses the architecture of Weaviate, an open-source vector database and provides a tutorial implementation of a custom vector search engine using Weaviate Cloud Service(WCS).

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published