This codebase includes a FastAPI application that exposes an API to convert text into sparse embeddings.
It makes use of:
- The Transformers library to load pretrained models and tokenizers
- Pydantic for data validation
- FastAPI for creating a web API
- SPLADE for creating sparse representations of text
- Pinecone for managing the resulting sparse vector values
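A minimal sketch of how these pieces might fit together is shown below. The model checkpoint, endpoint path, and response shape are assumptions for illustration and may differ from what this repository actually uses:

```python
# Sketch: a minimal FastAPI app that turns text into SPLADE sparse vectors.
# Model name and endpoint path are assumptions, not necessarily this repo's choices.
import torch
from fastapi import FastAPI
from pydantic import BaseModel
from transformers import AutoModelForMaskedLM, AutoTokenizer

MODEL_ID = "naver/splade-cocondenser-ensembledistil"  # assumed SPLADE checkpoint
tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForMaskedLM.from_pretrained(MODEL_ID)
model.eval()

app = FastAPI()

class TextInput(BaseModel):
    text: str

@app.post("/calculate-embeddings")
def calculate_embeddings(payload: TextInput):
    tokens = tokenizer(payload.text, return_tensors="pt", truncation=True, max_length=512)
    with torch.no_grad():
        logits = model(**tokens).logits  # shape: (1, seq_len, vocab_size)
    # SPLADE aggregation: log(1 + ReLU(logits)), max-pooled over the sequence,
    # masked so that padding tokens do not contribute.
    weights = torch.log1p(torch.relu(logits)) * tokens["attention_mask"].unsqueeze(-1)
    sparse_vec = weights.max(dim=1).values.squeeze(0)  # shape: (vocab_size,)
    # Keep only non-zero dimensions, in the {indices, values} shape Pinecone expects.
    nonzero = sparse_vec.nonzero(as_tuple=True)[0]
    return {
        "indices": nonzero.tolist(),
        "values": sparse_vec[nonzero].tolist(),
    }
```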
After installing all the necessary packages, you can start the FastAPI server by running:
```bash
uvicorn main:app --reload
```

You can then access the application at http://localhost:8000.
To calculate embeddings for a given text, send a POST request to http://localhost:8000/calculate-embeddings with a JSON payload like the following:
```json
{
  "text": "your text here"
}
```

The response will contain the sparse representation of your text as computed by the pretrained model.
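As an illustration, the endpoint could be called from Python with the requests library (the URL and field name follow the example above; the exact response schema depends on the server):

```python
# Sketch: calling the embedding endpoint with the requests library.
import requests

response = requests.post(
    "http://localhost:8000/calculate-embeddings",
    json={"text": "splade produces sparse lexical embeddings"},
)
response.raise_for_status()
sparse = response.json()
print(sparse)  # e.g. indices/values pairs, depending on the server's response schema
```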
Please note that the server address and endpoint path may vary depending on your setup.
SPLADE is built on a BERT-style encoder, so it can process at most 512 tokens per input; longer texts need to be truncated or split into chunks.
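If your inputs can be longer than that, one possible approach is to split them into overlapping token windows before embedding each window separately. The sketch below uses an assumed SPLADE checkpoint and arbitrary window/overlap sizes:

```python
# Sketch: splitting long text into windows that fit SPLADE's 512-token limit.
from transformers import AutoTokenizer

# Assumed checkpoint; use whichever model this project actually loads.
tokenizer = AutoTokenizer.from_pretrained("naver/splade-cocondenser-ensembledistil")

def chunk_text(text: str, max_tokens: int = 510, stride: int = 64) -> list[str]:
    """Split text into overlapping windows (510 leaves room for [CLS]/[SEP])."""
    ids = tokenizer(text, add_special_tokens=False)["input_ids"]
    step = max_tokens - stride
    chunks = []
    for start in range(0, max(len(ids), 1), step):
        window = ids[start:start + max_tokens]
        chunks.append(tokenizer.decode(window))
        if start + max_tokens >= len(ids):
            break
    return chunks

# Each chunk can then be sent to /calculate-embeddings on its own.
```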
If you wish to contribute to this project, please fork the repository and submit a pull request.