Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feat] ChatGPT Integration Part 2: LLM API #283

Open
trangiabach opened this issue Sep 24, 2023 · 1 comment
Open

[Feat] ChatGPT Integration Part 2: LLM API #283

trangiabach opened this issue Sep 24, 2023 · 1 comment
Assignees

Comments

@trangiabach
Copy link
Contributor

trangiabach commented Sep 24, 2023

This is related to #279. An article for reference.

  • For each course, create an LLM configuration object that stores the prompt, model type + other relevant configs. There will also be a general prompt applied to all LLM configuration object to tune the LLM to the task of responding to office hours questions based on course materials

  • Create an API endpoint to edit these configs (prompt engineering the LLM)

  • Create an API endpoint that given a search query, calls the VectorDB API to return relevant documents, ingest the documents into the LLM using the OpenAI API, query the OpenAI API using the search query and return the response

  • Since this API is priced based on usage, log the amount of usage along with the cost for each course using the LLM.

  • Implement rate-limits on the LLM Answering API

@krrishdholakia
Copy link

Hey @trangiabach - i'm the maintainer of LiteLLM (https://github.com/BerriAI/litellm)

a library to simplify calling multiple LLM APIs (OpenAI, Anthropic, Palm, Replicate, etc.) with a drop-in replacement for the openai-python sdk.

We also have an opensource proxy server that i think might help deal with some of the issues you've outlined, especially around rate limits.

Here's the code - https://github.com/BerriAI/liteLLM-proxy

Let me know if this is helpful - happy to contribute if necessary!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

No branches or pull requests

3 participants