Llama 2 GGML Medical Chatbot

ThisIs-Developer

Introduction

The Llama-2-GGML-Medical-Chatbot is an advanced AI-driven medical chatbot designed for answering medical questions, providing summaries, and generating medical text using the Llama-2-7B-Chat-GGML model. It is based on the Gale Encyclopedia of Medicine and is suitable for patients, healthcare professionals, and researchers. However, it should not be used as a substitute for professional medical advice.

Architecture

The chatbot uses the Llama-2-7B-Chat-GGML large language model, which is fine-tuned and quantized with the q2_K method. The model file, llama-2-7b-chat.ggmlv3.q2_K.bin, is 2.87 GB and requires at most 5.37 GB of RAM. The quantization mixes tensor types, using GGML_TYPE_Q4_K for some tensors and GGML_TYPE_Q2_K for others. The system is built with LangChain and Chainlit, and runs on a minimum hardware setup of an Intel® Core™ i3 CPU with 8 GB RAM.
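The figures above can be turned into a quick pre-flight check before starting the app. The following is a minimal sketch, not the project's own code: the helper names (fits_in_ram, model_file_looks_complete, load_llm) are hypothetical, the 5% size tolerance and the reading of 1 GB as 10**9 bytes are assumptions, and loading through LangChain's CTransformers wrapper with these config values is an assumed setup rather than something the text confirms.

```python
from pathlib import Path

# Figures quoted in the Architecture text.
MODEL_FILE = "llama-2-7b-chat.ggmlv3.q2_K.bin"
MODEL_SIZE_GB = 2.87
MAX_RAM_GB = 5.37


def fits_in_ram(total_ram_gb: float, required_gb: float = MAX_RAM_GB) -> bool:
    """Return True if total RAM covers the quoted maximum RAM requirement."""
    return total_ram_gb >= required_gb


def model_file_looks_complete(
    path: Path, expected_gb: float = MODEL_SIZE_GB, tolerance: float = 0.05
) -> bool:
    """Return True if the file exists and its size is close to expected_gb.

    Assumption: 1 GB is treated as 10**9 bytes, and a 5% tolerance is
    an arbitrary choice to catch obviously truncated downloads.
    """
    if not path.is_file():
        return False
    size_gb = path.stat().st_size / 1e9
    return abs(size_gb - expected_gb) <= expected_gb * tolerance


def load_llm(model_path: str = MODEL_FILE):
    """Load the quantized model via LangChain's CTransformers wrapper.

    Assumption: the project may load the model differently; the config
    values below are illustrative only.
    """
    # Imported lazily so the checks above work without these packages.
    from langchain_community.llms import CTransformers

    return CTransformers(
        model=model_path,
        model_type="llama",
        config={"max_new_tokens": 512, "temperature": 0.5},
    )
```

On the minimum setup described above, fits_in_ram(8.0) passes because 8 GB exceeds the 5.37 GB maximum, although the operating system and other processes also need part of that memory.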

Training

The chatbot is trained on the Gale Encyclopedia of Medicine, Volume 1, 2nd Edition, a 637-page PDF. This source lets the chatbot cover a wide range of medical topics.

Guide: Running Locally

  1. Open Git Bash.
  2. Navigate to the directory where you want to clone the repository.
  3. Clone the repository and move into it:
    git clone https://github.com/ThisIs-Developer/Llama-2-GGML-Medical-Chatbot.git
    cd Llama-2-GGML-Medical-Chatbot

  4. Install the required packages:
    pip install -r requirements.txt

  5. Run the chatbot (the -w flag makes Chainlit reload the app when source files change):
    chainlit run model.py -w

For enhanced performance, consider using cloud GPUs from providers like AWS, GCP, or Azure.

License

The Llama-2-GGML-Medical-Chatbot is licensed under the MIT License.
