Skip to content
Snippets Groups Projects
Unverified Commit f81a5d25 authored by Henry Chen's avatar Henry Chen Committed by GitHub
Browse files

Fixes for falcon model(s) (#40)

parent e1bbb6e3
No related branches found
No related tags found
No related merge requests found
...@@ -3,7 +3,7 @@ deployment: ...@@ -3,7 +3,7 @@ deployment:
image: quay.io/chenhunghan/ialacol:latest image: quay.io/chenhunghan/ialacol:latest
env: env:
DEFAULT_MODEL_HG_REPO_ID: TheBloke/WizardLM-Uncensored-Falcon-40B-GGML DEFAULT_MODEL_HG_REPO_ID: TheBloke/WizardLM-Uncensored-Falcon-40B-GGML
DEFAULT_MODEL_FILE: wizard-falcon40b.ggmlv3.q4_K_S.bin DEFAULT_MODEL_FILE: wizardlm-uncensored-falcon-40b.ggccv1.q4_0.bin
resources: resources:
{} {}
cache: cache:
......
...@@ -3,7 +3,9 @@ deployment: ...@@ -3,7 +3,9 @@ deployment:
image: quay.io/chenhunghan/ialacol:latest image: quay.io/chenhunghan/ialacol:latest
env: env:
DEFAULT_MODEL_HG_REPO_ID: TheBloke/WizardLM-Uncensored-Falcon-7B-GGML DEFAULT_MODEL_HG_REPO_ID: TheBloke/WizardLM-Uncensored-Falcon-7B-GGML
DEFAULT_MODEL_FILE: wizard-falcon-7b.ggmlv3.q4_1.bin DEFAULT_MODEL_FILE: wizardlm-7b-uncensored.ggccv1.q4_0.bin
# the file name of the model file is missing `falcon` so we need to set the model type
MODE_TYPE: falcon
resources: resources:
{} {}
cache: cache:
......
...@@ -38,7 +38,10 @@ async def get_llm( ...@@ -38,7 +38,10 @@ async def get_llm(
ctransformer_model_type = "dolly-v2" ctransformer_model_type = "dolly-v2"
if "stablelm" in body.model: if "stablelm" in body.model:
ctransformer_model_type = "gpt_neox" ctransformer_model_type = "gpt_neox"
MODE_TYPE = get_env("MODE_TYPE", "")
if len(MODE_TYPE) > 0:
ctransformer_model_type = MODE_TYPE
MODELS_FOLDER = get_env("MODELS_FOLDER", "models") MODELS_FOLDER = get_env("MODELS_FOLDER", "models")
return AutoModelForCausalLM.from_pretrained( return AutoModelForCausalLM.from_pretrained(
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment