Text Generation
Transformers
Safetensors
English
mistral
Mistral
instruct
finetune
Synthetic
conversational
text-generation-inference
Instructions to use NousResearch/Genstruct-7B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use NousResearch/Genstruct-7B with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="NousResearch/Genstruct-7B") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("NousResearch/Genstruct-7B") model = AutoModelForCausalLM.from_pretrained("NousResearch/Genstruct-7B") messages = [ {"role": "user", "content": "Who are you?"}, ] inputs = tokenizer.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Inference
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use NousResearch/Genstruct-7B with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "NousResearch/Genstruct-7B" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "NousResearch/Genstruct-7B", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/NousResearch/Genstruct-7B
- SGLang
How to use NousResearch/Genstruct-7B with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "NousResearch/Genstruct-7B" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "NousResearch/Genstruct-7B", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "NousResearch/Genstruct-7B" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "NousResearch/Genstruct-7B", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use NousResearch/Genstruct-7B with Docker Model Runner:
docker model run hf.co/NousResearch/Genstruct-7B
Commit History
Update README.md b7b9961 verified
Update README.md 6fdc41e verified
Update README.md 1d44a2b verified
Update README.md 89c6e61 verified
Update README.md c9d5a9f verified
Update README.md 8257a1e verified
Update README.md 3ca7820 verified
Update README.md aed11b1 verified
Update README.md 260cdb3 verified
Update README.md e1a33e2 verified
Update README.md 9a0429a verified
Update README.md 4185deb verified
Create notebook.ipynb 0bf7deb verified
Update README.md 922d4a9 verified
Update README.md 01a5847 verified
Update README.md 6c767d8 verified
Upload tokenizer f68a69a verified
Upload tokenizer 495f58e verified
Create README.md 3842fa9
Upload tokenizer 5b3c04b
Upload MistralForCausalLM 80ef54d
initial commit ddc542e
Jade commited on