I want to run Llama 2 on a GPU, since generating answers on the CPU takes forever. I have access to an NVIDIA A6000 through a Jupyter notebook. I have installed everything and the responses are fine, but generation is far too slow for my research purposes.
```python
import torch
import transformers
from transformers import LlamaForCausalLM, LlamaTokenizer
import setGPU

model_dir = "llama/llama-2-7b-chat-hf"
model = LlamaForCausalLM.from_pretrained(model_dir)
tokenizer = LlamaTokenizer.from_pretrained(model_dir)

pipeline = transformers.pipeline(
    "text-generation",
    model=model,
    tokenizer=tokenizer,
    torch_dtype=torch.float16,
)

sequences = pipeline(
    'I wanna hear some news. What is up today',
    do_sample=True,
    top_k=10,
    num_return_sequences=1,
    eos_token_id=tokenizer.eos_token_id,
    max_length=400,
)

for seq in sequences:
    print(f"{seq['generated_text']}")
```

This is my current code. When I run nvidia-smi in a terminal, GPU utilization always stays at 0% while CPU RAM usage climbs steeply, so it is clearly running 100% on the CPU. How can I make it run on the GPU?
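For reference, whether PyTorch can see the GPU at all can be sanity-checked from the notebook with standard `torch.cuda` calls (this is just a generic diagnostic, not something from the tutorial):

```python
import torch

# True only if this PyTorch build has CUDA support and a GPU is visible
print(torch.cuda.is_available())

if torch.cuda.is_available():
    # Name of the first visible device, e.g. an A6000
    print(torch.cuda.get_device_name(0))
```

If this prints False, the installed PyTorch is a CPU-only build and no amount of pipeline configuration will reach the GPU.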
I followed this tutorial directly from Meta: https://ai.meta.com/blog/5-steps-to-getting-started-with-llama-2/