I created a highly personalised large language model with Nvidia's entertaining Chat with RTX app but at 60GB+ I'm now beginning to wonder if it's worth keeping around

Nvidia's Chat with RTX software in use, which offers a personally trained LLM.

Owners of RTX 40- and 30-series graphics cards can now set up their own personalised large language model (LLM) on their PC. It's one that's eminently capable of sifting through old documents or distilling down the essence of YouTube videos.

Chat with RTX is available to download for free from Nvidia's website as of today, February 13. It works with any current or last-generation graphics card with at least 8GB of VRAM, which includes every desktop card bar the RTX 3050 6GB and excludes a few mid- to low-end laptop GPUs. It also requires 50-100GB of storage space on your PC, depending on the AI models downloaded.

There are two models to choose from: Mistral or Llama 2. The default is Mistral, and I'd recommend sticking with that.

The key parts of Chat with RTX are retrieval-augmented generation (RAG) and TensorRT-LLM. The former means you're able to give the LLM your own information, which it will use alongside its internal training to generate accurate responses to your queries. The latter builds TensorRT engines that exploit the silicon in Nvidia's GeForce GPUs to run AI applications more efficiently.

The result is an LLM you can feed your own data to (.txt, .pdf, and .doc filetypes) and which you can then query on that data.
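
To give a rough sense of what that retrieval step looks like under the hood, here's a minimal sketch in Python. To be clear, this isn't Nvidia's implementation (Chat with RTX pairs its retrieval with TensorRT-LLM and a Mistral or Llama 2 model), and the folder of drafts and the query are made up for illustration. But the basic idea is the same: index a pile of local text files, pull out the documents most relevant to a question, and hand only those to the model as context.

```python
# Minimal sketch of the retrieval half of a RAG pipeline, for illustration only.
# Assumptions: a hypothetical folder of article drafts saved as .txt files, and
# scikit-learn installed. Chat with RTX's real indexing works differently.
from pathlib import Path

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

docs_dir = Path("article_drafts")  # hypothetical folder of drafts
paths = sorted(docs_dir.glob("*.txt"))
texts = [p.read_text(encoding="utf-8", errors="ignore") for p in paths]

# Build a TF-IDF index over the documents; a stand-in for the embedding
# index a production RAG system would use.
vectorizer = TfidfVectorizer(stop_words="english")
doc_matrix = vectorizer.fit_transform(texts)

def retrieve(query: str, top_k: int = 3):
    """Return the top_k documents most similar to the query."""
    query_vec = vectorizer.transform([query])
    scores = cosine_similarity(query_vec, doc_matrix).ravel()
    ranked = scores.argsort()[::-1][:top_k]
    return [(paths[i].name, float(scores[i])) for i in ranked]

# The retrieved documents would then be pasted into the LLM's prompt as
# context, which is why responses can cite the files they drew from.
print(retrieve("Which articles mention Nvidia?"))
```

The model only sees whatever the retriever hands it, so if the retrieval step misses a document, the answer misses it too, as you'll see shortly.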

For example, I've been playing around with the tool these past few days, and since I create a lot of documents as part of this job, they feel like the prime dataset to stuff into its gaping maw. So I've set up Chat with RTX on my RTX 4080-powered PC (install size of 61.7GB) and fed the Mistral model over 1,300 pieces of wonderful prose (ahem, or rather my news article drafts). I then set about asking it some questions.

First, I asked 'Could you name the articles where I mention Nvidia?'

Nvidia's Chat with RTX software in use.

Out comes the above response listing three articles with their file paths. Now, I've definitely spoken about Nvidia more than three times in 1,300 articles, so let's give that another try.

I ask again, rewording the query a little, 'Could you list every article in which I mention Nvidia?'

Nvidia's Chat with RTX software in use.

This time eight articles are listed, now with their Google Doc titles included. I've mentioned Nvidia many more times than that, but you get the general idea of how this all works. Each response does appear grounded in truth, citing the data used to generate it, if not always the whole truth. Simply using the Windows search function within the article dataset brings up 128 drafts including the term 'Nvidia' in the title, let alone the body copy.

Another example: if I ask Chat with RTX to tell me how many times I've used the word cheese, it tells me I've never used the word, citing an untitled and unrelated document as the source for the information. Nevertheless, it's probably right about the cheese thing. Until now, anyways.

Yet the tool is more exciting once you start asking it to summarise large quantities of information down into single bite-sized responses.

I asked Chat with RTX whether I should buy an Intel Core i9 14900K, and it came back to me with a trimmed version of my own 14900K review, which succinctly summarised it to "Based on the review, it appears that the Intel Core i9 14900K may not be worth the extra cost compared to the Core i9 13900K".

Couldn't put it better myself.

Nvidia's Chat with RTX software in use, which offers a personally trained LLM.

I also asked Chat with RTX to summarise an article I wrote a while back on Alpine's F1 esports team, which it explained succinctly, and then to tell me about Intel's Meteor Lake processors, which I knew were covered a few times in the articles within the dataset.

Nvidia's Chat with RTX software in use, which offers a personally trained LLM.

Oh, and I asked it who I was. This was more to make myself feel important, as the LLM returned a description of me in near-enough the same words I used to describe myself for my site bio. Theoretically, you could just feed Chat with RTX thousands of documents on how great you are and create a narcissist's dream software.

Not that I'd do that, nope.

Nvidia's Chat with RTX software in use, which offers a personally trained LLM.

It's the summarising of large datasets that I could see this tool being useful for, though I doubt everyone has such a need for that. The average PC user might not fancy a 100GB app to tell them what they already know. But say you're working with a huge number of responses to a survey and you want to quickly get an idea of the general thoughts and feelings of those who answered, this is one easy way to do it. It's best used with caution, though, and only as a guide to the dataset you feed it, not as a way to accurately analyse it in full.

The other people it might appeal to are those who prefer to keep their content out of the cloud. The idea of asking an AI hosted God-knows-where to handle files that could contain sensitive information, or the manuscript for your big action movie idea, isn't that appealing to many. We've already seen what this looks like when it goes wrong, courtesy of Samsung's employees. That's why a locally run tool such as this might instead appeal.

The other use for Chat with RTX is to feed it YouTube videos and then query it on their contents. I grabbed an episode of Chat Log, a podcast hosted by my colleagues Lauren Morton and Mollie Taylor, and fed it into the machine. The episode is titled 'Does the Steam Deck suit our PC gaming lifestyle so far?'

I asked, 'Is the Steam Deck easy to use day-to-day?' and out comes a response that sums up Lauren and Mollie's conversation with Tyler Colp on the matter.

Nvidia's Chat with RTX software in use, which offers a personally trained LLM.

I also then asked the obvious question, 'Does the Steam Deck suit their PC gaming lifestyle so far?' The response:

Nvidia's Chat with RTX software in use, which offers a personally trained LLM.

This feature works by downloading the transcript of the YouTube video, ingesting it, and using RAG to respond appropriately to a user's questions. It definitely appears to generate good summaries of YouTube videos with plenty of talking, though due to the reliance on transcripts, you can't feed it anything that relies on visual information. Feed it the Grand Theft Auto VI trailer, with next to no words throughout, and you'll get nothing out of it.
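
For the curious, here's roughly what that transcript-grabbing step could look like. I don't know what Nvidia actually uses internally; the sketch below leans on the third-party youtube_transcript_api package as a stand-in, and the video ID is just a placeholder. The flow, though, is the same as described above: pull the captions, flatten them into one block of text, and that text becomes the dataset the RAG step retrieves from.

```python
# Sketch of the YouTube ingestion step: fetch a video's transcript and flatten
# it into plain text a RAG pipeline can index. Illustrative only; this uses the
# third-party youtube_transcript_api package, not whatever Chat with RTX does.
from youtube_transcript_api import YouTubeTranscriptApi

VIDEO_ID = "dQw4w9WgXcQ"  # placeholder; swap in the video you want to query

# Each segment is a dict with 'text', 'start' and 'duration' keys.
segments = YouTubeTranscriptApi.get_transcript(VIDEO_ID)

# Join the caption segments into one document for indexing and querying.
transcript_text = " ".join(seg["text"] for seg in segments)

# A video with little or no speech yields next to no transcript, which is why
# a mostly wordless trailer gives the tool nothing to work with.
print(f"{len(segments)} caption segments, {len(transcript_text)} characters")
```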

I'm not so sure about the YouTube usage. On the one hand, I could see it being useful for a summary of a long livestream or event which you don't have time to watch yourself, though it's a chunky application to have around for those few instances where that's a thing. On the other hand, the YouTube creator doesn't appear to get a view out of this, and I tend to fall in the 'AIs scraping information from creators online and offering nothing in exchange will break the very core of the internet as we know it' camp. This application alone might not make much of a difference, but I do strongly believe that if you want the information provided by someone, you should at least support them in creating more of it.

Nvidia's Chat with RTX software in use, which offers a personally trained LLM.

Anyways, the YouTube stuff takes a backseat for me with Chat with RTX. It's the mass digestion of local text files that feels like the most important piece of the software. As an application, it's pretty snappy, generating responses swiftly once you hit send on a query. Though it does appear to gobble up around 85% of my VRAM, so you need to be sure to close it properly with the off switch to release that back to the PC once you're done with it.

Chat with RTX is a fun concept, and a good way for Nvidia to show what local inference on its GeForce cards can do, but I'm not sure I'm going to keep it around on my PC. Partly that's because it's absolutely massive thanks to the huge model data, but more so because the actual practical uses are pretty limited for me, personally.

Perhaps some clever clogs will come up with new and exciting ways to put it into practice now that it's available to the world. That could be you, providing you have the proper hardware. You can download Chat with RTX to try for yourself today.