Blasting AI into the past: Modders get Llama AI working on an old Windows 98 PC

gotg llama
(Image credit: Square Enix)

Remember when you were young, your responsibilities were far fewer, and you were still at least a little hopeful about the future potential of tech? Anyway! In our present moment, nothing appears to be safe from the sticky fingers of so-called AI—and that includes nostalgic hardware of yesteryear.

Exo Labs, an outfit with the mission statement of democratising access to AI, such as large language models, has lifted the lid on its latest project: a modified version of Meta's Llama 2 running on a Windows 98 Pentium II machine (via Hackaday). Though not the latest Llama model, it's no less head-turning—even for me, a frequent AI-naysayer.

To be fair, when it comes to big tech's hold over AI, Exo Labs and I seem to be of a similarly wary mind. So, setting aside my own AI-scepticism for the moment, this is undoubtedly an impressive project chiefly because it doesn't rely on a power-hungry, very much environmentally-unfriendly middleman datacenter to run.

The journey to Llama running on ancient-though-local hardware enjoys some twists and turns; after securing the second hand machine, Exo Labs had to contend with finding compatible PS/2 peripherals, and then figure out how they'd even transfer the necessary files onto the decades-old machine. Did you know FTP over an ethernet cable was backwards compatible to this degree? I certainly didn't!

Don't be fooled though—I'm making it sound way easier than it was. Even before FTP finagling was figured out, Exo Labs had to find a way to compile modern code for a pre-Pentium Pro machine. Longer story short-ish, the team went with Borland C++ 5.02, a "26-year-old [integrated development environment] and compiler that ran directly on Windows 98." However, compatibility issues persisted with the programming language C++, so the team had to use the older incarnation of C and deal with declaring variables at the start of every function. Oof.

Then, there's the hardware at the heart of this project. For those needing a refresher, the Pentium II machine sports an itty bitty 128 MB of RAM, while a full size Llama 2 LLM boasts 70 billion parameters. Managing all of these hefty constraints, the results are even more interesting.

Unsurprisingly, Exo Labs had to craft a comparatively svelte version of Llama for this project, now available to tool around with yourself via GitHub. As a result of everything aforementioned, the retrofitted LLM features 1 billion parameters and spits out 0.0093 Tokens per second—hardly blistering, but the headline take here really is that it works at all.

Best gaming PCBest gaming laptop


Best gaming PC: The top pre-built machines.
Best gaming laptop: Great devices for mobile gaming.

Jess Kinghorn
Hardware Writer

Jess has been writing about games for over ten years, spending the last seven working on print publications PLAY and Official PlayStation Magazine. When she’s not writing about all things hardware here, she’s getting cosy with a horror classic, ranting about a cult hit to a captive audience, or tinkering with some tabletop nonsense.

Read more
DeepSeek
Today I learned I can run my very own DeepSeek R1 chatbot on just $6,000 of PC hardware and no megabucks Nvidia GPUs required
Alibaba
Forget DeepSeek R1, apparently it's now Alibaba that has the most powerful, the cheapest, the most everything-est chatbot
A digitally generated image of abstract AI chat speech bubbles overlaying a blue digital surface.
We need a better name for AI, or we risk talking past each other until actually intelligent AGI comes home mooing
CHONGQING, CHINA - OCTOBER 30: In this photo illustration - The Facebook app page is displayed on a smartphone in the Apple App Store in front of the Meta Platforms, inc. logo on October 30, 2024 in Chongqing, China. (Photo by Cheng Xin/Getty Images)
Meta might've done something useful, pioneering an AI model that can interpret brain activity into sentences with 80% accuracy
SUQIAN, CHINA - JANUARY 27, 2025 - An illustration photo shows the logo of DeepSeek and ChatGPT in Suqian, Jiangsu province, China, January 27, 2025. (Photo credit should read CFOTO/Future Publishing via Getty Images)
China's DeepSeek chatbot reportedly gets much more done with fewer GPUs but Nvidia still thinks it's 'excellent' news
SUQIAN, CHINA - JANUARY 27, 2025 - An illustration photo shows the logo of DeepSeek and ChatGPT in Suqian, Jiangsu province, China, January 27, 2025. (Photo credit should read CFOTO/Future Publishing via Getty Images)
The brass balls on these guys: OpenAI complains that DeepSeek has been using its data, you know, the copyrighted data it's been scraping from everywhere
Latest in Hardware
Nvidia headquarters
Nvidia loses over $200 billion in valuation in a single day, as Trump's tariffs continue to roll out
Nvidia RTX 5070 Founders Edition graphics card from various angles
It might be hard to imagine even worse GPU prices but the CEOs of Best Buy and Target both predict tariffs will push consumer prices up and fast
Nvidia RTX 5070 Founders Edition graphics card from various angles
Saying what we're probably all thinking, Zotac cautions, 'Do not use third-party cables, angled adapters, or other cable accessories'
Logitech G Pro PowerPlay 2 mousepad on top of another mousepad on top of a third mousepad on top of a desk
I was wrong, the Logitech G PowerPlay 2 charging mouse pad isn't smaller than the first one, it's just the official dimensions were listed incorrectly since 2021
A photo of an ASRock Z890 Taichi Lite motherboard
ASRock Z890 Taichi Lite review
DIY Perks TV and projector
This DIY 'infinite contrast' screen uses an old projector in a seriously clever way and makes monitors with full-array dimming look like absolute garbage
Latest in News
Image of Pinhead from Dead by Daylight
'He came. And now he must go,' and that's why Pinhead is leaving Dead by Daylight in April
Image of Tecumseh in Civilization 7
Civilization 7's 'first major update' tweaks balance and fixes some UI issues, but don't expect an overhaul
Jeff Jarrett headshot
Legendary 1990s publisher Acclaim is back from the dead, and a pro wrestler famous for clobbering people with a guitar is on its advisory board
Monster Hunter Wilds screen
Monster Hunter Wilds sells 8 million copies in 3 days, 'the fastest any game has done so in Capcom’s history'
Tony Hawk doing a kickflip or whatever the hell it is in the cover art for Tony Hawk's Pro Skater 3 + 4
Tony Hawk's Pro Skater 3 + 4 remake is real, and it's coming in July with new skaters, parks, music, and more
The streamer Emiru gives the peace sign to camera.
Three women livestreaming on Twitch harassed by man who then goes for them while making repeated death threats: 'This happens off-camera to women all the time'