For many engineering teams looking to bring additional context to large language models (LLMs) to serve their business needs, there has long been a familiar sequence of techniques: first you try prompt engineering, and if that doesn’t work, you try retrieval-augmented generation (RAG), and then, if all else fails, you fine-tune your own model.
But as The New Stack contributor and Oracle senior application engineer Ibrahim Kamal argues, it’s a mistake to think of these as a linear progression.
“Instead, they represent different architectural methods for addressing different types of problems and introduce their own limitations and failure modes. Viewing them as a linear progression creates a false narrative that can lead to brittle systems that cannot adapt to changing requirements,” Kamal writes.
To decide which method will work best for a given use case and business need, Kamal proposes a decision tree with six dimensions.
Read the full piece to learn how to decide which architecture will work best for your use case.
Other stories we’re following today:
- Google is following Anthropic’s lead and building hooks into its Gemini CLI coding agent. Those hooks let developers execute pre-written scripts at specific moments in the agent loop, for example to run security scans, log tool usage, or add additional context for the model. TNS Senior Editor for AI Frederic Lardinois explains how to use this new feature.
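To make the pattern concrete, a hook script of this kind is typically just a small program that receives an event payload on stdin and does something with it. The sketch below is illustrative only, not Gemini CLI’s actual hook API: the payload field names and the `HOOK_LOG` variable are assumptions for the example.

```shell
# Illustrative tool-usage logging hook (a sketch, not Gemini CLI's real schema).
# The agent would invoke this at a hook point, passing event details on stdin.
log_tool_call() {
  payload=$(cat)  # assumed: a JSON payload describing the tool call arrives on stdin
  # Append a timestamped record to a log file (HOOK_LOG is a hypothetical override).
  printf '%s %s\n' "$(date -u +%Y-%m-%dT%H:%M:%SZ)" "$payload" >> "${HOOK_LOG:-/tmp/tool-usage.log}"
}
```

The same shape covers the other use cases mentioned: a security-scan hook would inspect the payload and exit nonzero to block the action, and a context hook would print extra text for the model instead of writing to a log.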