blog-04

DeepSeek and the rise of AI reasoning

“AI is amazing at guessing quickly, but it fundamentally can’t reason.”

That is a quote from someone I know very well, circa 2022. I wonder what the thought process was; the reasoning that went into that statement. Luckily I don't have to guess, because that person was me. I made that statement in response to the first rounds of LLMs (GPT-3, GPT-3.5, PaLM 2, etc.). It remained a firm conviction of mine through the release of GPT-4o and Anthropic's latest Claude models.

Both proved incredible at solving problems, but only if they were presented with all the facts and then guided through the steps. We were "patching" the reasoning process with hacks such as prompt engineering, RAG systems, multi-agent conversations, chain-of-thought prompting, and so on.
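As a concrete example of what that "patching" looked like, here is a minimal sketch of chain-of-thought prompting using the OpenAI Python client. The model name and the wording of the instructions are illustrative placeholders; the point is simply that the reasoning steps had to be requested in the prompt, because the model would not plan them on its own.

# Requires: pip install openai, plus an OPENAI_API_KEY in the environment.
from openai import OpenAI

client = OpenAI()

# The "hack": explicitly ask the model to reason step by step before answering.
response = client.chat.completions.create(
    model="gpt-4o",  # illustrative; any chat model of that era behaves similarly
    messages=[
        {"role": "system", "content": "Before answering, list your assumptions, "
                                      "work through the problem step by step, "
                                      "then state the final answer on its own line."},
        {"role": "user", "content": "A nightly regression suite of 1,200 tests takes 90 minutes. "
                                    "If we split it evenly across 4 runners, roughly how long will a run take?"},
    ],
)

print(response.choices[0].message.content)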

At Tricentis, our team is working day to day implementing LLMs to solve complex multi-step problems, so we were at the coalface, seeing just how unreliable these solutions became in practice. Frustrating days passed as we tried to figure out "Why won't the AI do the obvious next step!?" or as we added more and more complex prompts, shoehorning in predictability at the cost of generalized usefulness. At every turn, the conviction that AI couldn't reason became more concrete in my mind.

Until now. So what changed?

On Jan. 20, the Chinese AI company DeepSeek released a language model called R1 that, according to the company, outperforms industry-leading models like OpenAI o1 on several benchmarks. These two models fit into a new class: models designed, and trained, to reason. Let's dive into why that matters, and why DeepSeek R1 specifically has sent shockwaves through the AI industry.

Traditional LLMs can’t reason

Traditional language models (strange as it may sound to say that about a technology all of eight minutes old) are generally trained in two ways:

1. Fill in the blank

A common training method involves masking words in a sentence and having the model predict what those words should be. This is a form of unsupervised training, since no human intervention is needed, and it is the baseline of all LLMs. By showing them massive amounts of text where certain words are obscured, they learn to predict what words fill the gap based on the surrounding text. It's a massive oversimplification, hiding the complexity of attention heads and tokens, but essentially this allows them to learn the manifold meanings of language in context, not just the dictionary definitions.
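If you want to see this objective in action, here is a minimal sketch using the Hugging Face transformers library, assuming the classic bert-base-uncased masked language model is available to download:

# Requires: pip install transformers torch
from transformers import pipeline

# A masked language model predicts the hidden word from its surrounding context.
fill_mask = pipeline("fill-mask", model="bert-base-uncased")

for prediction in fill_mask("The tester wrote a [MASK] to reproduce the defect."):
    print(f"{prediction['token_str']:>12}  score={prediction['score']:.3f}")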

2. Reinforcement learning with human feedback (RLHF)

Reinforcement learning with human feedback, or RLHF, is a supervised technique, often used for fine-tuning after the initial training. It teaches the AI to get better at giving the "type of responses humans expect." Yes, that's a vague phrase, but we are vague beings. Here, we tune the LLM to answer questions, respond to conversation, and offer feedback that improves subsequent responses. We basically teach it to be a chatbot, and as a by-product it learns to approximate solutions to some problems, and to deliver them in language that its trainers have deemed socially acceptable.
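RLHF pipelines vary, but at their core sits a reward model trained on human preference pairs. Below is a toy sketch of that preference loss in PyTorch; the "reward model" here is a single linear layer over made-up embeddings, purely to show the Bradley-Terry-style objective, not anything resembling a production pipeline.

# Requires: pip install torch
import torch
import torch.nn as nn
import torch.nn.functional as F

# Toy reward model: in a real RLHF setup this would be a full transformer
# scoring an entire prompt/response pair.
class TinyRewardModel(nn.Module):
    def __init__(self, dim: int = 16):
        super().__init__()
        self.score = nn.Linear(dim, 1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.score(x).squeeze(-1)

reward_model = TinyRewardModel()
optimizer = torch.optim.Adam(reward_model.parameters(), lr=1e-2)

# Stand-in embeddings for responses a human preferred ("chosen") vs. rejected.
chosen, rejected = torch.randn(8, 16), torch.randn(8, 16)

for step in range(200):
    # Bradley-Terry style loss: the chosen response should out-score the rejected one.
    loss = -F.logsigmoid(reward_model(chosen) - reward_model(rejected)).mean()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

# The trained reward model is then used (e.g. via PPO) to nudge the LLM toward
# responses that humans would rate highly.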

This is not how we solve complex problems

Think about how you would solve a complex problem. This could be anything from designing an application feature, to writing this blog post. You plan, you build, you iterate, you correct, you conclude. It is a multi-step process, involving continual reflection: Am I doing this right? Is this the right track? Do I need to adjust?

This is where traditional LLMs fail. They are trained to give the answer in one quick shot.

Thinking fast and slow

In his book "Thinking, Fast and Slow," Daniel Kahneman proposes two models for how we think: System 1, which is fast, intuitive, and automatic, and System 2, which is slow, deliberate, and effortful.

These map really well to the patterns of language models. All the models released prior to OpenAI’s o1 model were essentially System 1 models. They responded fast, matched patterns, and did their best based on what you gave them.

Again, what changed?

OpenAI o1 and the introduction of System 2 (reasoning) models

OpenAI is not Open AI

When OpenAI launched o1 in September of 2024, they titled the post “Learning to reason with LLMs.” The results in benchmarks were pretty impressive, especially in complex tasks:

I found it particularly fascinating that it showed no improvement in AP English, demonstrating once and for all to my high school English teacher that English is irrational.

But OpenAI only published the what. They gave very little information as to the how; that was something of a competitive differentiator. Claims of massive compute requirements, of cups of water vanishing with every request, and of power demands that would force entire nations to redo their energy grids fed the venture capital machine, pushing more and more money into these heavily funded, transparent-as-mud AI enterprises on the promise of human-level reasoning models finally coming to fruition.

Evidence of System 2 "thinking": The shift to test-time compute

o1 was the first model to shift the way models respond to questions. Instead of immediately launching into the answer, like a nervous intern at a job interview, it was designed to think first. When answering a question, the model begins an internal monologue, planning out its actions. When OpenAI launched o1, they called this "the hidden chain of thought," and, like everything else OpenAI does, the techniques were hidden along with it. The result, however, was that the o1 series of models began to output complex, step-by-step plans to solve difficult problems, achieving impressive results!

Partial o1 response to a chemistry question

This was the shift away from train-time compute, where the model learns the patterns and responds 'instinctually,' to test-time compute, where the model responds far more slowly, with more compute required (driven by that 'internal monologue'), before giving a thoughtful, planned-out answer.
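You can see the cost of that internal monologue directly in the API. The sketch below, using the OpenAI Python client, asks an o1-class model a question and then reads back how many hidden reasoning tokens were generated before the visible answer appeared. The model name and usage fields reflect the API as documented around the o1 launch and may have shifted since, so treat them as assumptions.

# Requires: pip install openai, plus an OPENAI_API_KEY in the environment.
from openai import OpenAI

client = OpenAI()

response = client.chat.completions.create(
    model="o1",  # illustrative; any reasoning-class model reports similar usage
    messages=[{"role": "user", "content": "Plan a test strategy for a currency-conversion API."}],
)

usage = response.usage
print("visible completion tokens:", usage.completion_tokens)
# Test-time compute you pay for but never see: the hidden chain of thought.
print("hidden reasoning tokens:  ", usage.completion_tokens_details.reasoning_tokens)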

Why does DeepSeek change the game?
DeepSeek is Open AI (sort of)

When DeepSeek published their paper on how they trained DeepSeek R1, they included the training techniques, experiments, ablation studies (stripping out individual techniques to show what actually contributed), failures, future experiments, and quantization and optimization methods. When they published their model, it was ready to be fine-tuned, easily accessible on Hugging Face, and open for use. It was published under the permissive MIT license, meaning it can be used commercially and without restriction.

What DeepSeek didn't publish is the dataset they used to train R1. That remains closed, and speculation abounds that they used OpenAI models to train and fine-tune their own. Aside from that minor detail, R1 is a very transparent AI release, one that allows enterprises and researchers to experiment with and train powerful reasoning models on their own use cases – potentially at a significantly lower cost, and with significantly fewer resources.

DeepSeek is cheap and fast

Alongside its full-size model, DeepSeek also released distilled versions of R1 – smaller models optimized to run on consumer hardware – opening up the opportunity for edge LLMs and calling a major AI industry assumption into question: that the best way to make AI models smarter is to give them more computing power. Benchmarks are still pouring in, but this Reddit post is replete with examples of the models running on Mac M3 or consumer-grade Nvidia chips.
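To give a feel for how accessible this is, here is a minimal sketch of running one of the distilled models locally with the transformers library. The repository name is the one DeepSeek listed on Hugging Face at release (treat it as an assumption and check the hub), and a recent GPU or Apple-silicon machine is assumed for the 7B variant.

# Requires: pip install transformers accelerate torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "deepseek-ai/DeepSeek-R1-Distill-Qwen-7B"  # assumed Hugging Face repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto", device_map="auto")

messages = [{"role": "user", "content": "Design boundary tests for a function that validates IBANs."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=2048)
text = tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True)

# R1-style models emit their reasoning between <think> tags before the final answer.
reasoning, _, answer = text.partition("</think>")
print(answer.strip() or text)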

This may be the reason behind the massive sell-off of Nvidia stock; however, it should be noted that DeepSeek has access to a massive number of Nvidia H100 chips, at a lowball retail estimate of $1.5 billion, so training models is still the domain of the well-funded. It has been widely reported that DeepSeek spent $6 million on the hardware used for R1’s final training run. I would be shocked if the total cost was less than $50 million, so take the hype here with a small trailer-load of salt. 

DeepSeek demystified reasoning

Most importantly, DeepSeek contributed the "how" of reasoning models back to the general public, allowing researchers, startups, and giant tech companies to train their own reasoning models on specific use cases. They also provided invaluable lessons learned. Here are a few of my favorites, taken from the DeepSeek paper:

  1. By incentivizing the model to think first, but not teaching it what to think, the model naturally learned to expand the amount of time spent thinking before answering, which led to better solutions on more complex problems (a toy sketch of this kind of rule-based incentive follows after this list).
  2. Without prompting, the model learned to "rethink," re-evaluating its path and identifying and correcting mistakes in its reasoning. Since what we get from the model is essentially a stream of consciousness, it was almost charming how "humanesque" it was when identifying its mistakes.
  3. Guidance (cold-start data) is still required to "humanize" the outputs. Without it, the model was completely happy generating its chain of thought while swapping between languages and formats, but it was absolutely unreadable. A little example data went a long way here.
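To make that first lesson concrete, here is a toy sketch of the kind of rule-based rewards the paper describes: one reward for getting a verifiable answer right, and one for keeping the reasoning and answer in the expected tag format. The tag names and weighting are illustrative, not DeepSeek's exact implementation.

import re

def format_reward(completion: str) -> float:
    # Reward completions that keep the reasoning and the answer in the expected layout.
    pattern = r"^<think>.*?</think>\s*<answer>.*?</answer>$"
    return 1.0 if re.match(pattern, completion.strip(), re.DOTALL) else 0.0

def accuracy_reward(completion: str, reference: str) -> float:
    # Reward completions whose final answer matches a verifiable reference
    # (a math result, a compiling program, a passing test suite, ...).
    match = re.search(r"<answer>(.*?)</answer>", completion, re.DOTALL)
    return 1.0 if match and match.group(1).strip() == reference.strip() else 0.0

def total_reward(completion: str, reference: str) -> float:
    # The RL loop maximizes this signal; nobody tells the model what to think,
    # only that correct, well-formatted answers score higher.
    return accuracy_reward(completion, reference) + format_reward(completion)

print(total_reward("<think>2 + 2 is 4</think> <answer>4</answer>", "4"))  # 2.0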

Price at the cost of privacy

Much of the panic in the AI market has been driven by the fact that DeepSeek is offering its full R1 API at pennies on the dollar compared to OpenAI: 

API pricing comparison between DeepSeek-R1 and OpenAI models by DeepSeek API Docs

Yes, those invisible bars are DeepSeek prices. But there is a heavy hidden cost here.

One thing DeepSeek is not open about is how they use, store, and manage the data that you send to them. Reports of it collecting keystroke, prompt, audio, and video data open up legitimate concerns about how that data is used, for what purpose, and by whom. I am not about to wade into the murky land of geopolitics, but suffice it to say that you should talk to your organization's legal counsel and data privacy team before you even send your first prompt. For reference, this is the reaction that mine gave:

But what about o3?

True, the DeepSeek models are not yet as good as the o3 models, but I am convinced that they will be. I am convinced because the template they provided for how these models are trained gives a low-cost path toward incrementally improving, specializing, and deploying these models for tasks such as coding, math, and science. We will see the next version of DeepSeek within months, and open research alternatives and improvements are already popping up.

The moat built around OpenAI’s reasoning models has been bridged, and the game is afoot. 

What does this mean for QA and DevOps?
These are planning tasks

Current models are doing all right at dev and QA tasks. They are proving good at generating code or proposing tests, but they have a shortcoming: they perform poorly when the task requires thinking one or two layers deeper. For coding, this might mean considering design principles or planning out an API library before implementing it. For QA, it takes the form of applying testing strategies and considering techniques such as boundaries, security loopholes, confounding variables, and combinations of inputs that could provoke defective behavior.
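As a concrete example of the "one layer deeper" thinking we want from these models, here is the kind of boundary-focused test plan a reasoning model should be able to produce and then implement. The discount function and its rules are hypothetical, invented purely for illustration.

# Requires: pip install pytest
import pytest

def apply_discount(price: float, percent: float) -> float:
    # Hypothetical function under test: discounts must be between 0 and 100 percent.
    if price < 0 or not 0 <= percent <= 100:
        raise ValueError("invalid price or discount")
    return round(price * (1 - percent / 100), 2)

# Boundary values and invalid combinations a reasoning model should plan for,
# rather than the single happy-path case a pattern-matching model tends to produce.
@pytest.mark.parametrize("price, percent, expected", [
    (100.0, 0, 100.0),     # lower boundary of the discount range
    (100.0, 100, 0.0),     # upper boundary of the discount range
    (0.0, 50, 0.0),        # zero price edge case
    (19.99, 12.5, 17.49),  # rounding behavior
])
def test_valid_boundaries(price, percent, expected):
    assert apply_discount(price, percent) == expected

@pytest.mark.parametrize("price, percent", [(-1.0, 10), (100.0, -0.01), (100.0, 100.01)])
def test_invalid_inputs_are_rejected(price, percent):
    with pytest.raises(ValueError):
        apply_discount(price, percent)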

O1 models are good, but most of the data is confidential

At Tricentis, we have seen strong reluctance among our customers to fully adopt cloud-based, opaque models. They may trust us, but we are asking them to also extend that trust to a third party (OpenAI). This is a bridge too far for many data-savvy enterprises, or for groups that carry heavy regulatory or privacy burdens.

DeepSeek opens the door to custom, private, edge AI

DeepSeek provides the ideal blend of performance (being a competent reasoning model), adaptability (through fine-tuning, with models like DeepSeek Coder already popping up), and deployability (distilled models small enough to run on private infrastructure). I see this being a game changer for the highly sensitive world of development and QA.

Closing thoughts

If you made it this far by reading the whole article, thank you. If you skipped to the end to get the conclusions, here they are: 

Author:  David Colwell

VP, AI & Machine Learning

Date: Jan. 30, 2025 

blog-03

Unlocking Business Growth with SEO

In today’s digital-first world, having a strong online presence is no longer optional—it’s essential. Search Engine Optimization (SEO) is at the heart of this digital strategy, driving visibility, engagement, and growth for businesses of all sizes. With over 93% of online experiences starting with a search engine (BrightEdge), SEO has proven to be one of the most effective tools for generating long-term, sustainable success.

What Is SEO and Why Does It Matter?

SEO is the process of optimizing a website to rank higher in search engine results pages (SERPs), making it easier for potential customers to find your business. It’s about matching your content with what users are searching for and ensuring your website delivers the best experience possible.

According to a study by HubSpot, 75% of users never scroll past the first page of search results. This statistic underscores the importance of securing a spot on the first page if you want to capture potential leads and drive traffic to your site. 

How SEO Benefits Organizations

1. Increased Website Traffic

2. Higher ROI

3. Enhanced Credibility

4. Cost-Effectiveness

5. Targeted Reach

The Financial Impact of SEO

Key SEO Strategies to Drive Success

1. Keyword Research and Optimization

2. Content Creation

3. Technical SEO

4. Backlink Building

5. Local SEO

SEO in Action: Real-World Examples

Why SEO Is a Long-Term Investment

Unlike paid ads, which stop generating traffic as soon as you pause the campaign, SEO continues to deliver results over time. A well-optimized website can maintain its rankings and attract consistent traffic for months or even years. This long-term value makes SEO an indispensable part of any business strategy. 

How LogIQ Curve Can Help

At LogIQ Curve, we specialize in crafting data-driven SEO strategies tailored to your business goals. Our team of experts leverages cutting-edge tools and proven techniques to: 

Our SEO Services Include:

Take Your Business to the Next Level with SEO

The numbers speak for themselves—SEO is a powerful tool that can transform your business. Whether you’re looking to generate more leads, increase sales, or build brand credibility, investing in SEO is a decision you won’t regret.

Ready to elevate your online presence? Contact LogIQ Curve today to discover how our professional SEO services can help you achieve your business goals and unlock new levels of success.

Stay ahead. Stay visible. Grow with SEO.

blog-02

Staff Augmentation

In an increasingly competitive and fast-paced global market, organizations face mounting challenges to stay agile, efficient, and cost-effective. Among these challenges, talent acquisition and workforce scalability are critical factors that can make or break project success. Staff augmentation has emerged as a game-changing solution, enabling organizations to bridge skill gaps, reduce costs, and optimize their operational efficiency.

What Is Staff Augmentation?

Staff augmentation is a flexible outsourcing model that allows organizations to hire skilled professionals on a temporary basis. Unlike traditional hiring, staff augmentation provides businesses with on-demand access to a global talent pool, enabling them to scale their teams based on specific project requirements.

According to a report by Statista, the global outsourcing market is valued at over $92.5 billion, with staff augmentation gaining significant traction as a preferred engagement model due to its flexibility and cost-effectiveness. 

The Rising Need for Staff Augmentation

1. Bridging Skill Gaps

With the rapid evolution of technology, organizations often struggle to find talent with expertise in niche areas like AI, machine learning, cybersecurity, and cloud computing. Staff augmentation allows businesses to hire professionals with specialized skills, ensuring projects are completed on time and to the highest standards. 

2. Cost-Effective Talent Acquisition

Traditional hiring processes can be expensive and time-consuming. On average, it takes 36 days to hire a new employee, costing businesses around $4,425 per hire (SHRM). Staff augmentation eliminates these overheads by providing ready-to-deploy talent, reducing recruitment time and costs significantly. 

3. Scalability and Flexibility

Whether it’s a short-term project or a peak workload period, staff augmentation enables organizations to scale their workforce up or down as needed. This adaptability ensures that businesses only pay for the resources they require, improving cost efficiency. 

How Staff Augmentation Helps Organizations Save on Costs

1. Eliminating Overhead Expenses

Hiring full-time employees involves not only salaries but also benefits, office space, training, and onboarding expenses. With staff augmentation, businesses save an estimated 30-40% on overhead costs by hiring remote or contract professionals. 

2. Reduced Training Costs

Staff augmentation professionals are often highly experienced in their respective fields, requiring minimal training. This saves organizations both time and money, allowing them to focus on project execution.

3. Improved Resource Utilization

With staff augmentation, businesses avoid overstaffing during low-demand periods. By hiring professionals on a project-by-project basis, organizations can allocate resources more efficiently, reducing waste and optimizing budgets.

4. Access to Global Talent

Staff augmentation taps into a global talent pool, offering cost advantages in regions with lower labour costs. For instance, hiring a software developer in South Asia can cost up to 60% less than hiring a developer in North America, without compromising on quality.

Key Benefits of Staff Augmentation

Real-World Applications of Staff Augmentation

1. Software Development

Organizations often face tight deadlines for app development or software upgrades. Staff augmentation ensures they have access to developers with expertise in cutting-edge technologies like Python, React, and AI/ML.

2. Cybersecurity

With cyberattacks costing businesses an average of $4.24 million per breach (IBM), having the right cybersecurity talent is crucial. Staff augmentation helps organizations hire specialists for threat detection, compliance, and incident response. 

3. Digital Transformation

As businesses migrate to cloud platforms or adopt IoT solutions, staff augmentation provides the necessary technical expertise to ensure seamless transitions and minimize disruptions. 

Why Choose LogIQ Curve for Staff Augmentation?

At LogIQ Curve, we specialize in providing tailored staff augmentation solutions that align with your business objectives. Our services include: 

Statistics Highlighting the Impact of Staff Augmentation

Staff augmentation is more than just a cost-saving measure—it’s a strategic approach to building a flexible, agile workforce. Whether you’re tackling a complex project, bridging skill gaps, or scaling your team for peak demand, LogIQ Curve is your trusted partner in success.

Contact us today to explore how our staff augmentation services can transform your organization, optimize costs, and help you achieve your business goals.

Let’s grow together. 

blog-01

The Imperative of Cybersecurity

In today’s hyper-connected world, cybersecurity is not just a technical requirement—it is a strategic necessity for every organization. As businesses continue to embrace digital transformation, the attack surface expands, exposing critical assets to an ever-growing array of cyber threats. Protecting these assets is no longer optional; it is vital to preserving trust, ensuring operational continuity, and achieving long-term success.

The Evolving Threat Landscape

Cyberattacks are growing in sophistication, frequency, and impact. From ransomware that cripples operations to phishing scams that target employees, attackers exploit vulnerabilities across systems, networks, and human behaviour. Recent statistics reveal alarming trends:

These realities highlight the critical importance of a comprehensive cybersecurity strategy tailored to your organization’s unique needs.

Why Cybersecurity Matters for Every Organization

1. Data Protection

2. Operational Continuity

3. Compliance Requirements

4. Intellectual Property Security

5. Resilience Against Emerging Threats

Comprehensive Cybersecurity Solutions by LogIQ Curve

At LogIQ Curve, we understand that cybersecurity is more than just firewalls and antivirus software. It’s about creating a resilient digital environment where risks are identified, mitigated, and continuously monitored. Our tailored solutions help organizations of all sizes build robust defences and respond effectively to emerging threats.

Our Key Offerings Include:

1. Vulnerability Assessment and Penetration Testing (VAPT)

2. Information Security Management (GRC)

3. Cloud Security

4. Managed Security Services

5. Employee Awareness Training

Building a Culture of Security

A strong cybersecurity posture begins with a security-first culture. Organizations must prioritize cybersecurity at every level—from the boardroom to the IT department. This involves: 

The Cost of Neglecting Cybersecurity

Failing to prioritize cybersecurity can lead to devastating consequences: 

Organizations that invest in cybersecurity not only protect themselves but also position themselves as trustworthy and reliable partners in their industries.

Future-Proof Your Business with LogIQ Curve

Cybersecurity is not a one-time task—it’s an ongoing commitment. At LogIQ Curve, we partner with organizations to build adaptive, scalable cybersecurity frameworks that evolve with changing threats. Our expertise ensures that your organization remains protected, compliant, and ready for the future.

Secure today to thrive tomorrow. Let us help you safeguard your digital assets and build resilience against evolving cyber threats. Contact LogIQ Curve for tailored cybersecurity solutions designed to protect and empower your business.

Stay Ahead, Stay Secure.