Rosenverse

Accessible only to conference ticket holders.

Log in Create account Buy conference recordings

For 90 days after a conference, only paid ticket holders can watch conference videos. After that, all Gold members have access.

If you have purchased recording access and cannot see the video, please contact support.

Human vs. machine: Testing AI’s ability to synthesize and analyze research

Gold
Wednesday, March 11, 2026 • Advancing Research 2026

This video is featured in the AI Trends in User Research playlist.

Share the love for this talk
Human vs. machine: Testing AI’s ability to synthesize and analyze research
Speakers: Laura Klein
Link:

Summary

Nielsen Norman Group (NNG) has conducted and continues to conduct extensive research testing various large language model (LLM) tools designed for research synthesis and analysis. Our goal was to determine whether these AI-powered tools could meaningfully accelerate the work of experienced UX researchers. Through rigorous testing across multiple models and specialized research tools, we’ve found that while a few tools provide modest speed improvements for experienced researchers, none come close to replacing human expertise in research synthesis and analysis. The core problem is that these tools consistently exhibit critical flaws: they hallucinate findings, fail to identify meaningful patterns in qualitative data, cannot adequately consider nuanced research questions, and produce only superficial, high-level summaries of participant behavior. What makes this particularly dangerous is that these AI-generated outputs often have the veneer of legitimate research results—they look professional and sound plausible. However, closer inspection reveals significant gaps, inaccuracies, and missed insights that would mislead stakeholders and result in poor design decisions. The appearance of competence masks fundamental limitations that make these tools unreliable for serious research work. While we’ve found several places in the research process that can benefit from LLM usage, analysis and synthesis consistently falls short. In this talk, I can share the specific research we’re doing and explain what actually works.

Key Insights

  • AI tools frequently produce insight-shaped outputs but often lack the rigor and accuracy of trained human researchers.

  • AI moderators cannot currently assess user behavior beyond spoken words, missing key usability observations like failed or inefficient tasks.

  • Contextual elements such as environmental interruptions are critical in research but are invisible to AI tools.

  • Synthetic users generated by AI tend to produce overly positive, unrealistic feedback that can mislead product teams.

  • AI excels at finding semantic connections and grouping codes in large, already coded qualitative datasets quickly.

  • Meta-analysis of large repositories using AI can uncover recurring user themes, like change aversion, much faster than manual methods.

  • Integrating AI with organizational systems to pull in diverse data sources improves context but requires expert setup and is not yet simple.

  • AI’s context window limitations cause it to forget earlier input, affecting the accuracy of multi-turn interactions.

  • Even trained researchers must use AI outputs cautiously, vetting insights to maintain research quality.

  • Effective user research depends on human synthesis, collaboration, and contextual understanding, areas where AI currently fails.

Notable Quotes

"AI can generate insights, but it does not do them as well as a moderately trained human researcher."

"There is a world of difference between what a participant says and what they actually do, and AI misses that completely."

"AI tells you what you want to hear, which is dangerous if you’re making product decisions based on synthetic feedback."

"Our job as researchers is not making reports or interviewing users; it’s providing actionable, correct insights."

"AI tools are incentivized to produce final deliverables, but that’s an output, not the essence of research."

"AI is pretty good at finding semantic patterns among codes after human researchers have done the initial coding."

"Nobody is going to be satisfied by insight-shaped answers or high-level summaries masquerading as breakthroughs."

"AI cannot notice body language, tone, or environmental context during a research session."

"Using AI to scan large archives of research is a game changer for meta-analyses, even if it’s imperfect."

"Well-set-up AI systems pulling data from multiple company sources will have more context, but it’s still limited compared to human understanding."

Ask the Rosenbot
Bria Alexander
Day 2 Welcome
2024 • DesignOps Summit 2024
Gold
Himanshu Bharadwaj
If design had a heart
2026 • Rosenfeld Community
Kristin Sundermeyer
Design Ops Metrics
2021 • DesignOps Summit 2021
Gold
Frances Yllana
Theme 2 Intro
2024 • DesignOps Summit 2024
Gold
Maria Skaaden
Panel Discussion: Methodologies and Work Environments
2018 • DesignOps Summit 2018
Gold
Dan Willis
Enterprise Storytelling Sessions
2016 • Enterprise UX 2016
Gold
Patrick Boehler
Fishing for Real Needs: Reimagining Journalism Needs with AI
2025 • Designing with AI 2025
Gold
Kara Kane
Communities of Practice for Civic Design
2022 • Civic Design Community
Jemma Ahmed
Collaboration: learning from other fields beyond our own [Advancing Research Community Workshop Series]
2024 • Advancing Research Community
Melinda Belcher
Insider preview of Enterprise Experience 2020
2020 • Enterprise Community
Sam Ladner
How Research Can Drive Strategic Foresight
2022 • Advancing Research 2022
Gold
Peter Merholz
Design at Scale is People!
2021 • Design at Scale 2021
Gold
Noah Bond
Redefining truth and inclusivity: Navigating data ownership and ethical research in the age of disinformation
2025 • Advancing Research 2025
Gold
Bria Alexander
OKRs—Helpful or Harmful?
2022 • DesignOps Community
Louis Rosenfeld
Coffee with Lou: Should You Write a (UX) Book?
2024 • Rosenfeld Community
Jemma Ahmed
Theme Three Intro
2023 • Advancing Research 2023
Gold

More Videos

Benjamin Real

"We need to share the importance of design ops visibly so everyone understands their role in enabling design success."

Benjamin Real

Showing the Value of DesignOps by Not Having a DesignOps Team

October 21, 2020

Louis Rosenfeld

"As a starter, I’m great at beginning things, but the company has the infrastructure to maintain them."

Louis Rosenfeld

The Rosenbot and the Rosenverse: An AMA with Lou Rosenfeld

June 5, 2024

Bria Alexander

"If you ever feel unsupported or unsafe, please let us know. It is our priority to make sure you feel protected and supported."

Bria Alexander

Opening Remarks

June 11, 2021

Shelby Switzer

"The nitty gritty is a space to become resources for each other so people know who to talk to when they have questions."

Shelby Switzer

Making Space for Community Knowledge-sharing in a Distributed World

December 10, 2021

Sarah Gallimore

"Evan actually spent less than 15 minutes on the essay, and instead of working on homework, he was writing a letter to his significant other back home in Detroit."

Sarah Gallimore

Inspire Progress with Artifacts from the Future

November 18, 2022

Alana Washington

"Design ops is the heart, the connective tissue of our design organizations."

Alana Washington

Theme 1: Introduction and Provocation

January 8, 2024

Tony Turner

"Stakeholders can do their own searching at a high-level label level, while researchers dive deeper into nuanced tags."

Tony Turner

Capturing Deep Insights

September 30, 2021

James Wieselman Schulman

"Ten interviews is enough to get most of the way to meaningful insights."

James Wieselman Schulman

Research is a team sport: advancing the work when everyone does the research

March 11, 2026

Jemma Ahmed

"If I am difficult, then I am in good company, esteemed company, the best of company."

Jemma Ahmed

Theme 2 Intro

March 26, 2024