Rosenverse

This video is only accessible to Gold members. Log in or register for a free Gold Trial Account to watch.

Log in Register

Most conference talks are accessible to Gold members, while community videos are generally available to all logged-in members.

Latent Scope: Finding structure in unstructured data
Gold
Wednesday, June 11, 2025 • Designing with AI 2025

This video is featured in the AI and UX playlist and 1 more.

Share the love for this talk
Latent Scope: Finding structure in unstructured data
Speakers: Ian Johnson
Link:

Summary

As a data visualization designer and developer the challenge I often face is what to do with unstructured data. One case study I can show is exploring survey results where the multiple-choice questions are straightforward to analyze but interesting open-ended questions like “What do your colleagues not understand about data visualization?” are much harder to crack. Latent Scope is an open-source tool I built that streamlines a process of embedding text, mapping it to 2D, clustering the data points on the map and summarizing those clusters with an LLM. Once the process is done on a dataset structure emerges from the unstructured text, allowing us to get a sense of patterns in the survey answers. Themes like “the time it takes” to develop data visualization pop out, as do “the importance of good design.” While people don’t use the same language to describe these themes, they show up as clusters in the tool thanks to the power of embedding models. https://github.com/enjalot/latent-scope

Key Insights

  • AI embeddings transform unstructured data like text or sketches into high-dimensional vectors capturing hidden semantic patterns.

  • Dimensionality reduction algorithms map these high-dimensional embeddings into 2D clusters, making patterns visually accessible.

  • Free text survey responses, often too large to analyze manually, can be organized and explored effectively through embedding and clustering.

  • Google Cloud user journey data revealed surprising usage patterns, like enterprise and beginner tools being combined unexpectedly.

  • Simple rule-based filters for harmful prompts in generative AI image models can be bypassed by subtle prompt variations; embedding-based classifiers are more effective.

  • Latent Scope is an open source tool enabling non-technical users to embed, cluster, label, and visualize unstructured text data locally.

  • Local runs of embedding and clustering models on modest hardware are practical, preserving data privacy and sensitive user information.

  • Embedding models trained on multilingual data can cluster semantically similar texts across diverse languages in one shared space.

  • Breaking long text into smaller chunks can make embeddings more manageable and improve similarity comparisons.

  • Hands-on exploration of embedding models via platforms like Hugging Face helps users internalize concepts of semantic similarity and pattern discovery.

Notable Quotes

"People just don't get that the design and the process is fundamental."

"Similar inputs will produce similar high dimensional numbers."

"Dimensionality reduction algorithms take data points in high dimensional space and put them close together in 2D if they're similar."

"We found patterns that product teams didn’t expect or even want to look for."

"Simple word-list based filters can easily be tricked by misspellings or slight variations in prompts."

"What if you didn’t know there were important questions you should be asking in your data?"

"Latent Scope lets you quickly explore hundreds or thousands of free text responses to find clusters and patterns."

"You don’t need special hardware; these open source models can run locally on an M1 MacBook or a gaming machine."

"Multilingual embedding models can cluster similar meanings across languages in a shared latent space."

"Downloading and playing with local open source models gives a different experience than using faceless APIs mediated through interfaces."

Ask the Rosenbot
Nora Tejeda
Scaling Design Capabilities at BBVA Through a Self-service Design Model
2021 • Design at Scale 2021
Gold
Matt Webb
Context Window: Five Futures for AI
2025 • Designing with AI 2025
Gold
John Donmoyer
Shipping your code generation experiments to production
2025 • Designing with AI 2025
Gold
Bria Alexander
Opening Remarks Day 1
2024 • Advancing Research 2024
Gold
Sarah Williams
Verizon_A Framework for CX Transformation
2024 • Design at Scale 2021
Gold
Rich Mironov
How Can Product Managers and UXers Help Each Other (and Why are Product Folks so Annoying Sometimes)?
2022 • Design in Product 2022
Gold
Michael Weir
Mixed Methods and Behavioural Science
2023 • Rosenfeld Community
Rachael Dietkus, LCSW
Everything You Need to Know about the Civic Design 2022 Call for Presentations
2022 • Civic Design Community
Sam Ladner
Data Exhaust and Personal Data: Learning from Consumer Products to Enhance Enterprise UX
2016 • Enterprise UX 2016
Gold
Ariel Kennan
Theme Two Intro
2022 • Civic Design 2022
Gold
Greg Petroff
The Compass Mission
2021 • Advancing Research 2021
Gold
Sam Proulx
Designing For Screen Readers: Understanding the Mental Models and Techniques of Real Users
2021 • DesignOps Summit 2021
Gold
Yunyan Li
UX Best Practices
2021 • Design at Scale 2021
Gold
Andrew Webster
Scaling Design Capability: How Involved Should You Be?
2021 • DesignOps Summit 2021
Gold
Cara Maritz
The Art of Extrapolation
2023 • Advancing Research 2023
Gold
Sean McKay
Coexisting with non-researchers: Practical strategies for a democratized research future
2025 • Advancing Research 2025
Gold

More Videos

Lada Gorlenko

"Operations tends to be something companies only think about when they are scaling rapidly, but investing early is essential."

Lada Gorlenko Sharbani Dhar Sébastien Malo Rob Mitzel Ivana Ng Michal Anne Rogondino

Theme 1: Discussion

January 8, 2024

Mila Kuznetsova

"If a group session with middle schoolers goes off the rails, one-on-one sessions can save the research."

Mila Kuznetsova Lucy Denton

How Lessons Learned from Our Youngest Users Can Help Us Evolve our Practices

March 9, 2022

Jake Burghardt

"Scaling up research impact means turning visibility way up and opening access despite some risk of misuse."

Jake Burghardt

Stop wasting research: Create new value with insight summaries

July 9, 2025

Liam Thurston

"Soft skills often are a multiplier on your hard skills—they’re just as valuable, if not more so."

Liam Thurston

Why Your Design Team Is Quitting, And How To Fix It

June 10, 2022

Alexandra Schmidt

"Standard design research looks for pain points, not harms, and harms often cannot be identified in typical user research."

Alexandra Schmidt

Why Ethics Can't Save Tech

November 18, 2022

Renee Bouwens

"The sum is greater than the parts—how qualitative and quantitative research play together."

Renee Bouwens

Landing Product Impact: Aligning Research as a Foundational Driver for Delivering the World’s Best Products

December 15, 2023

Bria Alexander

"Every talk you hear today couldn’t get more personal to our community."

Bria Alexander

Theme Two Intro

October 3, 2023

Steve Sanderson

"If you're changing workflows in experiments, you need to keep track of impacted teams like call centers to avoid resistance."

Steve Sanderson Alissa Briggs Jeff Gothelf Bill Scott

Discussion

May 14, 2015

Mary-Lynne Williams

"I walked into the building and it just felt surreal. It didn’t feel right in my body to be there anymore."

Mary-Lynne Williams

Exit Interview #4: From Product Design Leadership to Sound Healing

January 14, 2026