Rosenverse

This video is only accessible to Gold members. Log in or register for a free Gold Trial Account to watch.

Log in Register

Most conference talks are accessible to Gold members, while community videos are generally available to all logged-in members.

[Demo] How to re-categorize content at scale using LLMs
Gold
Wednesday, June 5, 2024 • Designing with AI 2024
Share the love for this talk
[Demo] How to re-categorize content at scale using LLMs
Speakers: Jorge Arango
Link:

Summary

Large Language Models (LLMs) are to language as spreadsheets are to numbers: tools for modeling, exploration, and development. Among their many capabilities, LLMs can alleviate chores related to the design and implementation of information architectures. But doing so requires venturing beyond chat-based interfaces. In this brief demonstration, we'll see how to use OpenAI's API and a few open source command line tools to re-categorize content in a 1,000+ page website. The techniques demonstrated can be extended to other common content organization tasks.

Key Insights

  • Manual retagging of 1,200 blog posts would take about 10 hours, but leveraging GPT-4 reduced active human time to about 2 hours.

  • Using GPT-4 via command line and shell scripts enables automated tagging outside typical chat interfaces.

  • An organically grown taxonomy over 20 years contained unclear acronyms and inconsistent tag forms that GPT initially struggled with.

  • Cleaning and standardizing the taxonomy before prompting GPT is critical for effective AI assistance.

  • A review step of AI-suggested tags in CSV format allows human correction to avoid hallucinations entering production.

  • GPT-4 can propose new and useful tags outside the original taxonomy, enriching content classification.

  • The four-step GRU framework (Gather, Review, Update, Wrap up) balances automation with human oversight.

  • Storing blog content as markdown files simplifies integrating AI workflows via scripting and file manipulation.

  • The approach is adaptable and scalable to other CMS platforms by replacing scripting with API calls.

  • Taxonomies should use clear, unambiguous terms to improve both human and AI understanding.

Notable Quotes

"Some of the older content has discoverability problems, which is typical with blogs."

"Doing this tagging manually would have taken me around 10 hours of mind-numbing work."

"I’m actually using GPT-4, but not via the chat interface—I'm calling it from the Mac’s command line."

"I had to clean the taxonomy up because GPT wouldn’t know what to do with acronyms like TAOI."

"I save the proposed tags to a CSV file so I can preview and edit them before applying the changes."

"A middle review step prevents hallucinations from making it into the production site."

"GPT-4 functioned as an assistant not just in retagging but also in improving the taxonomy itself."

"The entire process took about three hours from start to finish, about a fifth of the manual time."

"Use clear and obvious terms in taxonomies—unusual acronyms won’t make sense to GPT or others."

"You need to review proposed changes before committing them to production, otherwise errors sneak in."

Ask the Rosenbot
Peter Merholz
The Mysterious Case of the Missing UX Career Path
2022 • DesignOps Community
Sean McKay
Coexisting with non-researchers: Practical strategies for a democratized research future
2025 • Advancing Research 2025
Gold
Etienne Fang
The Power of Care: From Human-Centered Research to Humanity-Centered Leadership
2021 • Advancing Research 2021
Gold
Amy Brana Stuart
Rest in Peace Fly-in-fly-out Design
2022 • Design at Scale 2022
Gold
Prayag Narula
Dialing for Research: How to Reach the Unreachable
2022 • Advancing Research 2022
Gold
Lona Moore
Scaling Design Beyond Designers
2021 • Design at Scale 2021
Gold
Mike Oren
Improving Democratized Research with CustomGPTs and Gems
2026 • Rosenfeld Community
Lukas Moro
“Feels Like Paper!”: Interfacing AI through Paper
2025 • Designing with AI 2025
Gold
Daniel Orbach
Zero to One: Co-Creating Operating Models with your Team
2024 • DesignOps Summit 2024
Gold
Paul Pangaro, PhD
Systems Disciplines: Table Stakes for 21st Century Organizations
2023 • Enterprise UX 2023
Gold
Andrea Gallagher
The Problem Space
2019 • Advancing Research Community
Jerome “Axle” Brown
How to Use Self-Directed Learning to Ensure Your Research Insights are Heard and Acted Upon
2021 • Advancing Research 2021
Gold
Malini Rao
Lessons Learned from a 4-year Product Re-platforming Journey
2021 • Design at Scale 2021
Gold
Megan Blocker
A Selectively Scrappy Approach to ResearchOps
2018 • DesignOps Summit 2018
Gold
Nathan Shedroff
Double Your Mileage: Use Your Research Strategically
2020 • Advancing Research 2020
Gold
Sabrina Mach
How to Design Your Design Operating Model
2021 • DesignOps Summit 2021
Gold

More Videos

Jen Briselli

"Nudge to me now is much more about wiggle it and see what happens rather than expecting exact outcomes."

Jen Briselli

Learning Is The Engine: Designing & Adapting in a World We Can’t Predict

April 16, 2025

Robert Fabricant

"This is probably the biggest change we are going to see within our working careers."

Robert Fabricant Sahibzada Mayed Nidhi Singh Rathore

Industry junctures: Paths forwards for UXR and the critical decisions that get us there [Advancing Research Community Workshop Series]

October 2, 2024

Alla Weinberg

"Our nervous system does not know the difference between a tiger and an angry email from a manager—it just senses danger."

Alla Weinberg

Design Teams Need Psychological Safety: Here’s How to Create It

September 8, 2022

Dan Hill

"We need distributed, shared, and participatory technologies for common good outcomes."

Dan Hill

Designing for the infrastructures of everyday life

June 4, 2024

Bria Alexander

"Operations is the de-risking activity that creates scalable processes for better outcomes - Theresa."

Bria Alexander Patrizia Bertini Peter Boersma Jon Fukuda Dave Malouf Theresa Slate Changying (Z) Zheng

Charting the future of DesignOps: A community workshop

April 18, 2024

Chris Geison

"Prioritization isn’t about scoring to pick projects, but about facilitating discussion and aligning the team’s understanding."

Chris Geison

What is Research Strategy?

March 11, 2021

Sam Proulx

"Nothing about us without us."

Sam Proulx

Accessibility: An Opportunity to Innovate

November 16, 2022

Dagmara Kukawka

"Co-creation is highly actionable for teams and genuinely engaging for customers."

Dagmara Kukawka

Tiny team, moonshot impact: Democratizing research across continents

March 10, 2026

Aleksandra Korczynska

"Continuous feedback requires collecting responses tied to the same user across multiple channels with behavioral targeting."

Aleksandra Korczynska Caroline Jarrett Justyna Parmee

Survey Tools

March 12, 2026