Rosenverse

This video is only accessible to Gold members. Log in or register for a free Gold Trial Account to watch.

Log in Register

Most conference talks are accessible to Gold members, while community videos are generally available to all logged-in members.

[Demo] How to re-categorize content at scale using LLMs
Gold
Wednesday, June 5, 2024 • Designing with AI 2024
Share the love for this talk
[Demo] How to re-categorize content at scale using LLMs
Speakers: Jorge Arango
Link:

Summary

Large Language Models (LLMs) are to language as spreadsheets are to numbers: tools for modeling, exploration, and development. Among their many capabilities, LLMs can alleviate chores related to the design and implementation of information architectures. But doing so requires venturing beyond chat-based interfaces. In this brief demonstration, we'll see how to use OpenAI's API and a few open source command line tools to re-categorize content in a 1,000+ page website. The techniques demonstrated can be extended to other common content organization tasks.

Key Insights

  • Manual retagging of 1,200 blog posts would take about 10 hours, but leveraging GPT-4 reduced active human time to about 2 hours.

  • Using GPT-4 via command line and shell scripts enables automated tagging outside typical chat interfaces.

  • An organically grown taxonomy over 20 years contained unclear acronyms and inconsistent tag forms that GPT initially struggled with.

  • Cleaning and standardizing the taxonomy before prompting GPT is critical for effective AI assistance.

  • A review step of AI-suggested tags in CSV format allows human correction to avoid hallucinations entering production.

  • GPT-4 can propose new and useful tags outside the original taxonomy, enriching content classification.

  • The four-step GRU framework (Gather, Review, Update, Wrap up) balances automation with human oversight.

  • Storing blog content as markdown files simplifies integrating AI workflows via scripting and file manipulation.

  • The approach is adaptable and scalable to other CMS platforms by replacing scripting with API calls.

  • Taxonomies should use clear, unambiguous terms to improve both human and AI understanding.

Notable Quotes

"Some of the older content has discoverability problems, which is typical with blogs."

"Doing this tagging manually would have taken me around 10 hours of mind-numbing work."

"I’m actually using GPT-4, but not via the chat interface—I'm calling it from the Mac’s command line."

"I had to clean the taxonomy up because GPT wouldn’t know what to do with acronyms like TAOI."

"I save the proposed tags to a CSV file so I can preview and edit them before applying the changes."

"A middle review step prevents hallucinations from making it into the production site."

"GPT-4 functioned as an assistant not just in retagging but also in improving the taxonomy itself."

"The entire process took about three hours from start to finish, about a fifth of the manual time."

"Use clear and obvious terms in taxonomies—unusual acronyms won’t make sense to GPT or others."

"You need to review proposed changes before committing them to production, otherwise errors sneak in."

Ask the Rosenbot
Trisha Terhar
Empathizing with the Empowered: Non-Researcher Responses to Democratization
2022 • Advancing Research 2022
Gold
Jon Fukuda
Theme 3 Intro
2024 • DesignOps Summit 2024
Gold
Peter Merholz
Design at Scale is People!
2021 • Design at Scale 2021
Gold
Dave Malouf
Theme 3: Introduction and Provocation
2024 • DesignOps Summit 2020
Gold
Dan Willis
Enterprise Storytelling Sessions
2019 • Enterprise Experience 2019
Gold
Allison Sanders
Operating with Purpose
2024 • DesignOps Summit 2020
Gold
Marisa Bernstein
It Takes GRIT: Lessons from the Small, but Mighty World of Civic Usability Testing
2021 • Civic Design 2021
Gold
Megan Blocker
A Selectively Scrappy Approach to ResearchOps
2018 • DesignOps Summit 2018
Gold
Dave Hoffer
UX Job Search AMA #3 with Joanne Weaver and Dave Hoffer
2025 • Rosenfeld Community
Saara Kamppari-Miller
Key Metrics: Comparing Three Letter Acronym Metrics That Include the Word “Key”
2024 • DesignOps Community
Melissa Eggleston
Practical People Skills for Building Trust on Teams and with Partners
2021 • Civic Design 2021
Gold
Nicole Aleong
What UX research can learn from other research practices [Advancing Research Community Workshop Series]
2023 • Advancing Research Community
Megan Blocker
What UX research maturity looks like and how we get there [Advancing Research Community Workshop Series]
2023 • Advancing Research Community
Doug Powell
Closing Keynote: Design at Scale
2018 • DesignOps Summit 2018
Gold
Shanti Mathew
Civic Design at Scale: Introducing the Public Policy Layer Cake
2021 • Civic Design 2021
Gold
Brad Peters
Short Take #1: UX/Product Lessons from Your Industry Peers
2022 • Design in Product 2022
Gold

More Videos

Alison Rand

"Design operations is really about how we’re scaling the work, thinking our practices, and serving cross-functional teams."

Alison Rand Sarah Brooks

Scaling Impact with Service Design

March 25, 2021

Carl Turner

"The difference between the way people talk about how things are done and how they are actually done reveals unconscious assumptions."

Carl Turner

You Can Do This: Understand and Solve Organizational Problems to Jumpstart a Dead Project

March 28, 2023

Sarah Flamion

"People make decisions with limited information, especially about more distant parts of the system."

Sarah Flamion

Complex Problem? Add Clarity by Combining Research and Systems Thinking

March 31, 2020

Silke Bochat

"If you do nothing, you cannot influence the future. Destination thinking shifts you from reactive to proactive future-oriented leadership."

Silke Bochat

5 Antifragile Strategies for a DesignOps 2.0

September 23, 2024

Uday Gajendar

"The hardest talk of all time is five minutes, not 30 or 45."

Uday Gajendar Lada Gorlenko Dave Malouf Louis Rosenfeld Dan Willis

10 Years of Enterprise UX: Reflecting on the community and the practice

June 18, 2025

Dan Willis

"When the dot com bubble burst, I had a great design job. Unfortunately, it was at another dot com."

Dan Willis

Enterprise Storytelling Sessions

June 8, 2017

Kara Kane

"Having a monthly newsletter gave people one place to go instead of having to check Slack or emails all the time."

Kara Kane

Communities of Practice for Civic Design

April 7, 2022

Nalini Kotamraju

"For the first time, I was able to advocate for funding directly for the research function at high executive levels."

Nalini Kotamraju

Research After UX

March 25, 2024

Bria Alexander

"Lauren Cantor works with companies to create new business strategies by tackling human-centered design."

Bria Alexander

Opening Remarks

October 1, 2021