Summary
A 30-minute deep dive into building the Rosenbot. We’ll get hands-on and practical, and likely a bit philosophical. What does it take to build a useful AI assistant? What does it mean for a business, strategically? And how do we make sure we are building the future we want while doing all this? Takeaways: What does strategy look like in an AI world? What does eval-first mean? What the hell is going on deep inside an LLM? And what does all that mean for the future we want to build?
Key Insights
• Generative AI is a new design material requiring fresh approaches distinct from prior UX methods.
• The Rosenbot uses retrieval-augmented generation to semantically search Rosenfeld’s extensive UX content.
• Conversation logic beyond the LLM is essential to route requests, classify intents, and ensure safe responses.
• Evaluation (eval) of AI is fundamentally challenging because both its inputs and outputs are non-deterministic.
• Effective evaluation requires combining human expert and end-user feedback with automated tools.
• A third of a project’s budget and time should be dedicated solely to rigorous AI evaluation.
• ‘Prompt deep dive’ usability testing spends extended time on individual prompts to deeply understand interactions.
• New tooling is emerging specifically for tracing conversations, prompt engineering, and observability in AI.
• UX roles remain vital in AI development by inventing new research techniques and ensuring user-centered design.
• Co-creation between users and technology defines how AI applications evolve and succeed or fail.
Notable Quotes
"When GPT came out, my kids immediately adopted it at school and you couldn’t pry it from their dead hands."
"This AI stuff is a new design material, just like the internet or mobile was before."
"The Rosenbot has read everything, every piece of Rosenfeld’s knowledge, to help users access it."
"Building generative AI tools is not scary, it’s just very different from the last 20 years of building products."
"There’s a lot of steps behind the scenes to make conversations useful and safe."
"Eval is the engineering word for evaluation, but also what researchers and UX designers naturally do."
"Without proper evaluation, you’re just building a demo – demos are easy, quality production is hard."
"We invented the ‘prompt deep dive’ technique to spend lots of time on one prompt to deeply understand it."
"Co-creation between users and technology determines if this AI evolves to be useful or harmful."
"We should feel okay to throw away old assumptions and tooling and invent new techniques for this new world."