What is the Sequential Thinking MCP Server from Anthropic? The Sequential Thinking MCP Server is one of many Reference MCP Servers released by Anthropic to demonstrate the capabilities of their Model Context Protocol (MCP). This server in particular provides structure that helps augment a given AI’s thinking process. The Sequential Thinking server does not do any of its own “thinking” or decomposing of a problem. Instead, it deterministically receives structured input from an AI, validates the data in the...
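To make that concrete, here is a minimal sketch, in Python and purely illustrative rather than the server's actual TypeScript implementation, of the kind of structured "thought" payload such a server receives and the deterministic validation it performs. The field names are assumptions, not the official schema.

```python
# Illustrative sketch (not the actual server code): a structured "thought"
# payload from the model and the deterministic checks applied to it.
# Field names are assumptions, not the server's published schema.
from dataclasses import dataclass


@dataclass
class ThoughtInput:
    thought: str               # the model's current reasoning step
    thought_number: int        # 1-based index of this step
    total_thoughts: int        # the model's current estimate of steps needed
    next_thought_needed: bool  # whether the model intends to continue


def validate_thought(data: dict) -> ThoughtInput:
    """Deterministically validate the payload; no reasoning happens here."""
    t = ThoughtInput(
        thought=str(data["thought"]),
        thought_number=int(data["thoughtNumber"]),
        total_thoughts=int(data["totalThoughts"]),
        next_thought_needed=bool(data["nextThoughtNeeded"]),
    )
    if t.thought_number < 1:
        raise ValueError("thoughtNumber must be >= 1")
    if t.total_thoughts < t.thought_number:
        # Expand the estimate rather than rejecting the input.
        t.total_thoughts = t.thought_number
    return t
```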

Office Manager Magic, Powered by Multi-Modal AI Introduction AI isn’t a futuristic concept; it’s practical labor, available today. Not just for call centers or simple tasks, but for real, dynamic work that used to require a person. This isn’t about replacing people; it’s about helping teams do more with less. The following example highlights how a LangGraph-powered system is already handling quoting and scheduling—tasks that apply to nearly any business. The bigger point: AI agents are here, and they’re ready to do...
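As a rough illustration of the pattern described above, here is a minimal LangGraph wiring for a quote-then-schedule flow. The state fields, node logic, and example request are placeholders, not the actual system from the article.

```python
# Minimal illustrative LangGraph wiring for a quote-then-schedule workflow.
# State fields and node bodies are placeholders for real LLM/tool calls.
from typing import TypedDict

from langgraph.graph import StateGraph, END


class JobState(TypedDict):
    request: str
    quote: str
    scheduled: bool


def build_quote(state: JobState) -> dict:
    # In a real system, an LLM would read the request and price the work.
    return {"quote": f"Estimated quote for: {state['request']}"}


def schedule_job(state: JobState) -> dict:
    # A real node would check a calendar or booking API before confirming.
    return {"scheduled": True}


graph = StateGraph(JobState)
graph.add_node("quote", build_quote)
graph.add_node("schedule", schedule_job)
graph.set_entry_point("quote")
graph.add_edge("quote", "schedule")
graph.add_edge("schedule", END)

app = graph.compile()
result = app.invoke({"request": "Replace 12 office chairs", "quote": "", "scheduled": False})
```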

In a recent exploration, Braxton Nunnally from Phase 2 Labs examined how Zep—a memory management tool—can help AI systems retain and recall important information over time. This kind of “organizational memory” allows AI to move beyond one-off interactions and instead offer consistent, informed responses that build on past context. Common Business Pain Point: "Our AI tools don’t retain context or past interactions—users repeat themselves, teams lose knowledge, and we miss opportunities to respond more intelligently." What the Team Learned: AI Needs...

In a recent analysis, Alan Ramirez of Phase 2 Labs explored how organizations can reduce the operational costs of Large Language Models (LLMs) by implementing context caching—a method that stores and reuses the static parts of AI prompts. This strategy minimizes redundant processing, leading to significant cost savings. Common Business Pain Point: “Our AI tools are powerful, but the cost of running them is escalating quickly—especially as usage grows across departments.” What the Team Learned: Understanding Context Caching: By separating static (unchanging) and dynamic...
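The core idea is simple enough to show in a few lines. The sketch below, with placeholder content and no ties to any particular provider's API, separates the static, cacheable portion of a prompt from the short dynamic part that changes on every request.

```python
# Illustrative split of a prompt into a static, cacheable prefix and a
# dynamic per-request suffix. Names and content are placeholders.
STATIC_CONTEXT = """You are a support assistant for ACME Corp.
Policy manual:
... (several thousand tokens of unchanging reference material) ...
"""


def build_messages(user_question: str) -> list[dict]:
    # The static block is identical on every call, so a provider that
    # supports context caching only processes it in full once; only the
    # short dynamic question below is billed at the full input rate.
    return [
        {"role": "system", "content": STATIC_CONTEXT},
        {"role": "user", "content": user_question},
    ]


messages = build_messages("Can I return an item after 30 days?")
```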

In today's AI-driven landscape, creating systems with long-term memory capabilities has become increasingly important. If you're building a customer service chatbot, remembering interactions with its users and maintaining history beyond the context window are crucial parts of a successful system. While there are many components that go into building a fully functional long-term memory solution, Zep can be a powerful tool to help developers implement long-term memory in their AI applications. What is Zep? Zep is an API-based solution that...
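Before getting into Zep itself, the pattern it enables looks roughly like this. The endpoints and payloads below are hypothetical stand-ins, not Zep's actual API; they are only meant to show the store-then-retrieve flow an API-based memory layer supports.

```python
# Hypothetical store-then-retrieve flow for an API-based memory layer.
# The base URL, paths, and payloads are illustrative, not Zep's real API.
import requests

BASE_URL = "https://memory.example.com/api"  # hypothetical service URL
SESSION_ID = "customer-1234"

# Persist a conversation turn so it outlives the model's context window.
requests.post(
    f"{BASE_URL}/sessions/{SESSION_ID}/messages",
    json={"role": "user", "content": "My order #5521 arrived damaged."},
    timeout=10,
)

# Later (possibly weeks later), pull relevant history into a new prompt.
resp = requests.post(
    f"{BASE_URL}/sessions/{SESSION_ID}/search",
    json={"query": "previous issues with order 5521", "limit": 5},
    timeout=10,
)
relevant_history = resp.json()
```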

Introduction Large Language Models (LLMs) have revolutionized how organizations process and generate natural language content, but their operational costs can become significant at scale. One of the most effective techniques for reducing these costs is context caching, which allows reuse of static prompt components across multiple requests. This article examines how the three major AI providers—Google (Gemini), Anthropic (Claude), and OpenAI—implement context caching, with detailed analysis of their technical approaches, pricing structures, and practical limitations. The Technical Fundamentals of Context Caching When interacting...
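As a hedged example of what this looks like in practice, the sketch below marks a large static system block as cacheable using the anthropic Python SDK's cache_control field. The model name and file path are placeholders, and current parameter names and pricing should be confirmed against Anthropic's documentation.

```python
# Sketch of Claude-style context caching: the large, unchanging system block
# is marked cacheable so repeated calls can reuse it. Model name and file
# path are placeholders; verify parameters against Anthropic's current docs.
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

LARGE_REFERENCE_TEXT = open("policy_manual.txt").read()  # static material (placeholder file)

response = client.messages.create(
    model="claude-3-5-sonnet-latest",  # placeholder model name
    max_tokens=500,
    system=[
        {
            "type": "text",
            "text": LARGE_REFERENCE_TEXT,
            "cache_control": {"type": "ephemeral"},  # mark this block as cacheable
        }
    ],
    messages=[
        {"role": "user", "content": "Summarize the refund policy."}  # dynamic part
    ],
)
print(response.content[0].text)
```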

Unless you've recently spent your free time nose-deep in GitHub repos or interrogating ChatGPT like it's a barista who got your coffee order wrong, you may not be familiar with MCP servers. That’s okay. Two weeks ago, I wasn’t either. But thanks to a casual, "Hey, can you connect Notion to our project via an MCP server?" from a teammate (and my relentless need to avoid looking clueless), I dove headfirst into the rabbit hole. What I discovered is something potentially...

Imagine you’re an inventory manager for a retail business and you need to quickly verify stock levels for an unexpected large order while away from your computer. Instead of logging into a dashboard, navigating through menus, and analyzing spreadsheets, you simply send a text message: "Do we have enough iPhones to fulfill an order for 100 units?" Within seconds, you receive a reply: "You have 157 iPhones currently in stock, so you can fulfill this order. Based on your current sales rate of...

Introduction There’s been a lot of excitement lately in our P2 Labs team about the possibilities opened up by Anthropic’s Model Context Protocol, which enables AI tools to connect to a rapidly growing range of external tools and services. One question we've been exploring: how effectively can AI-powered coding assistants handle real-world development tasks when given access to the same resources a human would? The Experiment Since we’ve also been doing some work lately updating our Resume Sizzler demo, that provided us with...

Mobile application deployment has historically been a cumbersome process. Each deployment typically requires two builds (one iOS and one Android), each deployed to a separate store instance that must be manually configured and managed. While several effective tools have emerged in recent years to streamline this process, they are all understandably tuned to the most common use case: building one application for each platform and deploying those applications to the App Store and Play Store, respectively. However, business needs are varied...