Cassie Breviu

Cassie Breviu

Software Engineer & Product Leader · Windows @ Microsoft

Building at the intersection of AI and product for over a decade. I currently lead engineering and product projects in the Windows org at Microsoft. Previously I built end-to-end ML solutions across startups and enterprise, from PyTorch model authoring to production inference at scale. Passionate about turning research into real products people actually use.

Featured
Reframing How You See Engineering Value: Our Evolving Mindset as Developers Using Agentic Engineering Tools
Latest Post

Reframing How You See Engineering Value: Our Evolving Mindset as Developers Using Agentic Engineering Tools

There's a lot of noise about what engineering value means in the age of agentic tools. The journey from excitement to fear to real engagement reveals something important: your value was never in the code you type it's in the judgment, systems thinking, and experience behind it.

Read →
Projects & Writing
Concepts for Orchestrating the Agentic Future

Concepts for Orchestrating the Agentic Future

The future of computing is rapidly evolving toward an agentic paradigm where AI-powered systems work collaboratively to enhance human productivity and creativity. This transformation requires a comprehensive approach that seamlessly integrates diverse tools, applications, and agents into a unified ecosystem. The orchestration of this agentic future isn't just about implementing LLM Models, it's about creating an intelligent infrastructure that connects everything we already have while paving the way for what's next in a scalable way.

Microsoft Build Session: Code-First LLMOps from prototype to production with GenAI tools | BRK110

Microsoft Build Session: Code-First LLMOps from prototype to production with GenAI tools | BRK110

Learn how to get from prototype to production with LLMOps and new developer tools for iterative debugging, evaluation, deployment, and monitoring. We’ll test and troubleshoot a copilot app using the code-first prompt flow SDK and frameworks like LangChain, interactively debugging an LLM chat application with VS Code. Then we’ll make sure it meets our thresholds for performance and safety, ship it, and ensure we can collect critical signals in production.

Empowering Your Business with AI: Introducing AI Application Templates

Empowering Your Business with AI: Introducing AI Application Templates

AI project starter templates for common GenAI scenarios. What is an AI project starter template anyway? These are GitHub repositories that have end-to-end solutions to jumpstart your AI endeavors. These templates serve as a foundation, allowing companies to harness AI benefits without reinventing the wheel.

GenAI RAG Chat Application

GenAI RAG Chat Application

This sample has the full end-to-end process of creating RAG application with Prompty. It includes GPT 3.5 Turbo LLM application code, evaluations, deployment automation with AZD CLI, GitHub Actions for evaluation and deployment, an monitoring for production solutions.

AI Image Generation: Stable Diffusion in C#

AI Image Generation: Stable Diffusion in C#

This is the first and only all C# implemenation of the Stable Diffusion Image creation model.

OpenAI Whisper AI Model Implemented in C#

OpenAI Whisper AI Model Implemented in C#

This project demonstrates the use of Olive to optimize an AI model for specific hardware, utilizing the Whisper Tiny English version as a case study. The optimized model is then deployed to the device and using ONNX Runtime we can execute both local and cloud-based inference.

Unreal Engine AI Real Time Style Transfer on Device

Unreal Engine AI Real Time Style Transfer on Device

Apply a Style Transfer Neural Network in real time with Unreal Engine 5 leveraging ONNX Runtime.

Co-Authored of PyTorch: Learn the Basics

Co-Authored of PyTorch: Learn the Basics

Co-Authored the Learn the Basics tutoral for creating deep learning models with PyTorch on Pytorch.org.