Edition 37: From Blind to SuperVision
Welcome to the AviaryAI Newsletter!
Thanks for joining us as we explore the intersection of GenAI and finance with practical learnings and the latest relevant insights. Let’s get started.
This week you’ll learn:
- 👁️ AI achieves surgical breakthrough: Beyond 20/20 vision now possible
- 🎓 Gemini 2.0 demonstrates next-level interactive coaching
- 🎦 Why Google's video AI is leaving competitors behind
- 📚 Self-attention simplified: How AI understands context
- 📈 OpenAI's new for-profit business model
GENERATIVE AI THIS WEEK
The coolest things we're watching and why you should care
AI Surgery Gives Blind Woman Perfect Vision
AI technology has helped achieve what was once thought impossible in laser eye surgery. By creating a digital twin of a patient's eyes, the AI simulated thousands of surgical variations to determine the optimal approach before the actual procedure. The result? The first UK patient, who was legally blind without glasses, now has better than 20/20 vision. Clinical trials showed every patient achieved better than 20/20 vision, with some reaching 20/10, which is close to the limit of human visual capability.
So what?
This breakthrough shows AI’s potential to transform complex decision-making. By running thousands of simulations, organizations can refine their approaches before real-world implementation. This eases the traditional trade-off between innovation and risk, allowing institutions to be bold in their solutions while maintaining safety and reliability.
Watch Google’s Gemini 2.0 be a Real-Time Programming Coach
In a recent demo from @MckayWrigley posted on X, we see Google’s Gemini 2.0 used as an interactive coding mentor that can observe and respond to real-world actions. The AI watched the user's screen, understood their actions in real time, and provided contextual guidance for coding tasks, all while maintaining a natural conversation flow. When the user made mistakes or needed clarification, Gemini 2.0 could immediately spot issues and offer corrections, much like having an expert looking over your shoulder.
So what?
This real-time interaction represents a significant leap forward from traditional AI chatbots that rely solely on text prompts. When AI can observe, understand, and guide in real time, it becomes less of a tool and more of a collaborative partner. Think beyond just coding: this technology could transform employee training, process monitoring, and quality control across any department.
Watch the demo here
Google's AI Video Generator Sets New Quality Standard
Google's new AI video generator, Veo 2, is demonstrating significantly better results than OpenAI's recently released Sora. In direct comparisons, Veo 2 showed superior understanding of real-world physics and safety, producing more realistic and coherent videos. The key difference? Google's access to YouTube's vast library of video content for training data, a resource Google has confirmed other companies cannot use.
So what?
While companies rush to announce new AI capabilities, Google's success with Veo 2 shows that the quality and quantity of training data often matter more than being first to market. This serves as a valuable reminder for executives that successful technology implementation isn't about having the latest tools. It's about having the right foundation to make those tools work effectively.
GENERATIVE AI WORD OF THE DAY
Self-Attention
Self-attention is a mechanism that helps AI models understand context by figuring out how each word in a sentence relates to every other word. Think of a smart reader who, on seeing the word "bank" in "I went to the bank to deposit money," connects it more strongly with "deposit" and "money" and so understands that it refers to a financial institution, not a riverbank. The model assigns different levels of importance to these connections, allowing it to grasp the full meaning of phrases and generate more coherent responses.
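For readers who want to see the idea in code, here is a minimal sketch of scaled dot-product self-attention in plain Python. It is an illustration under stated assumptions, not how any production model is built: the two-number word vectors are made up for this example, and the sketch uses those vectors directly as queries, keys, and values, whereas real models learn separate projections for each and work with far larger vectors.

```python
import math


def softmax(scores):
    """Turn raw scores into weights that are positive and sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(vectors):
    """Scaled dot-product self-attention with no learned projections.

    Simplification (assumption): each word vector serves as its own
    query, key, and value.
    """
    dim = len(vectors[0])
    all_weights, outputs = [], []
    for query in vectors:
        # Score this word against every word, including itself.
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(dim)
            for key in vectors
        ]
        weights = softmax(scores)
        all_weights.append(weights)
        # The new representation is a weighted mix of all word vectors.
        outputs.append(
            [sum(w * v[i] for w, v in zip(weights, vectors)) for i in range(dim)]
        )
    return all_weights, outputs


# Hypothetical toy embeddings, chosen by hand so that the finance-related
# words point in a similar direction and "the" does not.
tokens = ["bank", "deposit", "money", "the"]
embeddings = [
    [1.0, 0.2],  # bank
    [0.9, 0.1],  # deposit
    [0.8, 0.3],  # money
    [0.0, 0.1],  # the
]

weights, outputs = self_attention(embeddings)
bank_weights = dict(zip(tokens, weights[0]))

for word, weight in bank_weights.items():
    print(f"bank -> {word}: {weight:.2f}")
```

In this toy setup, "bank" gives more weight to "deposit" and "money" than to "the," which mirrors the example above. The gap is small because the vectors are tiny; the point is the mechanism, not the magnitudes.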
GENERATIVE AI IN FINANCE
The latest news at the intersection of GenAI and Finance
OpenAI’s For-Profit Transition Plan
OpenAI recently announced its plan to restructure into a Delaware public benefit corporation (PBC), joining other AI companies like Anthropic in adopting this corporate structure. Unlike traditional corporations focused solely on profits, PBCs are legally required to pursue public benefits alongside financial returns. The move allows OpenAI to attract more investment while maintaining its commitment to developing AI that benefits humanity.
The restructuring separates OpenAI's operations: the PBC will run the business while the non-profit will focus on charitable initiatives in healthcare, education, and science. This structure provides a framework for OpenAI to compete with tech giants like Google while keeping its ethical commitments intact, though experts note that PBC status alone doesn't guarantee mission-aligned behavior.
So what?
The companies closest to advancing AI technology are choosing to build in structural safeguards. While most tech companies optimize for maximum flexibility and control, leading AI companies are voluntarily creating frameworks to balance innovation with responsibility. This suggests that sustainable AI development requires more thoughtful governance than traditional tech products, a valuable insight for any organization planning its AI strategy.
About AviaryAI
AviaryAI is the next evolution of financial interaction. Enhance your team with proactive outbound voice agents to welcome new members, encourage credit card activations, and drive non-interest revenue.