Many new models were released this week, with Claude 3.7 Sonnet and Grok 3 the highlights. We discuss an agentic way of extracting useful information from PDFs, and how AI has had a measurable effect on online written text.
This week's newsletter dives into the challenge of evaluating LLM output: how can we trust AI to judge AI? I will be speaking about this topic at the DevWorld conference in Amsterdam this week!
Today we discuss how well language models really perform when given large amounts of context, results from a Microsoft survey showing that over-reliance on genAI impacts our critical thinking, and some recent safety concerns regarding DeepSeek's latest reasoning model, R1.
This week we present a nice visual on how DeepSeek's R1 was trained, discuss recent legal battles against AI in Europe, and show a new way of doing data science with a reactive notebook called Marimo!
This week we have some exciting news from OpenAI, which published a new agent; we examine a trend of expertly crafted clickbait research; and we review an augmentation technique that has received a lot of attention recently.
Whether or not AI survives the hype, I feel fortunate to live in this interesting time in human history. We are already able to do things that would have seemed unimaginable to our grandparents. It makes me really appreciate how fast the industry has progressed.