ai
A list of content tagged ai
Blogs
- Generating Book Covers Using AI
- Webmentions are back thanks to GitHub Copilot
- Taking Claude Code on the web for a spin
- How do I keep up with AI?
- Vibe-Specing - From concepts to specification
- Llama's Turn On: Tuning In to AI's Quest for Higher Consciousness in MOOs
- Digitize Analog Bookmarks using AI, .NET, and GitHub Models
- Configure Ollama on Dev Containers and VS Code
- Getting started with Ollama on Windows
- Using Generative AI to produce Spotify Clips
- AI like it's 1999 or 1899
- Quick thoughts about Snapdragon Summit 2023
- Deploy ML.NET Machine Learning Model in Blazor WebAssembly Static Website
- Use machine learning to categorize web links with F# and ML.NET
- Restaurant Inspections ETL & Data Enrichment with Spark.NET and ML.NET Automated (Auto) ML
- The Case for Doing Machine Learning with F#
- Operationalizing Machine Learning with ML.NET, Azure DevOps and Azure Container Instances
- Serverless Machine Learning with ML.NET and Azure Functions
- Deploy .NET Machine Learning Models with ML.NET, ASP.NET Core, Docker and Azure Container Instances
Notes
- FOSDEM 2026 starts tomorrow
- Quick Thoughts on Snapdragon Summit 2025
- Copilot, add new features. But first, coffee
- Ollama Adds Mistral 3.1 Support
- Tinkering with DeepSeek R1, GitHub Models, and .NET on stream
- Spotify Wrapped 2024 AI Generated Podcast
- .NET Conf 2024 Bound
- Clock Tables - Org Mode, Plain Text, and AI
- These models are too damn big!
- Use AI to generate a blogroll others can subscribe to
- New Era of Work - Windows / Surface Event Blog (March 21, 2024)
- Book Review - Agency
- Down the weird web
- What about instrumentals? AI Generated Spotify Clips Addendum
- AI abundance after scarcity cycles
- Quick Thoughts Snapdragon Summit 2023 Addendum
- New AI generated phone wallpaper
- Dall-E Outpainting and generative models
- Web Neural Network API - Working Draft
- Next gen stick figures using AI and NVIDIA Canvas
Responses
- Orchestrate teams of Claude Code sessions
- Building a C compiler with a team of parallel Claudes
- Claude Opus 4.6
- Introducing GPT-5.3-Codex
- Voxtral transcribes at the speed of sound
- Owning a $5M data center
- Musk's Starlink updates privacy policy to allow consumer data to train AI
- How AI assistance impacts the formation of coding skills
- Introducing Prism
- Open Coding Agents: Fast, accessible coding agents that adapt to any repo
- Kimi K2.5: Visual Agentic Intelligence
- Interactive tools in Claude
- The Computational Web and the Old AI Switcharoo
- Gas Town’s Agent Patterns, Design Bottlenecks, and Vibecoding at Scale
- Qwen3-TTS Family is Now Open Sourced: Voice Design, Clone, and Generation!
- Claude’s Constitution
- The assistant axis: situating and stabilizing the character of large language models
- I shipped code I don't understand and I bet you have too
- FLUX.2 [klein]: Towards Interactive Visual Intelligence
- Open Responses
- Our approach to advertising and expanding access to ChatGPT
- Why We Are Excited About Confessions
- How confessions can keep language models honest
- Advancing Claude in healthcare and the life sciences
- Introducing Cowork
- Uncharted Territory
- Why Didn’t AI “Join the Workforce” in 2025?
- How Markdown took over the world
- OpenAI launches ChatGPT Health
- Is It Finally Time To Fight Back Against Technology? (This Bestseller Says “Yes”)
- Manus Joins Meta: Accelerating AI Innovation for Businesses
- Prototypes Are the New PRDs
- Beware Unearned Wisdom
- Transformers v5: Simple model definitions powering the AI ecosystem
- Prompt caching: 10x cheaper LLM tokens, but how?
- 2025 LLM Year in Review
- Google Cloud’s Business Trends Report 2026: Key findings
- Introducing Metrax: performant, efficient, and robust model evaluation metrics in JAX
- Measuring AI Ability to Complete Long Tasks
- Evaluating Context Compression for AI Agents
- Multiplexing MCP Servers For Agentic Specialization
- Da2a: The Future of Data Platforms is Agentic, Distributed, and Collaborative
- The Seven Pillars of a Production-Grade Agent Architecture
- Inside the feature store powering real-time AI in Dropbox Dash
- Token-count-based Batching: Faster, Cheaper Embedding Inference for Queries
- Google's year in review: 8 areas with research breakthroughs in 2025
- Introducing Bolmo: Byteifying the next generation of language models
- We're open sourcing our MCP OIDC Provider
- Help boost your daily productivity with CC, a new experimental AI agent from Google Labs
- Native and Compact Structured Latents for 3D Generation
- Sharp Monocular View Synthesis in Less Than a Second
- Efficient Optimization With Ax, an Open Platform for Adaptive Experimentation
- Applying data loading best practices for ML training with Amazon S3 clients
- AlphaEvolve on Google Cloud: AI for agentic discovery and optimization
- Debugging Deep Agents with LangSmith
- Introducing Polly: Your AI Agent Engineer
- Validating LLM-as-a-Judge Systems under Rating Indeterminacy
- Postgres for Agents
- Agent Engineering: A New Discipline
- Memori Docs
- From Single-Node to Multi-GPU Clusters: How Discord Made Distributed Compute Easy for ML Engineers
- Unlocking Peak Performance on Qualcomm NPU with LiteRT
- Introducing Meta Segment Anything Model 3 (SAM 3)
- Reducing Privacy leaks in AI: Two approaches to contextual integrity
- The state of enterprise AI
- SOTW 2025: The Year WordPress Became AI-Native
- Titans + MIRAS: Helping AI have long-term memory
- Writing a good CLAUDE.md
- STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flows
- Fara-7B: An Efficient Agentic Model for Computer Use
- WorldGen — Text to Immersive 3D Worlds
- Build with Nano Banana Pro, our Gemini 3 Pro Image model
- SIMA 2: An Agent that Plays, Reasons, and Learns With You in Virtual 3D Worlds
- A new era of intelligence with Gemini 3
- Introducing Google Antigravity
- Private AI Compute advances AI privacy
- Announcement: Pydantic AI Gateway Open Beta
- Google Colab is Coming to VS Code
- Exploring a space-based, scalable AI infrastructure system design
- Introducing Code Wiki: Accelerating your code understanding
- Piloting group chats in ChatGPT
- MMCTAgent: Enabling multimodal reasoning over large video and image collections
- Agent Sandbox - Agentic AI on Kubernetes and GKE
- Model Equivalence using Z3
- RL Learning with LoRA: A Diverse Deep Dive
- The algorithm failed music
- There is no such thing as a tokenizer-free lunch
- ML Kit’s Prompt API: Unlock Custom On-Device Gemini Nano Experiences
- RL without TD learning
- Embedding Atlas
- Towards Humanist Superintelligence
- Building the Open Agent Ecosystem Together: Introducing OpenEnv
- Custom agents for GitHub Copilot
- llamafile Returns
- Introducing Agent HQ: Any agent, any way you work
- On-Policy Distillation
- Vercel Coding Agent Template
- Introducing vibe coding in Google AI Studio
- Introducing PyTorch Monarch
- SentinelStep: Building agents that can wait, monitor, and act
- C# AI Buddy: Early Preview
- Introducing ExecuTorch 1.0
- Ray Comes to the PyTorch Foundation
- The Way of Code
- Introducing ChatGPT Atlas
- Diffusion Beats Autoregressive in Data-Constrained Settings
- VaultGemma: The world's most capable differentially private LLM
- Announcing MySQL AI
- mmBERT: ModernBERT goes Multilingual
- Introducing Dreamer 4
- IBM Granite-Docling: End-to-end document understanding with one tiny model
- Introducing the Data Commons Model Context Protocol (MCP) Server
- Introducing Gemini CLI extensions
- Claude Code on the web
- DeepSeek-OCR: Contexts Optical Compression
- Equipping agents for the real world with Agent Skills
- Coral NPU: A full-stack platform for Edge AI
- Introducing Claude Haiku 4.5
- Nanochat
- Introducing CodeMender: an AI agent for code security
- Introducing the Gemini 2.5 Computer Use model
- Managing context on the Claude Developer Platform
- Writing effective tools for AI agents
- Effective context engineering for AI agents
- Sora 2 is here
- Introducing Claude Sonnet 4.5
- Introducing ChatGPT Pulse
- Start and track Copilot coding agent tasks in GitHub Mobile
- Memvid
- GitHub Copilot CLI public preview
- Ollama Web Search
- Chrome DevTools (MCP) for your AI agent
- Qwen3-Omni
- Agent Payments Protocol (AP2)
- Windows ML is generally available
- Introducing Notion 3.0
- Omnineural 4B
- Agents.md
- Seemingly Conscious AI
- Introducing Gemma 3 270M
- Self-Adapting Language Models (SEAL)
- How we built our multi-agent research system - Anthropic
- State-Of-The-Art Prompting For AI Agents
- Introducing ElevenLabs Conversational AI 2.0
- The Darwin Gödel Machine - AI that improves itself by rewriting its own code
- Agent Network Protocol - The HTTP of the Agentic Web Era
- Claude Artifacts
- GPT Image 1 - Image Generation API
- The Llama 4 herd: The beginning of a new era of natively multimodal AI innovation
- Introducing Gemma 3
- Introducing Command A
- Simon Willison's AI-Generated Tools Colophon
- Mistral Small 3.1
- Introducing AX: Why Agent Experience Matters
- Introducing Mercury, the first commercial-scale diffusion large language model
- Nomic Embed Text V2: An Open Source, Multilingual, Mixture-of-Experts Embedding Model
- Claude 3.7 Sonnet and Claude Code
- The Ultra-Scale Playbook: Training LLMs on GPU Clusters
- Dream Job - Google Super Bowl 2025 Ad
- Languages & Runtime Community Standup - Tensors in .NET
- Generative AI for Beginners (.NET) is now available
- AI Dev Gallery now in the Microsoft Store
- Introducing deep research
- Semantic Search and On-Device ML in Emacs
- Swarm navigation of cyborg-insects in unknown obstructed soft terrain
- AI Subtitles Are Coming to VLC
- LangChain State of AI 2024 Report
- MarkItDown - Convert files to Markdown
- Build a YouTube chat app with .NET
- Sora is here
- Day 1 of .NET Conf
- Microsoft Research: Introducing DRIFT Search
- Microsoft Research Focus: Week of October 28, 2024
- In-Context LoRA for Diffusion Transformers
- Thinking LLMs: General Instruction Following with Thought Generation
- Join me at DEVintersection in Las Vegas - September 10-12
- Bringing Llama 3 to life
- Transformers in music recommendation
- Reddit says companies must pay for data access
- Tensors from scratch series
- Meta AI's Segment Anything Model (SAM) 2
- Zombie Internet
- Meta Large Language Model Compiler: Foundation Models of Compiler Optimization
- Deep Questions - Debunking AI Model Capabilities / Distributed Webs of Trust
- Mapping the Mind of a Large Language Model
- Claude's Character
- Ultravox - An open, fast, and extensible multimodal LLM
- Apple - Private Cloud Compute
- Introducing Apple’s On-Device and Server Foundation Models
- Introducing Apple Intelligence
- The Verge - Apple WWDC 2024 keynote in 18 minutes
- Andrej Karpathy - Let's reproduce GPT-2 (124M)
- TinyAgent: Function Calling at the Edge
- Reproducing GPT-2 (124M) in llm.c in 90 minutes for $20
- Introducing Snowflake Arctic
- SAMMO: A general-purpose framework for prompt optimization
- Google Penzai
- Introducing Phi-3
- Introducing Llama 3
- RecurrentGemma - Open weights language model from Google DeepMind, based on Griffin.
- ARAGOG: Advanced RAG Output Grading
- Introducing Stable Audio 2.0
- Start using ChatGPT instantly
- Announcing DBRX: A new standard for efficient open source LLMs
- One-step Diffusion with Distribution Matching Distillation
- Stability CEO Resigns
- Mamba: Linear-Time Sequence Modeling with Selective State Spaces
- Quanto: a PyTorch quantization toolkit
- Demystifying Embedding Spaces using Large Language Models
- The Tokenizer Playground
- Introducing Stable Video 3D: Quality Novel View Synthesis and 3D Generation from Single Images
- Nvidia reveals Blackwell B200 GPU
- LaVague - Large Action Model framework
- Spreadsheets are all you need
- Building Meta’s GenAI Infrastructure
- OpenAI Transformer Debugger
- Grok-1
- Ollama now supports AMD graphics cards
- What I learned from looking at 900 most popular open source AI tools
- You can now train a 70b language model at home
- Levels of Complexity: RAG Applications
- Inflection-2.5: meet the world's best personal AI
- Training great LLMs entirely from ground up in the wilderness as a startup
- Stable Diffusion 3: Research Paper
- Wix’s new AI chatbot builds websites in seconds based on prompts
- Gemma PyTorch
- Introducing the next generation of Claude
- GGUF, the long way around
- Predictive Human Preference: From Model Ranking to Model Routing
- The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
- Tumblr and WordPress to Sell Users’ Data to Train AI Tools
- The latest Microsoft Copilot update on Android makes me mourn the death of Cortana
- Announcing Mistral Large
- GPT in 500 lines of SQL
- Jim Cramer says McDonald’s embracing AI at drive-thrus is good news for Nvidia
- Stable Diffusion 3 - Early Preview
- The killer app of Gemini Pro 1.5 is video
- HuggingChat
- Gemma: Introducing new state-of-the-art open models
- Cosmopedia v0.1
- MLX Swift - On-device ML research with MLX and Swift
- Ollama - Windows Preview
- V-JEPA: The next step toward Yann LeCun’s vision of advanced machine intelligence (AMI)
- OpenAI Sora - Creating video from text
- Introducing Gemini 1.5
- The text file that runs the internet
- NVIDIA Chat with RTX
- Stable Cascade
- Memory and new controls for ChatGPT
- Introducing Nomic Embed: A Truly Open Embedding Model
- Ollama - Python & JavaScript Libraries
- Google’s Hugging Face deal puts ‘supercomputer’ power behind open-source AI
- FOSDEM 2024 Schedule
- NightShade
- Introducing Stable LM 2 1.6B
- Talking about Open Source LLMs on Oxide and Friends
- LeftoverLocals: Listening to LLM responses through leaked GPU local memory
- More than an OpenAI Wrapper: Perplexity Pivots to Open Source
- Conor McGregor pitching Zune and Windows Phone to Cristiano Ronaldo
- Phi-2 now on HuggingFace
- Generative AI and .NET - Part 2 SDK
- Generative AI and .NET - Part 1 Intro
- Supporting the Open Source AI Community
- GPT-4 API general availability
- How generative AI is changing the way developers work
Bookmarks
- JordanMarr/Agent.NET: A composable AI agent framework for .NET.
- Neural Networks: Zero To Hero
- Top 100 Claude Code Recipes for Knowledge Workers
- The Best Open-Source Small Language Models (SLMs) in 2026
- DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
- Introduction to Agents Whitepaper
- CUGA on Hugging Face: Democratizing Configurable AI Agents
- Carnegie Mellon at NeurIPS 2025
- DS-STAR: A state-of-the-art versatile data science agent
- Claude Use Cases
- Less is More: Recursive Reasoning with Tiny Networks
- The state of AI in 2025
- Introducing Nested Learning: A new ML paradigm for continual learning
- Google Private AI Compute Technical Brief
- Why It’s Better for Us to Think of AI as a Tool than as a Worker
- Mathematical exploration and discovery at scale
- Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning
- Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing
- Project Gutenberg AI Books
- Tensor Logic: The Language of AI
- Wikidata Embedding Project
- LoRA Without Regret
- Proof of Thought : Neurosymbolic Program Synthesis allows Robust and Interpretable Reasoning
- Video models are zero-shot learners and reasoners
- Getting AI to Work in Complex Codebases
- Recursive Open Meta Agent (ROMA)
- Virtual Agent Economies
- How people use ChatGPT
- Real Simple Licensing
- Claude Code Emacs Integration
- Introducing AutoRound
- Parakeet TDT 0.6B V2 (En)
- Introducing Locate 3D
- ZeroSearch - Incentivize the Search Capability of LLMs without Searching
- A Survey of AI Agent Protocols
- TeLoGraF: Temporal Logic Planning via Graph-encoded Flow Matching
- CORG: Generating Answers from Complex, Interrelated Contexts
- Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks
- s1: Simple test-time scaling
- HuggingFace AI Agents Course
- The Illustrated DeepSeek-R1
- Agents
- Agents Whitepaper
- NotebookLlama: An Open Source version of NotebookLM
- Transformer Explainer: Interactive Learning of Text-Generative Models
- HybridRAG: Integrating Knowledge Graphs and Vector Retrieval Augmented Generation for Efficient Information Extraction
- GPT-4o System Card
- LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens
- Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models
- DE-COP: Detecting Copyrighted Content in Language Models Training Data
- Using LLM to select the right SQL Query from candidates
- Large Language Models Are Zero-Shot Time Series Forecasters
- ReALM: Reference Resolution As Language Modeling
- Releasing Common Corpus: the largest public domain dataset for training LLMs
- Machine Learning for Games Course - HuggingFace
- MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
- Diffusion Models From Scratch
- Diffusion models from scratch, from a new theoretical perspective
- Fast Inner-Product Algorithms and Architectures for Deep Neural Network Accelerators
- The AI Study Guide: Azure Machine Learning Edition
- HuggingFace - Open Source AI Cookbook
- Magic.dev
- GraphRAG: Unlocking LLM discovery on narrative private data
- LangChain - OpenGPTs
- Eagle 7B : Soaring past Transformers with 1 Trillion Tokens Across 100+ Languages (RWKV-v5)
- OpenAI Microscope
- Stable Code 3B - Coding on the Edge
- Sampling for Text Generation
- SingSong - Generating musical accompaniments from singing
- AI for economists - prompts and resources
- Every - Daily Newsletter
- My AI Timelines Have Sped Up (Again)
- Introducing the GPT Store
- Ferret: Refer and Ground Anything Anywhere at Any Granularity
- VideoPoet: A large language model for zero-shot video generation
- Midjourney v6
- LangChain State of AI 2023
- LLM in a flash: Efficient Large Language Model Inference with Limited Memory
- OpenAI - Prompt engineering
- Solo - an AI website builder for solopreneurs
- MemoryCache - Augmenting Local AI with Browser Data
- Mozilla Innovation Week - Explore the Future of AI with Mozilla
- Bash One-Liners for LLMs
- Mixtral 8x7B on Apple Silicon with MLX
- Steering at the Frontier: Extending the Power of Prompting
- promptbase
- LLM360: Towards Fully Transparent Open-Source LLMs
- Phi-2: The surprising power of small language models
- Answer.AI - A new old kind of R&D lab
- Introducing Stable LM Zephyr 3B
- The Geometry of Truth: Dataexplorer
- State of AI Report - 2023
- OnnxStream - Stable Diffusion XL 1.0 Base on a Raspberry Pi Zero 2
- Evaluating LLMs is a minefield
- MemGPT - Towards LLMs as Operating Systems
- Best Practices for LLM Evaluation of RAG Applications
- MLflow 2.8 with LLM-as-a-judge metrics and Best Practices for LLM Evaluation of RAG Applications
- Metadata-Curated Language-Image Pre-training (MetaCLIP) - Demystifying CLIP Data
- OpenAgents: An Open Platform for Language Agents in the Wild
- The Foundation Model Transparency Index
- The New Kings of Open Source AI (Oct 2023 Recap)
- Mixtral of experts
- SatCLIP - A Global, General-Purpose Geographic Location Encoder
- Long context prompting for Claude 2.1
- Introducing Gemini
- Introducing llamafile
- AI and Mass Spying
- AI Alliance Launches
- Understanding Deep Learning
- The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision)
- Chain-of-Verification Reduces Hallucination in Large Language Models
- Multimodality and Large Multimodal Models (LMMs)
- HuggingFace: Text Embeddings Inference
- Scaffolded LLMs as natural language computers
- Creating the First Confidential GPUs
- The AI Attack Surface Map v1.0
- Mistral 7B Model
- Meta announces AI experiences in Facebook, Instagram, WhatsApp
- Carton - Run any ML model from any programming language
- FlowiseAI - Drag and drop for LLM flows
- Why Open Source AI Will Win
- Distilling step-by-step: Outperforming larger language models with less training data and smaller model sizes
- vim + llm = 🔥
- Next-Gen CPU Acceleration: AVX For Generative AI
- ChatGPT can now see, hear, and speak
- Amazon and Anthropic announce strategic collaboration to advance generative AI
- DALL·E 3
- Optimizing your LLM in production
- Software²
- PointLLM: Empowering Large Language Models to Understand Point Clouds
- How consumers are using Generative AI
- Coqui 🐸 XTTS
- Introducing Würstchen: Fast Diffusion for Image Generation
- Efficient Controllable Generation for SDXL with T2I-Adapters
- Spread Your Wings: Falcon 180B is here
- Modular: Mojo🔥 - It’s finally here!
- Rethinking trust in direct messages in the AI era
- Perplexity: Interactive LLM visualization
- Can LLMs learn from a single example?
- Teaching with AI
- Llama 2 7B/13B are now available in Web LLM
- Making Large Language Models Work For You
- Consciousness is a Big Suitcase
- Introducing Code Llama, a state-of-the-art large language model for coding
- Announcing Python in Excel
- Large Language Models with Semantic Search
- Patterns for Building LLM-based Systems and Products
- HuggingFace Candle
- Open challenges in LLM research
- Jupyter AI Brings Generative AI to Notebooks
- Announcing xAI
- Introducing Keras Core: Keras for TensorFlow, JAX, and PyTorch.
- Orca: Progressive Learning from Complex Explanation Traces of GPT-4
- Gorilla: Large Language Model Connected with Massive APIs
- Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold
- StarCoder: A State-of-the-Art LLM for Code
- LoRA: Low-Rank Adaptation of Large Language Models
- Shap-E: Generating Conditional 3D Implicit Functions
- Massively Multilingual Speech (MMS)
- PaLM 2
- Transformers Agent
- Copilot for Docs
- ChatGPT Prompt Engineering for Developers
- ImageBind: One Embedding Space To Bind Them All
- Language models can explain neurons in language models
- PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware
- Wikipedia embeddings dataset
- LLaVA: Large Language and Vision Assistant
- WebGPU API
- Consistency Models
- Free Dolly
- Generative Agents: Interactive Simulacra of Human Behavior
- Koala: A Dialogue Model for Academic Research