Category Deep Dive

Foundation Models & LLMs

Daily signals and headlines

255 headlines across 77 days

Recent Scores

Teaching Claude Why

Anthropic published research on improving Claude's reasoning and interpretability capabilities.

Hacker News

SubQ: Sub-Quadratic LLM

Sub-quadratic language model with efficiency improvements over traditional LLM architectures.

Hacker News

Grok 4.3

X.AI releases Grok 4.3, an updated frontier model with new capabilities.

Hacker News

Mistral Medium 3.5

Mistral releases Medium 3.5 foundation model with enhanced capabilities.

Hacker News

GPT‑5.5 Bio Bug Bounty

OpenAI launches bug bounty program for GPT-5.5 focused on biological research safety vulnerabilities.

Hacker News

GPT-5.5

OpenAI releases GPT-5.5, generating significant community discussion with 1396 points and 919 comments.

Hacker News

ChatGPT Images 2.0

OpenAI launches ChatGPT Images 2.0 with improved reasoning, interactive editing, and 2K output resolution.

Hacker News

Claude Design

Anthropic announces Claude Design, highlighting advances in their flagship foundation model.

Hacker News

Claude Opus 4.7

Anthropic releases Claude Opus 4.7 with improved vision, memory, and instruction-following capabilities.

Hacker News

Trinity Large Thinking

New large thinking model Trinity released on OpenRouter for extended reasoning tasks.

Hacker News

GPT‑5.4 Mini and Nano

OpenAI releases smaller, more efficient variants of GPT-5.4 optimized for cost-sensitive deployment and agent applications.

Hacker News

Mistral AI Releases Forge

Mistral AI launches Forge, an enterprise platform enabling organizations to build and customize proprietary AI models with their own data.

Hacker News

LLM Architecture Gallery

Comprehensive reference guide showcasing various LLM architecture designs and patterns.

Hacker News

GPT‑5.3 Instant

OpenAI releases GPT-5.3 Instant model as latest advancement in foundation model capabilities.

Hacker News

Gemini 3.1 Pro

Google releases Gemini 3.1 Pro, outperforming Claude 4.6 Opus and GPT-5.2 on reasoning benchmarks with 1-million-token context window.

Hacker News

Claude Sonnet 4.6

Anthropic releases Claude Sonnet 4.6, advancing frontier language model capabilities.

Hacker News

Gemini 3 Deep Think

Google releases Gemini 3 Deep Think, advancing frontier model capabilities.

Hacker News

GPT‑5.3‑Codex‑Spark

OpenAI introduces GPT-5.3-Codex-Spark, a specialized coding model with 15x faster inference.

Hacker News

OpenAI starts testing ads in ChatGPT

OpenAI has begun testing advertising placements for some ChatGPT users in the US, signaling a potential new revenue stream for the consumer product.

Engadget