Tools on kontextfenster

Smaller, Better, Cheaper: And Then?

Sun, 26 Apr 2026 00:00:00 +0000

Alibaba released Qwen3.6-27B last week. 27 billion parameters. It beats Qwen3.5-397B, a model with almost 15 times as many parameters, on nearly every coding benchmark. On SWE-bench Verified it scores 77.2 points; its larger predecessor scores 76.2. On Terminal-Bench 2.0 the gap is wider: 59.3 to 52.5.

That's no small thing. But it doesn't deserve a fanfare either.

What's happening here is distillation: the art of training a more compact model on the knowledge of a larger one. Large models generate training examples, small models learn from them. The result can do better on specific tasks because it was trained in a more targeted way. Qwen3.6-27B is specialized for coding. Its larger predecessor is a generalist. That's not a fair comparison, and yet it's informative.
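To make the data flow concrete, here is a deliberately tiny sketch of the idea: a teacher produces training examples, a much smaller student is fit to them. Everything in it is an assumption for illustration. The `teacher` function is a hypothetical stand-in for a large model, and the student is a two-parameter line fit with least squares. It says nothing about how Qwen3.6-27B was actually trained.

```python
def teacher(x: float) -> float:
    # Hypothetical stand-in for a large model: assume it is expensive but accurate.
    return 3.0 * x + 2.0


def make_training_set(inputs: list[float]) -> list[tuple[float, float]]:
    # Step 1: the teacher labels the inputs; its outputs become training examples.
    return [(x, teacher(x)) for x in inputs]


def fit_student(examples: list[tuple[float, float]]) -> tuple[float, float]:
    # Step 2: the student (a line with slope and intercept) is fit to the
    # teacher's outputs using ordinary least squares.
    n = len(examples)
    mean_x = sum(x for x, _ in examples) / n
    mean_y = sum(y for _, y in examples) / n
    var_x = sum((x - mean_x) ** 2 for x, _ in examples)
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in examples)
    slope = cov_xy / var_x
    intercept = mean_y - slope * mean_x
    return slope, intercept


examples = make_training_set([0.0, 1.0, 2.0, 3.0, 4.0])
slope, intercept = fit_student(examples)
```

The student never sees ground truth, only what the teacher says. That is also why the choice of inputs matters: a student trained mostly on coding examples will be good at coding and not much else.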

Context Is Not Memory

Tue, 07 Apr 2026 00:00:00 +0000

Every time you send a message to an AI system, it reads everything again. The entire conversation history. From the beginning. Your first question, its answer, your second question, its answer, all the way up to now. There’s no short-term memory holding things in place. There’s only this one window, and everything has to fit inside it.

That sounds like an implementation detail. It’s an architectural decision with consequences.

The window has a limit. Smaller models hit it sooner, larger ones later, but it's always there. When a conversation runs long enough, the oldest content falls out. Not because the model forgets, but because it was never really stored. It was just text in the window.
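A minimal sketch of what "falling out" can look like, under loud assumptions: `count_tokens` is a hypothetical word counter rather than a real tokenizer, `fit_to_window` is a made-up helper, and dropping the oldest messages first is only one possible strategy. Real systems may summarize, pin a system prompt, or truncate differently.

```python
from dataclasses import dataclass


@dataclass
class Message:
    role: str      # "user" or "assistant"
    content: str


def count_tokens(text: str) -> int:
    # Assumption: one token per whitespace-separated word.
    # Real tokenizers split text differently.
    return len(text.split())


def fit_to_window(history: list[Message], max_tokens: int) -> list[Message]:
    """Return the most recent messages that fit inside the token budget."""
    kept: list[Message] = []
    used = 0
    # Walk backwards from the newest message and stop once the budget is spent.
    for message in reversed(history):
        cost = count_tokens(message.content)
        if used + cost > max_tokens:
            break
        kept.append(message)
        used += cost
    kept.reverse()  # restore chronological order
    return kept


history = [
    Message("user", "What is a context window?"),
    Message("assistant", "It is the text the model can read at once."),
    Message("user", "And what happens when it fills up?"),
]

# On every turn the whole history is sent again, trimmed to the budget.
prompt = fit_to_window(history, max_tokens=18)
```

With a budget of 18 "tokens", the first question no longer fits and is simply absent from `prompt`. Nothing was forgotten. It just isn't in the text the model gets to read.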

Four Tools, One Task

Tue, 07 Apr 2026 00:00:00 +0000

A recent survey of over 900 developers found that 70 percent use between two and four AI tools at the same time. Another 15 percent use five or more. The question nobody is asking out loud: why so many?

The obvious answer is specialization. Cursor for work inside the editor, Claude Code for tasks spanning multiple files, a chatbot for everything else. Different tools for different layers of the workflow. That sounds reasonable.