The AI field is reusing existing CS concepts that we never had the hardware for, and now these people are learning how applied software engineering can make their theoretical models more efficient. It's kind of funny; I've seen this pattern in tech over and over: people discover a new thing, then optimize it using known things.
I've been thinking the same, and it's stuff you don't need some crazy ML degree to know how to do. A lot of the algorithms have been known for a while now. Milk it while you can.
What we need are "idea dice" or "concept dice" for CS – each side could have a vague architectural nudge like "parallelize", "interpret", "precompute", "predict and unwind", "declarative"...
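The "concept dice" idea could be sketched in a few lines; this is just a playful illustration using the nudges named above (the `roll` helper and face list are my own invention, not an existing tool):

```python
import random

# Hypothetical "concept dice": each face is a vague architectural nudge.
FACES = [
    "parallelize",
    "interpret",
    "precompute",
    "predict and unwind",
    "declarative",
]

def roll(n=1):
    """Roll n concept dice and return the nudges to try on your design."""
    return [random.choice(FACES) for _ in range(n)]
```

Rolling two dice might suggest, say, "precompute" plus "parallelize" as a starting point for rethinking a component.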
Unfortunately, I think the context rot paper [1] found that performance still degraded as context grew, even in models using attention sinks.
[1] https://hanlab.mit.edu/blog/streamingllm
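For readers unfamiliar with attention sinks: the StreamingLLM idea, roughly, is to keep the KV cache entries for the first few tokens (the "sinks") permanently, plus a sliding window of recent tokens. A toy sketch of that eviction policy, with all names and parameters my own simplification rather than the paper's actual implementation:

```python
from collections import deque

class SinkCache:
    """Toy StreamingLLM-style KV cache: permanent sink tokens + sliding window."""

    def __init__(self, num_sinks=4, window=8):
        self.num_sinks = num_sinks
        self.sinks = []                      # KV entries for the first tokens, kept forever
        self.recent = deque(maxlen=window)   # rolling window; deque evicts the oldest

    def append(self, kv):
        if len(self.sinks) < self.num_sinks:
            self.sinks.append(kv)
        else:
            self.recent.append(kv)

    def visible(self):
        # The entries the next attention step can attend to.
        return self.sinks + list(self.recent)
```

The point of the context rot finding is that even with this trick keeping attention numerically stable over long streams, quality on information deep in the context still drops off.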