
That is RNNs. LLMs (usually transformers) are non-Markovian.


> That is RNNs. LLMs (usually transformers) are non-Markovian.

How? They have a hard context cutoff and randomly generate the next state from whatever is inside that window. That is the definition of a Markov chain.

It is a Markov chain with a pretty large state at every point (the entire context window), but still a Markov chain.
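
To make that concrete, here is a rough toy sketch, not a real model. CONTEXT, VOCAB and next_token_probs are all made up for illustration; the only assumption that matters is that the next-token distribution is a function of the last CONTEXT tokens and nothing else.

```python
import random
from typing import Sequence, Tuple

CONTEXT = 3               # hypothetical hard context cutoff, in tokens
VOCAB = ("a", "b", "c")   # hypothetical toy vocabulary

State = Tuple[str, ...]


def state_of(history: Sequence[str]) -> State:
    """The Markov state is just the last CONTEXT tokens of the history."""
    return tuple(history[-CONTEXT:])


def next_token_probs(window: State) -> Tuple[float, ...]:
    """Toy stand-in for a model: a fixed distribution per window.

    The weights are arbitrary (derived from a RNG seeded by the window);
    the point is only that nothing outside the window can influence them.
    """
    rng = random.Random("|".join(window))
    weights = [rng.random() + 0.1 for _ in VOCAB]
    total = sum(weights)
    return tuple(w / total for w in weights)


def step(state: State, rng: random.Random) -> State:
    """One Markov transition: sample a token, then slide the window."""
    token = rng.choices(VOCAB, weights=next_token_probs(state))[0]
    return (state + (token,))[-CONTEXT:]


# Two histories that differ only in tokens that fell out of the window
long_history = ["c", "c", "a", "b", "a", "b", "c"]
short_history = ["a", "b", "c"]

same_state = state_of(long_history) == state_of(short_history)
same_probs = (
    next_token_probs(state_of(long_history))
    == next_token_probs(state_of(short_history))
)
```

Both histories land in the same state and get the same next-token distribution, which is the Markov property. With a vocabulary of size V and a window of k tokens the state space has up to V^k states, hence "pretty large".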



