
Something is definitely going wrong with your Qwen setup. In the link you posted, the session starts and ends with a compaction step because of a 4k-token context limit. Qwen 35b supports, I think, a context of 200k+ tokens (though I only run it with 128k), so that 4k limit looks like a major source of the problem.
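
To give a rough feel for why a 4k limit would force compaction so often, here is a toy Python sketch. The function name, the summary size, and the per-turn token counts are all made up for illustration, and this is not how LM Studio or any particular agent actually implements compaction.

```python
def count_compactions(turn_tokens, context_limit, summary_tokens=500):
    """Count how often a chat history would need compacting.

    Toy model (an assumption, not a real implementation): each turn appends
    tokens to the history. If adding a turn would exceed context_limit, the
    history is first replaced by a summary of summary_tokens tokens.
    """
    used = 0
    compactions = 0
    for tokens in turn_tokens:
        if used + tokens > context_limit:
            used = summary_tokens  # history collapsed into a short summary
            compactions += 1
        used += tokens
    return compactions


# 20 turns of about 1,500 tokens each (invented numbers)
turns = [1500] * 20

small = count_compactions(turns, context_limit=4_096)    # 4k limit
large = count_compactions(turns, context_limit=131_072)  # 128k limit

print(small, large)
```

Under these made-up numbers the 4k limit compacts on nearly every other turn, while the 128k limit never compacts at all, which would match a session that both starts and ends with a compaction step.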


Good call, I need to check if LM Studio is misconfigured.



