Newsletter
Join the Community
Subscribe to our newsletter for the latest news and updates
An experimental tool for controlling reasoning depth in language models using explicit thinking tokens.
The Thinking Effort Processor is an experimental tool designed to control the reasoning depth of large language models by manipulating explicit thinking tokens. This repository provides a framework to dynamically adjust how much "thinking" a language model performs during text generation.
scale_factor parameter.<think> and </think> tokens.transformers and llama-cpp-python.Explore the repository to see how you can implement and experiment with reasoning depth in your language models!