Skip to content
1 min read chain-of-thought

START: Revolutionizing AI with a new way of thinking

Discover START: A New Generation of AI That Thinks Phased and Uses Problem-Solving Tools Learn how it works and how it will impact our planet.

 

Have you ever wondered why AI sometimes answers the wrong questions or gives you unreasonable information?

This problem is about to disappear because researchers from Alibaba and the University of Science and Technology of China have developed a new AI called START (Self-Taught Reasoner with Tools).

START is not just an ordinary AI, but it is an AI that thinks step by step and knows how to use thinking tools, just like we use calculators or computer programs to solve difficult problems.


How does START work? What makes START different?

START is an AI that builds on the concept of Large Reasoning Models (LRMs) or large language models that focus on reasoning.


START mechanism

(A little technical, you can skip it.)

Illustration of the operation of START from the research paper

START works through two main processes:

  1. Hint-infer :
    • At this stage, START inserts "hints" (hints). into the reasoning process to encourage the model to run external tools.
    • An example of a hint such as "Wait, maybe using Python here is a good idea."
  2. Hint Rejection Sampling Fine-Tuning (Hint-RFT) :
    • This step uses the results from the Hint-infer to be screened, scored, and refined to create a high-quality dataset (DSEED).
    • The dseed is then taken to fine-tune the base model (QwQ-32B-Preview) to create a START-0.
    • START-0 will be used to create a richer dataset (DSTART), which will lead to a final fine-tune to create a START.

How good is START? Examples of amazing talents

START is not just "good", but "very good".
Let's take a look at some examples of START's amazing capabilities.

What's even more amazing is that START can do these things without being told every step.

Surprisingly good (secretly scared slightly 😆)


Technology Behind START: QwQ-32B-Preview and Fine-Tuning

START is built on the basis of the QwQ-32B-Preview model, a highly efficient Large Language Model (LLM) that uses Python as an important tool to help think and process data.

START also uses a two-phase fine-tuning process to fine-tune the model for better reasoning and tooling.


How will START change our world? Application Potential

START has the potential to revolutionize and transform our world in many ways, such as:


The future of AI that "thinks" is not just "remembered"

START is not just an ordinary AI, but a giant leap forward in the field of artificial intelligence technology. It shows that AI can really "think", not just remember information to answer.

Although START still has some limitations, such as the ability to work with languages other than Python.

But it also opens the door to a new world of AI that is smarter, more context-sensitive, and ready to help humans solve more complex problems in the future.

Who knows, in the next few years, we may see AI that can interact and reason like a real human.

START could be the beginning of a major revolution in AI that will change our world forever.


Chat with research papers


References