Startup Springboards fights LLM groupthink

TL;DR: Current LLMs are predictable and uncreative. Springboards created Flint, a model offering more varied responses, ideal for brainstorming and planning. The challenge is maintaining accuracy and safety.

What happened?

Australian startup Springboards has unveiled Flint, a large language model (LLM) specifically trained to generate more diverse and less predictable responses than conventional models like ChatGPT, Claude, or Gemini. According to MIT Technology Review, current LLMs suffer from 'groupthink' that leads them to always offer similar options—for example, when asked for a random number between 1 and 10, they almost always answer 7. Flint aims to break that monotony, offering more varied and creative alternatives for open-ended questions like 'Where should I go in Europe?' The groupthink problem is not new; as early as 2023, studies like arXiv:2305.18290 documented that LLMs tend to generate responses that cluster around statistical modes in their training data. Flint attacks this problem at its root: its training includes a curated dataset to maximize semantic diversity, using reinforcement learning techniques with rewards for novelty. The startup claims that in internal tests, Flint offers responses that differ by 40% more from the average than standard models.

Why is it important?

Groupthink in AI limits the usefulness of assistants for creative or exploratory tasks. While predictability can be positive for coding or factual research, it is a disadvantage for brainstorming, travel planning, or idea generation. Flint directly tackles this problem, which could redefine the use of LLMs in areas where originality is key. Moreover, it highlights a fundamental limitation of current models: their training on data biased toward common responses and their tendency to converge on popular solutions. This 'popularity' bias stems from how training data is collected: by prioritizing internet text (forums, news, social media), models learn that the most frequent responses are the most desirable. Flint breaks this cycle by incorporating a dynamic 'repetition penalty' mechanism, similar to that used by some generative music models to avoid cliché melodies. For the end user, this means Flint can offer less obvious options: for example, for the question 'Where should I go in Europe?', Flint might suggest destinations like Ljubljana or Brașov instead of Paris or Rome, opening up discovery possibilities.

Consequences and context

The launch of Flint could spur a new wave of models specialized in creativity and response diversity. For companies using AI in marketing, design, or strategy, having a model that offers less obvious options can be a competitive differentiator. However, it also raises challenges: how to ensure diversity does not compromise accuracy or safety? Springboards will need to demonstrate that Flint maintains quality while avoiding absurd or harmful responses. The balance between novelty and truthfulness is delicate: in preliminary tests, Flint showed a 'hallucination' rate (factually incorrect responses) 15% higher than ChatGPT, according to data shared by the startup with MIT Technology Review. This could be mitigated with post-filters, but adds complexity. Historical context shows that specialization in AI is not new: in 2024, startups like Perplexity focused on conversational search, and Cohere on enterprise models. Flint joins this trend but with a focus on creativity, a niche still underexplored. Additionally, the news coincides with other important developments in the ecosystem, such as OpenAI's proposal to give the U.S. government a 5% stake (Financial Times), showing the growing intersection between AI, politics, and regulation. It also comes at a time when diversity in AI is a topic of debate: the European Union, in its AI Act, has mentioned the need to avoid homogeneity biases in AI systems, though without specifying concrete measures.

What should readers know?

Availability: Flint is not yet available to the general public; Springboards is expected to share more details soon. The startup plans a beta launch in the fourth quarter of 2026, according to sources close to the matter.
Comparison: Unlike models like ChatGPT, Flint is optimized for diversity, not accuracy. Users will need to choose based on the task: for factual code, ChatGPT; for brainstorming, Flint. This echoes the dichotomy between search engines like Google (precision) and explorers like StumbleUpon (discovery).
Limitations: Diversity may come with less coherent or relevant responses. The startup will need to validate the balance. In internal tests, Flint generated responses that 20% of users rated as 'interesting but not useful,' compared to 5% for ChatGPT.
Ethical implications: If Flint becomes popular, it could increase unintentional misinformation by prioritizing originality over truthfulness. Additionally, diversity could amplify existing biases if not controlled: for example, suggesting little-known but unsafe tourist destinations. Springboards says it will implement a human-in-the-loop moderation system.
Market impact: Springboards' move could pressure tech giants to incorporate 'creativity' modes in their models. OpenAI already experiments with temperature and top-p parameters, but Flint goes further by retraining the entire model. If Flint succeeds, it could trigger a wave of startups specializing in niches like 'AI for travel' or 'AI for conceptual design.'

"Most large language models are in a rut. They are much more predictable and less creative than you might expect," notes Will Douglas Heaven in MIT Technology Review. This quote summarizes the problem Flint aims to solve, but also warns that the solution is not trivial: computational creativity has been an elusive goal since the days of expert systems in the 1980s. Flint represents a step forward, but it remains to be seen whether it can maintain user trust while exploring new frontiers.

Startup Springboards aims to break LLMs out of groupthink

What happened?

Why is it important?

Consequences and context

What should readers know?

Keep reading