How Balyasny Asset Management built an AI research engine fo…
OpenAI NewsBalyasny Asset Management 是一家全球性的多策略投资公司,拥有约 180 个分布在不同资产类别和区域的投资团队。面对一个竞争激烈且瞬息万变的行业环境,信念、精准和速度对成功至关重要。随着金融数据体量激增, Balyasny 看到用 AI 重新设计投资研究流程的机会。
2022 年末, Balyasny 组建了一个名为 Applied AI 的集中团队:约 20 名研究人员、工程师与领域专家,负责开发原生嵌入团队工作流的 AI 工具。他们的旗舰产品是一套 AI 投资研究系统,旨在像资深分析师那样推理、检索并采取行动。
“ AI 让我们的团队能更快、更有结构地用第一性原理思考,且能处理更多数据。”
— Charlie Flanagan,Chief AI Officer
传统研究工作流的局限
投资研究既复杂又时效性强。分析师要筛阅成千上万份文件——从市场数据、券商研究到监管文件,人工专业不可替代,但传统方法既耗时又难以放大复制。
现成的 AI 工具往往不能同时处理结构化与非结构化数据,缺乏工作流编排,也未按机构合规标准构建。 Balyasny 需要一个为此目的量身打造的系统:既要像分析师一样思考,又要具备机器般的速度,并能在严格合规边界内运行。
规模化 AI 的四点经验
- 在部署前评估模型
在任何模型投入生产前, Balyasny 构建了金融领域内最复杂的评估流水线之一,从 12+ 维度衡量模型表现,包括预测准确性、数值推理、情景分析以及对噪声输入的鲁棒性。这些评估基于公司内部基准、工具和专有金融数据运行。严格的流程揭示了 GPT‑5.4 系列在多步骤规划、工具执行和减少幻觉方面的优势。如今, Balyasny 在其 AI 系统中把 GPT‑5.4 作为推理引擎,并与内部模型并用,按任务的实证表现逐一选用。
“我们像评估投资那样评估模型:看基本面。 GPT‑5.4 证明了它能严谨地计划、推理并执行。”
— Su Wang,高级研究科学家
- 让用户与 AI 深度协作
Balyasny 决定将 OpenAI 直接纳入面向用户的工作流。 OpenAI 团队亲自观摩投资团队如何使用系统:哪些场景成功、在哪些地方有难点、在商业环境中高性能究竟是什么样子。这样的直观反馈加快了迭代、缩短了产品反馈闭环,并改善了模型在金融专用任务中的行为。作为前沿模型发布的设计伙伴, Balyasny 把来自真实分析师而非测试样例的见解带入了 OpenAI 的产品路线图。
“我们不只是告诉 OpenAI 我们需要什么,而是把怎么用展示给他们看。这改变了很多。”
— Jonathan Park,产品经理
- 设计持续反馈回路,而非静态工具
由于 AI 深度嵌入投资团队的日常工作,团队能实时收集结构化反馈,覆盖用户评估、结果审计到工具执行质量。这个闭环推动模型与编排层的快速改进。举例来说,套利并购团队的早期反馈显示,代理(agent)需要在新文件或新闻发布出现时不断重新评估交易概率。 Balyasny 迅速扩展了代理的规划能力和工具访问,将缓慢的人工流程替换为实时的概率监控。 - 中央化系统,局部定制
尽管各投资团队策略各异, Balyasny 采取了中央化的 AI 部署模式。其 Applied AI 团队开发核心组件——代理框架、工具链与合规护栏——再分发到各团队,同时对数据和工具实行权限隔离。这种 “ federated deployment ” 模式让每个投资团队能够根据资产类别(如宏观、商品、权益)开发和使用定制化的 AI 代理,而 Applied AI 则专注于扩展架构、研究与模型评估。此举也确保合规与监管标准在全公司范围内得到遵守——在以风险管理和数据安全为命门的行业中尤为重要。
“我们早期在 AI 上的投入得到了回报。如今,我们每个投资团队都可以在安全环境下、并获得实时专家指导的前提下决定如何把最新的 AI 应用到自己的流程中。”
— Kevin Byrne,Chief Operating Officer
几个小时内见效的工作方式
目前约 95% 的 Balyasny 投资团队在积极使用他们的 AI 平台,速度、产出质量和分析师体验都有可量化的改善:
- 过去需要数天的深度研究任务现在可在数小时内完成,代理能综合数万份文件,包括备案文件、券商研究、财报和专家电话纪要等;
- 一位 “ Central Bank Speech Analyst ” 将宏观情景分析时间从两天压缩至约 30 分钟;
- 一位 “ Merger Arbitrage Superforecaster ” 代理现在持续监控并更新交易概率,取代了定制表格和人工提醒。
更重要的是,分析师对产出更有信心。借助权限受限的工具、可追溯的推理路径和可测试的代理,他们能输出结构化、可解释的洞见,进而增强判断力并辅助人为决策。
Balyasny 正在推动其 AI 路线图,重点包括:
- 通过 Reinforcement Fine-Tuning( RFT )提升模型在复杂高价值任务上的表现;
- 跨金融领域更深的代理编排;
- 包含财务图表、报表和备案文件在内的多模态输入;
- 对未来前沿模型的领域适配评估。
“这就像多了一个永不遗忘的队友,始终标注来源,并在返回结果前把细节复核一遍。”
— Charlie Sweat,Portfolio Manager
加入新工作时代
全球有超过 100 万家企业正借助 OpenAI 取得实质性成果。若有兴趣,请联系销售。
Balyasny Asset Management (Balyasny) is a global, multi-strategy investment firm with approximately 180 investment teams across diverse asset classes and geographies. The firm operates in a highly competitive and dynamic industry where conviction, precision, and speed are all critical to success. Facing an increasingly complex market environment with surging volumes of financial data, Balyasny saw an opportunity to reimagine the investment research process using AI.
In late 2022, Balyasny established an Applied AI team: a centralized group of 20 researchers, engineers, and domain experts tasked with building AI-native tools that embed directly into team-level workflows. Their flagship product, an AI investment research system, is designed to reason, retrieve, and act like a skilled analyst.
“AI is enabling our teams to apply first principles thinking faster, across more data, and with more structure.”—Charlie Flanagan, Chief AI Officer
Addressing limitations of legacy research workflows
Investment research is complex, high-stakes, and time-sensitive. Analysts must parse through thousands of documents, from market data and broker research to regulatory filings. Human expertise remains essential, but traditional methods are time-consuming and difficult to scale.
Off-the-shelf AI tools often can’t handle structured and unstructured data together, lack workflow orchestration, and aren’t built to meet institutional compliance standards. Balyasny needed something purpose-built: an AI system that could think like an analyst, move at the speed of a machine, and work within strict compliance boundaries.
Four lessons from Balyasny’s approach to AI at scale
1. Evaluate models before deploying them
Before any models went into production, Balyasny built one of the most sophisticated evaluation pipelines in finance, measuring models across 12+ dimensions including forecasting accuracy, numerical reasoning, scenario analysis, and robustness to noisy inputs. These evaluations are run against Balyasny’s internal benchmarks, tools, and proprietary financial data.
This rigorous process surfaced strengths in the GPT‑5.4 model family, particularly in multi-step planning, tool execution, and hallucination reduction. Today, Balyasny uses GPT‑5.4 as a reasoning engine within their AI system, alongside internal models, which are selected task-by-task based on empirical performance.
“We evaluate models the way we evaluate investments: on fundamentals. GPT-5.4 proved it could plan, reason, and execute with real rigor.”—Su Wang, Senior Research Scientist
2. Foster deep collaboration between users and AI partners
Balyasny made a strategic decision to involve OpenAI directly in user-facing workflows. OpenAI teams observed directly how investment teams use their AI system: where it succeeds, where it struggles, and what high performance actually looks like in a commercial context.
That visibility led to faster iterations, tighter product feedback loops, and better model behavior in finance-specific tasks. As a design partner for frontier model releases, Balyasny has influenced the OpenAI roadmap by surfacing insights from actual analysts, not test cases.
“We didn’t just tell OpenAI what we needed. We showed them. And that made all the difference.”—Jonathan Park, Product Manager
3. Design for feedback loops, not static tools
Because AI is deeply embedded in the day-to-day workflows of investment teams, they can collect structured feedback in real time on everything from user evaluations and outcome audits to tool execution quality. That loop drives rapid improvements to both models and the orchestration layer.
For example, early feedback from merger arbitrage teams revealed that agents needed to continuously re-evaluate deal probabilities as new filings or press releases came in. The Balyasny team quickly extended agent planning capabilities and tool access, replacing a slow, manual workflow with real-time probabilistic monitoring.
4. Centralize your AI system, and customize locally
While each investment team has a distinct investment strategy, Balyasny took a centralized approach to AI deployment. Their Applied AI team develops core components, including agent frameworks, toolchains, and compliance guardrails, which are then deployed across teams with scoped access to data and tools.
This “federated deployment” model means each investment team can develop and use AI agents tailored to their asset class (for example, macro, commodities, and equities), while the Applied AI team focuses on scaling architecture, research, and model evaluations. It also ensures that compliance and regulatory standards are universally respected—critical in an industry where risk management and data security are non-negotiable.
“Our early investments in AI paid off. Today, every one of our investment teams can decide how to apply the latest AI to their process, in a secure environment and with real-time expert guidance.”—Kevin Byrne, Chief Operating Officer
A playbook delivering results in hours—not days
Today, ~95% of Balyasny investment teams actively use their AI platform, with measurable impact across velocity, output quality, and analyst experience:
- Deep research tasks that once required days are now completed in hours, with agents synthesizing tens of thousands of documents, including filings, broker research, earnings, and expert calls.
- A Central Bank Speech Analyst cut macroeconomic scenario analysis time from 2 days to ~30 minutes.
- A Merger Arbitrage Superforecaster agent now monitors and updates deal probabilities continuously, replacing bespoke spreadsheets and manual alerts.
Just as importantly, analysts at Balyasny report higher confidence in outputs. With scoped tools, traceable reasoning paths, and testable agents, they use AI to deliver structured, explainable insights that increase conviction and inform human decision making.
Balyasny continues to expand its AI roadmap with a focus on:
- Reinforcement Fine-Tuning (RFT) to sharpen model behavior on complex, high-value tasks
- Deeper agent orchestration across financial domains
- Multimodal inputs including financial charts, statements, and filings
- Evaluation of future frontier models for domain fit
“It’s like adding a teammate who never forgets, always cites sources, and double-checks the details before sending anything back.”—Charlie Sweat, Portfolio Manager
Join the new era of work
More than 1 million businesses around the world are achieving meaningful results with OpenAI.Contact sales
Generated by RSStT. The copyright belongs to the original author.