World Lifestyler
  • Art & Culture
    • Architecture
    • Art & Exhibitions
    • Books
    • Design
    • Film & Music
  • Competitions
    • Dining Experiences
    • Hotel Stays
    • Luxury Experiences
    • Product Giveaways
    • Reader Exclusives
    • Travel Giveaways
  • Food & Drink
    • Chefs
    • Coffee Culture
    • Food Destinations
    • Recipes
    • Restaurants
    • Wine & Spirits
  • Lifestyle
    • Design
    • Fashion
    • Health & Wellbeing
    • Homes & Property
    • Love & Romance
  • People
    • Creatives
    • Entrepreneurs
    • Icons
    • Interviews
    • Profiles
    • Rising Talent
  • Travel
    • Adventure & Experience Travel
    • City Guides
    • Destinations
    • Hotels
    • Secret Spots
    • Travel Trends
  • Art & Culture
    • Architecture
    • Art & Exhibitions
    • Books
    • Design
    • Film & Music
  • Competitions
    • Dining Experiences
    • Hotel Stays
    • Luxury Experiences
    • Product Giveaways
    • Reader Exclusives
    • Travel Giveaways
  • Food & Drink
    • Chefs
    • Coffee Culture
    • Food Destinations
    • Recipes
    • Restaurants
    • Wine & Spirits
  • Lifestyle
    • Design
    • Fashion
    • Health & Wellbeing
    • Homes & Property
    • Love & Romance
  • People
    • Creatives
    • Entrepreneurs
    • Icons
    • Interviews
    • Profiles
    • Rising Talent
  • Travel
    • Adventure & Experience Travel
    • City Guides
    • Destinations
    • Hotels
    • Secret Spots
    • Travel Trends
No Result
View All Result
WORLD LIFESTYLER
No Result
View All Result
Home Press Releases

Leni Tops Four Major AI Benchmarks, Outperforming Systems from OpenAI, Anthropic, Google, and Perplexity

Cision PR Newswire by Cision PR Newswire
May 12, 2026
in Press Releases
Reading Time: 4 mins read
0
Share on FacebookShare on Twitter

NEW YORK, May 12, 2026 /PRNewswire/ — Leni, an AI-powered analytics platform for commercial real estate, today announced top-tier results on four independent AI benchmarks. Leni placed first on the DRACO Benchmark for deep research, in the top two on SpreadsheetBench Verified, outperformed every public model on BullshitBench, and ranked ahead of Genspark, Manus and OpenAI Deep Research on GAIA.


Leni is a purpose-built, accuracy-focused platform designed for enterprise grade investment and asset management work. Our expertise lies in CRE and adjacent sectors of investments, offering top-tier security, enterprise-specific context, and seamless industry data integrations for organizations committed to leveraging AI for serious day-to-day workflows and investment work. (PRNewsfoto/Leni)

“Most teams obsess over models, but the key engineering needed for effective AI adoption, which delivers highly accurate and reliable results for teams, relies on architecture or harness,” said Leni CEO and Co-Founder Arunabh Dastidar. “That’s why the most popular coding tool today is 98 percent harness and 2 percent models. We called it years ago and have produced purpose-built infrastructure that can reliably be used for serious work where accuracy and security are crucial. It shifts the work from babysitting and guessing to trusted, verifiable output, so teams can move faster with confidence.”

DRACO, developed by Perplexity AI and Harvard, measures whether AI can produce in-depth research that a senior analyst would sign off on. Leni scored 71.6 percent, ahead of the deep research products from Perplexity, Google, and OpenAI. SpreadsheetBench Verified, which grades AI on hundreds of real spreadsheet tasks, ranked Leni in the top two globally, completing 365 of 400 tasks correctly. On BullshitBench (Version 2), which tests whether AI pushes back on nonsensical questions instead of inventing an answer, Leni caught 98 percent of fabricated premises, ahead of all 142 public AI models on the leaderboard. GAIA, developed by Meta and HuggingFace, measures whether AI can complete real-world tasks that involve multiple steps without making mistakes early on, which would throw off the final answer. Leni scored 77.0 percent on the validation set, ahead of Genspark, Manus, and OpenAI Deep Research. In commercial real estate, where the margin for error is zero, these benchmarks measure whether a system can accurately produce the analysis that determines the closing of a deal.

The results matter because the gap between AI promise and AI reliability is costing companies real money, according to Dastidar. A staggering 99 percent of companies reported financial losses tied to AI-related risks, with an average loss of $4.4 million per company and an estimated $4.3 billion across the 975 respondents, according to an EY survey published in October 2025. The pattern is prevalent in commercial real estate, where 92 percent of CRE firms have piloted AI but only 5 percent say they have achieved all of their AI goals, according to JLL’s 2025 Global Real Estate Technology Survey.

“If I had to describe Leni’s impact, it’s simple: faster and easier,” said Scott Jones, Vice President of IT at Ram Realty Advisors. “On the asset management side in particular, teams are no longer stuck doing manual work. The data flows directly from the source, and they can trust it. Leni shifts the focus away from aggregating information and building reports to what actually matters: finding deals, executing them better, and running assets more effectively.”

Leni’s agentic AI platform is designed for investment, asset management, and operations teams across commercial real estate, pulling data from PDFs, spreadsheets, and core systems to execute complex workflows end to end. At the platform’s core is its Universal Data Model (UDM), the industry’s first standardized data framework for multifamily real estate, developed over three years by a team that includes alums from MIT, Greystar, EY, and Geoffrey Hinton’s Vector Institute. The UDM creates a common language for a sector long defined by proprietary formats and data silos, integrating across every major real estate system. The result is secure, model-agnostic automation that delivers decision-ready outputs without requiring in-house AI infrastructure.

“Trust is the most important part of any AI system that a business actually uses,” said Leni’s Head of Industry Strategy, Marcio Sahade, who previously spent 14 years at firms such as Tishman Speyer and Hines. “If a team cannot rely on what comes back, they end up redoing the work themselves, and the AI never delivers on its promise.”

He added, “What these benchmarks measure is exactly that gap: whether a system can be trusted to produce finished work, not just plausible-sounding output. That is the bar we hold ourselves to with every customer.”

About Leni
Leni is a secure, accuracy-driven AI platform purpose-built for serious investment work across the commercial real estate, lending, and investment sectors. Since its public launch in 2023, the company has raised $8.5 million to build best-in-class AI infrastructure for the sector. Leni enables accurate, secure, and context-aware deliverables for investment and asset management teams. The platform today supports a total portfolio of over $40 billion assets under management. For more information, visit: http://www.leni.co.


Leni ranked No. 1 across all three GAIA difficulty levels, showing its ability to complete complex, multi-step tasks that require research, reasoning and reliable execution.

Cision View original content to download multimedia:https://www.prnewswire.com/news-releases/leni-tops-four-major-ai-benchmarks-outperforming-systems-from-openai-anthropic-google-and-perplexity-302769724.html

SOURCE Leni

Cision PR Newswire

Cision PR Newswire

Related Posts

ACQUALINA RESORT MARKS 20 YEARS WITH NEW DINING, WELLNESS AND LUXURY EXPERIENCES

May 12, 2026

HARRAH’S RESORT SOUTHERN CALIFORNIA AND THE RINCON TRIBE UNVEIL $13.1M RENOVATED HIGH LIMIT AREA

May 12, 2026

Snowhawk Announces Final Close of Inaugural Digital Infrastructure Fund with Approximately $1.3 Billion of Total Commitments

May 12, 2026

Higher Wine Spending Masks Ongoing Industry Challenges: 2026 BMO Wine Market Report

May 12, 2026

Blue Hour Studios Launches Board of Creators to Connect Brands Directly to Culture

May 12, 2026

In HelloNation, Hydraulic Systems Expert Mike Bonner Discusses Choosing Between Hydraulic Repair & Replacement

May 12, 2026

Popular News

  • HARRAH’S RESORT SOUTHERN CALIFORNIA AND THE RINCON TRIBE UNVEIL $13.1M RENOVATED HIGH LIMIT AREA

    0 shares
    Share 0 Tweet 0
  • ACQUALINA RESORT MARKS 20 YEARS WITH NEW DINING, WELLNESS AND LUXURY EXPERIENCES

    0 shares
    Share 0 Tweet 0
  • Higher Wine Spending Masks Ongoing Industry Challenges: 2026 BMO Wine Market Report

    0 shares
    Share 0 Tweet 0
  • Snowhawk Announces Final Close of Inaugural Digital Infrastructure Fund with Approximately $1.3 Billion of Total Commitments

    0 shares
    Share 0 Tweet 0
  • Blue Hour Studios Launches Board of Creators to Connect Brands Directly to Culture

    0 shares
    Share 0 Tweet 0

About & Contact

  • About Us
  • Branding Style Guide
  • Contact Us
  • Help Centre
  • Media Kit
  • Site Map

Explore Content

  • Events
  • Newsletter
  • Press Releases
  • Topics

Legal & Privacy

  • Advertiser & Partner Policy
  • Communications & Newsletter Policy
  • Contributor Agreement
  • Copyright Policy
  • Privacy Policy
  • Prohibited Content Policy
  • Terms of Service

Tiny Media Brands

  • Silicon Valleys Journal
  • The AI Journal
  • The City Banker
  • The Wall Street Banker
  • World Lifestyler

© 2025 World Lifestyler

No Result
View All Result
  • Home

© 2025 World Lifestyler