AI Visibility Rankings · AI

AI evaluation platforms: AI search visibility ranking (2026)

By monitoraeo Last updated July 18, 2026 Refreshed every 30 days

How AI search engines rank ai evaluation platforms by visibility and citations. 20 brands measured monthly across Google AI Mode: which brands the AI names in answers, which domains it cites as sources, and how the leaders compare. AI evaluation platforms used to benchmark LLM outputs, score quality, automate evals, and improve reliability before and after deployment. Composite score: 70% visibility (% of AI answers naming the brand) + 30% citation rate (% citing the brand's domain). Full methodology →

Avg visibility across category

13%

Avg citation rate

20/20

Brands successfully audited

X LinkedIn

When buyers inquire about AI evaluation platforms, AI engines frequently highlight Braintrust as the leader, significantly ahead of the pack with a visibility rate of 37.5%. Other notable mentions include Maxim AI and LangSmith, though they trail behind Braintrust in both visibility and citations. Braintrust's reputation shines through as the most cited brand, standing at an impressive 75.0%.

The rankings are influenced by which sources AI engines pull data from. Top citations come from domains like getmaxim.ai, braintrust.dev, and broader platforms like reddit.com and medium.com. These sources act as aggregator and review sites, playing a pivotal role in shaping the visibility and authority of each brand in the list.

For buyers navigating this page, it's essential to consider the AI engines' preferences for brands with robust third-party validation. Recent reviews and demonstrated topical authority, especially on specific subcategories, weigh heavily in AI evaluations. Selecting a platform should thus involve examining how well these brands are cited by trusted sources and communities online.

At a glance

Category leader Braintrust 38% visibility · named in 3 of 8 AI answers

Most cited brand Braintrust 75% citation rate · the AI's most-trusted source brand in ai evaluation platforms

Top cited domain getmaxim.ai Referenced by AI across the ai evaluation platforms query set — the highest-leverage PR target in this category

Visibility spread 38pp Gap between top and bottom of the ranking · 14 brands at 0% (invisible to the AI)

What we observed in this category

Braintrust leads the AI evaluation platforms category with a composite score of 48.8, nearly 13 points ahead of second-ranked Maxim AI at 36.2. LangSmith shares Braintrust's 37.5% visibility score yet sits third due to a far lower citation rate. The category average visibility of 6.9% underlines how concentrated AI Mode attention is at the top, with ranks 8 through 10 (Humanloop, Arize AI, Promptfoo) recording zero visibility and zero citations.

The sharpest divergence between being named and being trusted as a source appears at rank 3. LangSmith matches Braintrust on visibility at 37.5% but holds only a 12.5% citation rate, compared to Braintrust's 75%. Conversely, Galileo and DeepEval each carry 50% citation rates despite only 12.5% visibility, suggesting AI Mode trusts their content when it surfaces them but does not surface them frequently. OpenAI Evals is cited at 12.5% despite zero visibility, meaning it is referenced without being named in direct answers.

Google AI Mode is the top engine for every brand in the dataset, indicating the audit is effectively a single-engine snapshot. The cited sources list is anchored by brand-owned domains (getmaxim.ai, braintrust.dev, confident-ai.com) alongside community and editorial channels including reddit.com, medium.com, and youtube.com. The presence of mlflow.org and institutepm.com in the citation list suggests AI Mode is pulling from open-source project documentation and practitioner-facing content when constructing answers in this category.

Movers & shakers since last refresh

Biggest visibility risers

LangSmith 0% → 38% · rank #4 → #3
+38pp
Maxim AI 0% → 25% · rank #3 → #2
+25pp
Braintrust 25% → 38% · rank #1 → #1
+12pp

Biggest visibility fallers

Galileo 25% → 12% · rank #2 → #4
-12pp

The ranking

#	Brand	Visibility	Citation	Top engine
1	Braintrust braintrust.dev	38%	75%	Google AI Mode
Braintrust leads all brands with a 75% citation rate, more than double the category average of 13.1%, and holds the highest composite score at 48.8.
2	Maxim AI getmaxim.ai	25%	62%	Google AI Mode
Maxim AI rose from zero to 25% visibility this period and its domain (getmaxim.ai) ranks as the top cited source in the entire category.
3	LangSmith langchain.com	38%	12%	Google AI Mode
LangSmith matches Braintrust on visibility at 37.5% but its 12.5% citation rate is dramatically lower, driving its composite score down to 30.0.
4	Galileo galileo.ai	12%	50%	Google AI Mode
Galileo dropped two rank positions as visibility fell from 25% to 12.5%, yet its citation rate rose 37.5 points to 50%, signalling a trust-without-reach dynamic.
5	DeepEval confident-ai.com	12%	50%	Google AI Mode
DeepEval ties Galileo on both visibility (12.5%) and citation rate (50%), with its domain confident-ai.com appearing in the top cited sources list for the category.
6	Ragas explodinggradients.com	12%	0%	Google AI Mode
7	OpenAI Evals openai.com	0%	12%	Google AI Mode
8	Humanloop humanloop.com	0%	0%	Google AI Mode
9	Arize AI arize.com	0%	0%	Google AI Mode
10	Promptfoo promptfoo.dev	0%	0%	Google AI Mode
11	Parea parea.ai	0%	0%	Google AI Mode
12	Giskard giskard.ai	0%	0%	Google AI Mode
13	TruLens trulens.org	0%	0%	Google AI Mode
14	Athina AI athina.ai	0%	0%	Google AI Mode
15	HoneyHive honeyhive.ai	0%	0%	Google AI Mode
16	Weights & Biases wandb.ai	0%	0%	Google AI Mode
17	Fiddler AI fiddler.ai	0%	0%	Google AI Mode
18	Patronus AI patronus.ai	0%	0%	Google AI Mode
19	Comet comet.com	0%	0%	Google AI Mode
20	Vellum vellum.ai	0%	0%	Google AI Mode

Sources AI engines trust in this category

Across the 8 buyer-intent queries we ran on ai evaluation platforms, these are the domains Google AI Mode cited most often. If you're not on this list — or if your competitors are — that's a concrete PR / linkbuilding target.

getmaxim.aibraintrust.devreddit.commedium.comyoutube.comconfident-ai.commlflow.orginstitutepm.com

How to read this ranking

Four things worth knowing before you act on the numbers above. These are the same definitions across every industry page — for category-specific observations, see the What we observed section above (where available) and the per-brand insights inline in the ranking.

Visibility = being named

A brand's visibility % is the share of AI answers that mention it by name in the response prose. This is who AI engines actively recommend to the buyer. More on visibility →

Citation rate = being trusted

Citation rate is the share of AI answers that include the brand's domain as a clickable source link. This is what the AI treats as authoritative evidence, different from being named. More on citation rate →

Top engine differs by brand

The "top engine" column shows which AI surface each brand performs best on. Big gaps between a brand's score across engines usually points to specific content or schema gaps. How AI engines pick sources →

Rankings move month to month

AI engines re-crawl and re-rank on shorter cycles than classical search. We re-audit every brand on this list at least every 30 days and refresh this page automatically. How AI search ranking works →

Get your own ai evaluation platforms brand audited

The brands above were curated from public market-leader lists. Want the same measurement against your own brand — including the queries you appear on, which competitors get named instead, and a prioritised fix list? Run a free preview.

Audit your ai evaluation platforms brand → Browse all rankings Methodology →

Frequently asked about ai evaluation platforms AI visibility

Who leads AI visibility in AI evaluation platforms?

Braintrust leads with a composite score of 48.8 and a 75% citation rate, the highest of any brand in the category. Its nearest competitor, Maxim AI, scores 36.2.

What is the average visibility for brands in the AI evaluation platforms category?

The category average visibility is 6.9% and the average citation rate is 13.1%, indicating that meaningful AI Mode presence is concentrated in only a handful of brands.

Which brands appear in AI answers without being frequently named?

Galileo and DeepEval each hold a 50% citation rate despite only 12.5% visibility, and OpenAI Evals is cited at 12.5% while recording zero visibility.

What sources does AI Mode cite most for AI evaluation platform research?

The top cited sources include brand-owned domains (getmaxim.ai, braintrust.dev, confident-ai.com) alongside reddit.com, medium.com, youtube.com, mlflow.org, and institutepm.com.

Which brands saw the biggest visibility gains in the latest period?

LangSmith and Maxim AI both rose from zero visibility to 37.5% and 25% respectively, while Braintrust grew a further 12.5 percentage points to reach 37.5%.

Are any well-known brands invisible in Google AI Mode for this category?

Humanloop, Arize AI, and Promptfoo all record zero visibility and zero citations, meaning they do not currently appear in or contribute to Google AI Mode answers for AI evaluation platforms.