Universal Intelligence Benchmark

API Access for Researchers — All data and models from this series are available via the API Gateway. Get your API key →

Abstract mathematical geometry — intelligence measurement

Benchmark Research · Stabilarity Research Hub

Inference-Agnostic Intelligence Measurement for the Post-Text Era

Oleh Ivchenko¹

¹ Odesa National Polytechnic University (ONPU)

Type: Meta-Research
Status: Ongoing · 0/11 articles · 2026–ongoing
Links: GitHub

11 Articles Planned · 3 Research Phases · 2026–ongoing · In Progress

Abstract

Current AI benchmarks measure a narrow slice of intelligence — predominantly text comprehension and generation. As AI systems evolve into embodied agents, multimodal reasoners, and autonomous planners, the measurement instruments have not kept pace. This series conducts a systematic meta-meta-analysis of 200+ benchmark studies, exposes the dimensional blind spots in current evaluation frameworks, and proposes the Universal Intelligence Benchmark (UIB): an inference-agnostic, eight-dimensional measurement framework covering causal reasoning, embodied task completion, temporal planning, social cognition, tool creation, cross-domain transfer, multimodal synthesis, and resource-normalized efficiency. The goal is not another leaderboard — it is a fundamental rethinking of what “intelligence” means when the system under test is no longer just a language model.

Interactive Tool

Try the UIB Benchmark Tool

Run benchmark evaluations, explore the eight intelligence dimensions, and compare model scores on the live leaderboard.

Open UIB Benchmark API Documentation

Idea and Motivation

Every frontier AI model now scores above 90% on MMLU, HumanEval, and HellaSwag. The benchmarks are saturated. Meanwhile, these same models fail at causal reasoning, long-horizon planning, and embodied tasks. The measurement instruments have become the bottleneck — not the systems being measured.

This series begins from a simple observation: when every leading system aces the test, the test is no longer measuring what matters. Goodhart’s Law has taken hold — models are optimised for benchmark performance rather than genuine cognitive capability. We need benchmarks that are agnostic to inference modality and test genuine cognitive capabilities across dimensions that current frameworks ignore entirely.

Goal

Develop and validate a universal, inference-agnostic intelligence measurement framework (UIB) through systematic meta-research, dimensional analysis, and open-source implementation. The framework must be applicable to any AI system — text-based, multimodal, embodied, or hybrid — without privileging any particular inference modality or architectural paradigm.

The end product is not a single paper but a complete research programme: theoretical foundations, per-dimension measurement instruments, a composite scoring methodology, and an open-source benchmark suite that the research community can adopt, critique, and extend.

Scope

The series covers 11 articles across three research phases:

Table 1. Research phases and thematic coverage
Phase	Focus Area	Key Topics
1 — Foundation	Measurement Crisis	Meta-meta-analysis of 200+ benchmark studies, benchmark saturation diagnosis, Goodhart’s Law in AI evaluation, construct validity analysis, theoretical UIB framework proposal
2 — Dimension Deep-Dives	Eight UIB Dimensions	Causal reasoning vs pattern matching, embodied task completion, temporal planning and long-horizon goals, social cognition, tool creation, cross-domain transfer, multimodal synthesis, resource-normalized efficiency
3 — Synthesis	Integration and Implementation	Composite scoring methodology, dimensional weighting, open-source benchmark suite, empirical validation protocol, 10-year measurement obsolescence projections

The Eight UIB Dimensions

The UIB framework measures intelligence across eight orthogonal dimensions. The radar chart below visualises placeholder scores across all dimensions, representing the measurement space the benchmark covers.

Focus

The primary analytical focus is on the gap between what current benchmarks measure and what constitutes genuine intelligence. Six areas receive sustained attention throughout the series:

Benchmark saturation and Goodhart’s Law — documenting how optimisation pressure has rendered major benchmarks uninformative.
Construct validity of current AI evaluations — examining whether benchmarks actually measure the constructs they claim to measure.
Causal reasoning vs pattern matching — distinguishing genuine causal understanding from statistical correlation exploitation.
Embodied and multimodal intelligence — measuring capabilities that require physical or cross-modal reasoning.
Resource-normalized efficiency scoring — evaluating intelligence per unit of compute, data, and energy.
Open-source benchmark implementation — delivering usable evaluation tools, not just theoretical frameworks.

Limitations

Black-box evaluation onlyNo proprietary model internals are accessed. All evaluation is conducted through inference-time observation, limiting analysis of internal representations.

Theoretical until Phase 3The UIB framework remains theoretical until empirical validation in the synthesis phase. Early articles propose; later articles test.

Incomplete human baselinesHuman baselines may be incomplete for novel dimensions such as tool creation and cross-domain transfer, where no established psychometric instruments exist.

Ground truth gapsSome dimensions — particularly social cognition and tool creation — lack established ground truth, making evaluation design inherently more speculative.

Scientific Value

The series makes five contributions to the field. First, it provides the first systematic meta-meta-analysis of AI benchmark research — examining not individual benchmarks but the research practices and assumptions underlying benchmark design itself. Second, it proposes the novel eight-dimensional UIB framework as an alternative to single-score leaderboard evaluation. Third, it delivers an open-source benchmark suite designed for community adoption, replication, and extension.

Fourth, it introduces a resource-efficiency normalization methodology that evaluates intelligence relative to computational cost — addressing the growing concern that raw capability scores mask enormous differences in inference expense. Fifth, it produces 10-year measurement obsolescence projections, offering the research community a structured forecast of when current evaluation instruments will lose discriminative power.

Cross-Series Integration

This series draws on and feeds back into the entire Stabilarity research ecosystem:

Table 2. Cross-references to Stabilarity research series
Series	Connection	API Endpoint
AI Economics	ROI vs benchmark score correlation	`/v1/tools/roi`
Cost-Effective AI	Model efficiency scoring	`/v1/tools/risk`
HPF-P Framework	Decision Readiness as intelligence proxy	`/v1/hpf/analyze`
AI Observability	Runtime benchmark monitoring	`/v1/uib/status`
Capability-Adoption Gap	Gap between scores and deployment	`/v1/tools/classify`
Open Humanoid	Embodied dimension validation	—
Future of AI	Benchmark obsolescence prediction	—
Geopolitical Risk	AI capability distribution by nation	`/v1/geo-risk/data/countries`
ScanLab	Domain-specific medical intelligence	`/v1/scanlab/predict`

Key References

Schmidhuber, J. (2024). “Annotated History of Modern AI and Deep Learning.” arXiv:2212.11279v7.
Schmidhuber, J. (2009). “Ultimate Cognition à la Gödel.” Cognitive Computation 1(2):177–193.
Legg, S. & Hutter, M. (2007). “Universal Intelligence: A Definition of Machine Intelligence.” Minds and Machines 17(4):391–444.
Chollet, F. (2019). “On the Measure of Intelligence.” arXiv:1911.01547.
Ivchenko, O. (2026). “Model Benchmarking for Business.” Stabilarity Research Hub.

Resources

GitHub Repository→
Stabilarity Research Hub→
API — status, run, leaderboard, dimensions→
Interactive UIB Benchmark Tool→
Jupyter Notebooks — coming soon

Status

In progress. 0 of 11 articles published. Series launched March 2026. Phase 1 (Foundation) is in active development. Articles will be published sequentially and listed below as they become available.

Contribution Opportunities

Researchers wishing to engage with or build on this work are encouraged to consider the following directions:

Benchmark archaeology: Contribute to the meta-meta-analysis by identifying benchmark studies not covered in the initial 200+ survey, particularly from non-English-language research communities.
Dimension proposals: Suggest additional intelligence dimensions not covered by the eight-dimensional UIB framework, with supporting psychometric or cognitive science literature.
Empirical validation: Run UIB evaluation protocols against frontier models once the Phase 3 benchmark suite is released, contributing results to the open dataset.
Efficiency measurement: Develop or refine resource-normalization metrics that account for hardware heterogeneity, energy costs, and inference latency across deployment contexts.
Human baselines: Design and conduct psychometric studies establishing human performance baselines on novel UIB dimensions, particularly tool creation and cross-domain transfer.

Published Articles

Meta-Research · 14 published

By Oleh Ivchenko

Benchmark research based on publicly available meta-analyses and reproducible evaluation methods.

All Articles

The Meta-Meta-Analysis: A Systematic Map of What 200 AI Benchmark Studies Actually Measured DOI 5/10 40stabilfr·wdophcgmx

Badge	Metric	Value	Status	Description
[s]	Reviewed Sources	6%	○	≥80% from editorially reviewed sources
[t]	Trusted	39%	○	≥80% from verified, high-quality sources
[a]	DOI	11%	○	≥80% have a Digital Object Identifier
[b]	CrossRef	0%	○	≥80% indexed in CrossRef
[i]	Indexed	39%	○	≥80% have metadata indexed
[l]	Academic	39%	○	≥80% from journals/conferences/preprints
[f]	Free Access	44%	○	≥80% are freely accessible
[r]	References	18 refs	✓	Minimum 10 references required
[w]	Words [REQ]	2,359	✓	Minimum 2,000 words for a full research article. Current: 2,359
[d]	DOI [REQ]	✓	✓	Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19001033
[o]	ORCID [REQ]	✓	✓	Author ORCID verified for academic identity
[p]	Peer Reviewed [REQ]	—	✗	Peer reviewed by an assigned reviewer
[h]	Freshness [REQ]	32%	✗	≥60% of references from 2025–2026. Current: 32%
[c]	Data Charts	0	○	Original data charts from reproducible analysis (min 2). Current: 0
[g]	Code	—	○	Source code available on GitHub
[m]	Diagrams	3	✓	Mermaid architecture/flow diagrams. Current: 3
[x]	Cited by	0	○	Referenced by 0 other hub article(s)

Score = Ref Trust (33 × 60%) + Required (3/5 × 30%) + Optional (1/4 × 10%)

Meta-Research · Mar 13, 2026 · 12 min read

The Measurement Crisis: Saturation, Goodhart's Law, and the End of AI Leaderboards DOI 10/10 44stabilfr·wdophcgmx

Badge	Metric	Value	Status	Description
[s]	Reviewed Sources	0%	○	≥80% from editorially reviewed sources
[t]	Trusted	40%	○	≥80% from verified, high-quality sources
[a]	DOI	40%	○	≥80% have a Digital Object Identifier
[b]	CrossRef	0%	○	≥80% indexed in CrossRef
[i]	Indexed	40%	○	≥80% have metadata indexed
[l]	Academic	40%	○	≥80% from journals/conferences/preprints
[f]	Free Access	80%	✓	≥80% are freely accessible
[r]	References	5 refs	○	Minimum 10 references required
[w]	Words [REQ]	3,057	✓	Minimum 2,000 words for a full research article. Current: 3,057
[d]	DOI [REQ]	✓	✓	Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19007432
[o]	ORCID [REQ]	✓	✓	Author ORCID verified for academic identity
[p]	Peer Reviewed [REQ]	—	✗	Peer reviewed by an assigned reviewer
[h]	Freshness [REQ]	33%	✗	≥60% of references from 2025–2026. Current: 33%
[c]	Data Charts	0	○	Original data charts from reproducible analysis (min 2). Current: 0
[g]	Code	—	○	Source code available on GitHub
[m]	Diagrams	3	✓	Mermaid architecture/flow diagrams. Current: 3
[x]	Cited by	0	○	Referenced by 0 other hub article(s)

Score = Ref Trust (39 × 60%) + Required (3/5 × 30%) + Optional (1/4 × 10%)

Meta-Research · Mar 13, 2026 · 15 min read

Inference-Agnostic Intelligence: The UIB Theoretical Framework DOI 4/10 58stabilfr·wdophcgmx

Badge	Metric	Value	Status	Description
[s]	Reviewed Sources	11%	○	≥80% from editorially reviewed sources
[t]	Trusted	74%	○	≥80% from verified, high-quality sources
[a]	DOI	42%	○	≥80% have a Digital Object Identifier
[b]	CrossRef	5%	○	≥80% indexed in CrossRef
[i]	Indexed	68%	○	≥80% have metadata indexed
[l]	Academic	63%	○	≥80% from journals/conferences/preprints
[f]	Free Access	74%	○	≥80% are freely accessible
[r]	References	19 refs	✓	Minimum 10 references required
[w]	Words [REQ]	2,086	✓	Minimum 2,000 words for a full research article. Current: 2,086
[d]	DOI [REQ]	✓	✓	Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19064304
[o]	ORCID [REQ]	✓	✓	Author ORCID verified for academic identity
[p]	Peer Reviewed [REQ]	—	✗	Peer reviewed by an assigned reviewer
[h]	Freshness [REQ]	33%	✗	≥60% of references from 2025–2026. Current: 33%
[c]	Data Charts	0	○	Original data charts from reproducible analysis (min 2). Current: 0
[g]	Code	—	○	Source code available on GitHub
[m]	Diagrams	3	✓	Mermaid architecture/flow diagrams. Current: 3
[x]	Cited by	0	○	Referenced by 0 other hub article(s)

Score = Ref Trust (62 × 60%) + Required (3/5 × 30%) + Optional (1/4 × 10%)

Meta-Research · Mar 17, 2026 · 10 min read

Causal Intelligence as a UIB Dimension: Measuring What Models Actually Understand DOI 3/10 52stabilfr·wdophcgmx

Badge	Metric	Value	Status	Description
[s]	Reviewed Sources	7%	○	≥80% from editorially reviewed sources
[t]	Trusted	73%	○	≥80% from verified, high-quality sources
[a]	DOI	53%	○	≥80% have a Digital Object Identifier
[b]	CrossRef	7%	○	≥80% indexed in CrossRef
[i]	Indexed	53%	○	≥80% have metadata indexed
[l]	Academic	80%	✓	≥80% from journals/conferences/preprints
[f]	Free Access	93%	✓	≥80% are freely accessible
[r]	References	15 refs	✓	Minimum 10 references required
[w]	Words [REQ]	1,942	✗	Minimum 2,000 words for a full research article. Current: 1,942
[d]	DOI [REQ]	✓	✓	Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19102383
[o]	ORCID [REQ]	✓	✓	Author ORCID verified for academic identity
[p]	Peer Reviewed [REQ]	—	✗	Peer reviewed by an assigned reviewer
[h]	Freshness [REQ]	27%	✗	≥60% of references from 2025–2026. Current: 27%
[c]	Data Charts	0	○	Original data charts from reproducible analysis (min 2). Current: 0
[g]	Code	—	○	Source code available on GitHub
[m]	Diagrams	3	✓	Mermaid architecture/flow diagrams. Current: 3
[x]	Cited by	0	○	Referenced by 0 other hub article(s)

Score = Ref Trust (63 × 60%) + Required (2/5 × 30%) + Optional (1/4 × 10%)

Meta-Research · Mar 18, 2026 · 10 min read

Embodied Intelligence as a UIB Dimension: Why Physical Grounding Is the Missing Benchmark DOI 5/10 65stabilfr·wdophcgmx

Badge	Metric	Value	Status	Description
[s]	Reviewed Sources	4%	○	≥80% from editorially reviewed sources
[t]	Trusted	92%	✓	≥80% from verified, high-quality sources
[a]	DOI	42%	○	≥80% have a Digital Object Identifier
[b]	CrossRef	0%	○	≥80% indexed in CrossRef
[i]	Indexed	92%	✓	≥80% have metadata indexed
[l]	Academic	79%	○	≥80% from journals/conferences/preprints
[f]	Free Access	96%	✓	≥80% are freely accessible
[r]	References	24 refs	✓	Minimum 10 references required
[w]	Words [REQ]	2,990	✓	Minimum 2,000 words for a full research article. Current: 2,990
[d]	DOI [REQ]	✓	✓	Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19135583
[o]	ORCID [REQ]	✓	✓	Author ORCID verified for academic identity
[p]	Peer Reviewed [REQ]	—	✗	Peer reviewed by an assigned reviewer
[h]	Freshness [REQ]	19%	✗	≥60% of references from 2025–2026. Current: 19%
[c]	Data Charts	0	○	Original data charts from reproducible analysis (min 2). Current: 0
[g]	Code	—	○	Source code available on GitHub
[m]	Diagrams	3	✓	Mermaid architecture/flow diagrams. Current: 3
[x]	Cited by	0	○	Referenced by 0 other hub article(s)

Score = Ref Trust (74 × 60%) + Required (3/5 × 30%) + Optional (1/4 × 10%)

Meta-Research · Mar 20, 2026 · 15 min read

Temporal and Planning Intelligence as a UIB Dimension: Why Horizon Length Breaks Modern Reasoning Models DOI 3/10 74stabilfr·wdophcgmx

Badge	Metric	Value	Status	Description
[s]	Reviewed Sources	0%	○	≥80% from editorially reviewed sources
[t]	Trusted	88%	✓	≥80% from verified, high-quality sources
[a]	DOI	63%	○	≥80% have a Digital Object Identifier
[b]	CrossRef	0%	○	≥80% indexed in CrossRef
[i]	Indexed	88%	✓	≥80% have metadata indexed
[l]	Academic	69%	○	≥80% from journals/conferences/preprints
[f]	Free Access	100%	✓	≥80% are freely accessible
[r]	References	16 refs	✓	Minimum 10 references required
[w]	Words [REQ]	2,347	✓	Minimum 2,000 words for a full research article. Current: 2,347
[d]	DOI [REQ]	✓	✓	Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19207333
[o]	ORCID [REQ]	✓	✓	Author ORCID verified for academic identity
[p]	Peer Reviewed [REQ]	—	✗	Peer reviewed by an assigned reviewer
[h]	Freshness [REQ]	62%	✓	≥60% of references from 2025–2026. Current: 62%
[c]	Data Charts	4	✓	Original data charts from reproducible analysis (min 2). Current: 4
[g]	Code	—	○	Source code available on GitHub
[m]	Diagrams	3	✓	Mermaid architecture/flow diagrams. Current: 3
[x]	Cited by	0	○	Referenced by 0 other hub article(s)

Score = Ref Trust (75 × 60%) + Required (4/5 × 30%) + Optional (2/4 × 10%)

Meta-Research · Mar 24, 2026 · 12 min read

Social and Collaborative Intelligence as a UIB Dimension: Why Theory of Mind Remains the Hardest Benchmark DOI 4/10 62stabilfr·wdophcgmx

Badge	Metric	Value	Status	Description
[s]	Reviewed Sources	6%	○	≥80% from editorially reviewed sources
[t]	Trusted	88%	✓	≥80% from verified, high-quality sources
[a]	DOI	18%	○	≥80% have a Digital Object Identifier
[b]	CrossRef	0%	○	≥80% indexed in CrossRef
[i]	Indexed	88%	✓	≥80% have metadata indexed
[l]	Academic	76%	○	≥80% from journals/conferences/preprints
[f]	Free Access	100%	✓	≥80% are freely accessible
[r]	References	17 refs	✓	Minimum 10 references required
[w]	Words [REQ]	2,274	✓	Minimum 2,000 words for a full research article. Current: 2,274
[d]	DOI [REQ]	✓	✓	Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19209792
[o]	ORCID [REQ]	✓	✓	Author ORCID verified for academic identity
[p]	Peer Reviewed [REQ]	—	✗	Peer reviewed by an assigned reviewer
[h]	Freshness [REQ]	13%	✗	≥60% of references from 2025–2026. Current: 13%
[c]	Data Charts	4	✓	Original data charts from reproducible analysis (min 2). Current: 4
[g]	Code	—	○	Source code available on GitHub
[m]	Diagrams	3	✓	Mermaid architecture/flow diagrams. Current: 3
[x]	Cited by	0	○	Referenced by 0 other hub article(s)

Score = Ref Trust (65 × 60%) + Required (3/5 × 30%) + Optional (2/4 × 10%)

Meta-Research · Mar 24, 2026 · 11 min read

Efficiency as Intelligence: The Resource-Normalized Score for Universal Benchmarking DOI 4/10 62stabilfr·wdophcgmx

Badge	Metric	Value	Status	Description
[s]	Reviewed Sources	0%	○	≥80% from editorially reviewed sources
[t]	Trusted	72%	○	≥80% from verified, high-quality sources
[a]	DOI	56%	○	≥80% have a Digital Object Identifier
[b]	CrossRef	6%	○	≥80% indexed in CrossRef
[i]	Indexed	78%	○	≥80% have metadata indexed
[l]	Academic	56%	○	≥80% from journals/conferences/preprints
[f]	Free Access	89%	✓	≥80% are freely accessible
[r]	References	18 refs	✓	Minimum 10 references required
[w]	Words [REQ]	2,310	✓	Minimum 2,000 words for a full research article. Current: 2,310
[d]	DOI [REQ]	✓	✓	Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19223497
[o]	ORCID [REQ]	✓	✓	Author ORCID verified for academic identity
[p]	Peer Reviewed [REQ]	—	✗	Peer reviewed by an assigned reviewer
[h]	Freshness [REQ]	57%	✗	≥60% of references from 2025–2026. Current: 57%
[c]	Data Charts	4	✓	Original data charts from reproducible analysis (min 2). Current: 4
[g]	Code	—	○	Source code available on GitHub
[m]	Diagrams	3	✓	Mermaid architecture/flow diagrams. Current: 3
[x]	Cited by	0	○	Referenced by 0 other hub article(s)

Score = Ref Trust (65 × 60%) + Required (3/5 × 30%) + Optional (2/4 × 10%)

Meta-Research · Mar 25, 2026 · 12 min read

The UIB Composite Score: Integrating Eight Intelligence Dimensions into a Unified Benchmark DOI 3/10 63stabilfr·wdophcgmx

Badge	Metric	Value	Status	Description
[s]	Reviewed Sources	7%	○	≥80% from editorially reviewed sources
[t]	Trusted	86%	✓	≥80% from verified, high-quality sources
[a]	DOI	71%	○	≥80% have a Digital Object Identifier
[b]	CrossRef	7%	○	≥80% indexed in CrossRef
[i]	Indexed	79%	○	≥80% have metadata indexed
[l]	Academic	71%	○	≥80% from journals/conferences/preprints
[f]	Free Access	100%	✓	≥80% are freely accessible
[r]	References	14 refs	✓	Minimum 10 references required
[w]	Words [REQ]	1,969	✗	Minimum 2,000 words for a full research article. Current: 1,969
[d]	DOI [REQ]	✓	✓	Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19238245
[o]	ORCID [REQ]	✓	✓	Author ORCID verified for academic identity
[p]	Peer Reviewed [REQ]	—	✗	Peer reviewed by an assigned reviewer
[h]	Freshness [REQ]	25%	✗	≥60% of references from 2025–2026. Current: 25%
[c]	Data Charts	5	✓	Original data charts from reproducible analysis (min 2). Current: 5
[g]	Code	—	○	Source code available on GitHub
[m]	Diagrams	3	✓	Mermaid architecture/flow diagrams. Current: 3
[x]	Cited by	0	○	Referenced by 0 other hub article(s)

Score = Ref Trust (76 × 60%) + Required (2/5 × 30%) + Optional (2/4 × 10%)

Meta-Research · Mar 26, 2026 · 10 min read

The UIB Open-Source Benchmark Suite: Architecture, Reproducibility Guarantees, and Community Validation Protocol DOI 3/10 71stabilfr·wdophcgmx

Badge	Metric	Value	Status	Description
[s]	Reviewed Sources	0%	○	≥80% from editorially reviewed sources
[t]	Trusted	88%	✓	≥80% from verified, high-quality sources
[a]	DOI	69%	○	≥80% have a Digital Object Identifier
[b]	CrossRef	0%	○	≥80% indexed in CrossRef
[i]	Indexed	81%	✓	≥80% have metadata indexed
[l]	Academic	69%	○	≥80% from journals/conferences/preprints
[f]	Free Access	100%	✓	≥80% are freely accessible
[r]	References	16 refs	✓	Minimum 10 references required
[w]	Words [REQ]	2,652	✓	Minimum 2,000 words for a full research article. Current: 2,652
[d]	DOI [REQ]	✓	✓	Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19266345
[o]	ORCID [REQ]	✓	✓	Author ORCID verified for academic identity
[p]	Peer Reviewed [REQ]	—	✗	Peer reviewed by an assigned reviewer
[h]	Freshness [REQ]	15%	✗	≥60% of references from 2025–2026. Current: 15%
[c]	Data Charts	5	✓	Original data charts from reproducible analysis (min 2). Current: 5
[g]	Code	✓	✓	Source code available on GitHub
[m]	Diagrams	3	✓	Mermaid architecture/flow diagrams. Current: 3
[x]	Cited by	0	○	Referenced by 0 other hub article(s)

Score = Ref Trust (75 × 60%) + Required (3/5 × 30%) + Optional (3/4 × 10%)

Meta-Research · Mar 27, 2026 · 13 min read

The Future of Intelligence Measurement: A 10-Year Projection DOI 3/10 77stabilfr·wdophcgmx

Badge	Metric	Value	Status	Description
[s]	Reviewed Sources	9%	○	≥80% from editorially reviewed sources
[t]	Trusted	86%	✓	≥80% from verified, high-quality sources
[a]	DOI	64%	○	≥80% have a Digital Object Identifier
[b]	CrossRef	14%	○	≥80% indexed in CrossRef
[i]	Indexed	77%	○	≥80% have metadata indexed
[l]	Academic	82%	✓	≥80% from journals/conferences/preprints
[f]	Free Access	95%	✓	≥80% are freely accessible
[r]	References	22 refs	✓	Minimum 10 references required
[w]	Words [REQ]	2,294	✓	Minimum 2,000 words for a full research article. Current: 2,294
[d]	DOI [REQ]	✓	✓	Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19375898
[o]	ORCID [REQ]	✓	✓	Author ORCID verified for academic identity
[p]	Peer Reviewed [REQ]	—	✗	Peer reviewed by an assigned reviewer
[h]	Freshness [REQ]	65%	✓	≥60% of references from 2025–2026. Current: 65%
[c]	Data Charts	5	✓	Original data charts from reproducible analysis (min 2). Current: 5
[g]	Code	✓	✓	Source code available on GitHub
[m]	Diagrams	3	✓	Mermaid architecture/flow diagrams. Current: 3
[x]	Cited by	0	○	Referenced by 0 other hub article(s)

Score = Ref Trust (76 × 60%) + Required (4/5 × 30%) + Optional (3/4 × 10%)

Meta-Research · Apr 1, 2026 · 11 min read

The UIB Composite Score: Integration Across All Dimensions DOI 3/10 65stabilfr·wdophcgmx

Badge	Metric	Value	Status	Description
[s]	Reviewed Sources	0%	○	≥80% from editorially reviewed sources
[t]	Trusted	78%	○	≥80% from verified, high-quality sources
[a]	DOI	44%	○	≥80% have a Digital Object Identifier
[b]	CrossRef	0%	○	≥80% indexed in CrossRef
[i]	Indexed	28%	○	≥80% have metadata indexed
[l]	Academic	67%	○	≥80% from journals/conferences/preprints
[f]	Free Access	100%	✓	≥80% are freely accessible
[r]	References	18 refs	✓	Minimum 10 references required
[w]	Words [REQ]	2,294	✓	Minimum 2,000 words for a full research article. Current: 2,294
[d]	DOI [REQ]	✓	✓	Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19423466
[o]	ORCID [REQ]	✓	✓	Author ORCID verified for academic identity
[p]	Peer Reviewed [REQ]	—	✗	Peer reviewed by an assigned reviewer
[h]	Freshness [REQ]	69%	✓	≥60% of references from 2025–2026. Current: 69%
[c]	Data Charts	3	✓	Original data charts from reproducible analysis (min 2). Current: 3
[g]	Code	✓	✓	Source code available on GitHub
[m]	Diagrams	3	✓	Mermaid architecture/flow diagrams. Current: 3
[x]	Cited by	0	○	Referenced by 0 other hub article(s)

Score = Ref Trust (55 × 60%) + Required (4/5 × 30%) + Optional (3/4 × 10%)

Meta-Research · Apr 4, 2026 · 11 min read

UIB Open-Source Benchmark Suite: Evaluation Protocol, Reproducibility Guarantees, and Community Validation DOI 2/10 64stabilfr·wdophcgmx

Badge	Metric	Value	Status	Description
[s]	Reviewed Sources	0%	○	≥80% from editorially reviewed sources
[t]	Trusted	78%	○	≥80% from verified, high-quality sources
[a]	DOI	56%	○	≥80% have a Digital Object Identifier
[b]	CrossRef	0%	○	≥80% indexed in CrossRef
[i]	Indexed	17%	○	≥80% have metadata indexed
[l]	Academic	56%	○	≥80% from journals/conferences/preprints
[f]	Free Access	100%	✓	≥80% are freely accessible
[r]	References	18 refs	✓	Minimum 10 references required
[w]	Words [REQ]	2,146	✓	Minimum 2,000 words for a full research article. Current: 2,146
[d]	DOI [REQ]	✓	✓	Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19425176
[o]	ORCID [REQ]	✓	✓	Author ORCID verified for academic identity
[p]	Peer Reviewed [REQ]	—	✗	Peer reviewed by an assigned reviewer
[h]	Freshness [REQ]	73%	✓	≥60% of references from 2025–2026. Current: 73%
[c]	Data Charts	3	✓	Original data charts from reproducible analysis (min 2). Current: 3
[g]	Code	✓	✓	Source code available on GitHub
[m]	Diagrams	3	✓	Mermaid architecture/flow diagrams. Current: 3
[x]	Cited by	0	○	Referenced by 0 other hub article(s)

Score = Ref Trust (54 × 60%) + Required (4/5 × 30%) + Optional (3/4 × 10%)

Meta-Research · Apr 5, 2026 · 11 min read

Embodied Intelligence as a UIB Dimension: Measurement Framework and Evaluation Protocol DOI 2/10 63stabilfr·wdophcgmx

Badge	Metric	Value	Status	Description
[s]	Reviewed Sources	11%	○	≥80% from editorially reviewed sources
[t]	Trusted	89%	✓	≥80% from verified, high-quality sources
[a]	DOI	83%	✓	≥80% have a Digital Object Identifier
[b]	CrossRef	11%	○	≥80% indexed in CrossRef
[i]	Indexed	17%	○	≥80% have metadata indexed
[l]	Academic	89%	✓	≥80% from journals/conferences/preprints
[f]	Free Access	100%	✓	≥80% are freely accessible
[r]	References	18 refs	✓	Minimum 10 references required
[w]	Words [REQ]	1,168	✗	Minimum 2,000 words for a full research article. Current: 1,168
[d]	DOI [REQ]	✓	✓	Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19759259
[o]	ORCID [REQ]	✓	✓	Author ORCID verified for academic identity
[p]	Peer Reviewed [REQ]	—	✗	Peer reviewed by an assigned reviewer
[h]	Freshness [REQ]	67%	✓	≥60% of references from 2025–2026. Current: 67%
[c]	Data Charts	0	○	Original data charts from reproducible analysis (min 2). Current: 0
[g]	Code	—	○	Source code available on GitHub
[m]	Diagrams	2	✓	Mermaid architecture/flow diagrams. Current: 2
[x]	Cited by	0	○	Referenced by 0 other hub article(s)

Score = Ref Trust (70 × 60%) + Required (3/5 × 30%) + Optional (1/4 × 10%)

Meta-Research · Apr 25, 2026 · 6 min read

14 published5,078 total views159 min total readingMar 2026 – Apr 2026 published