The Universal Intelligence Benchmark (UIB) proposes an eight-dimensional, cost-normalized framework for measuring intelligence across diverse AI systems. This article operationalizes the second UIB dimension — Embodied Intelligence (D_embodied) — defining it as the capacity for intelligent behavior arising from physical interaction with an environment, encompassing spatial reasoning, physics und...
Category: Universal Intelligence Benchmark
Inference-agnostic intelligence measurement framework. Meta-research and novel benchmarks for AI beyond text generation.
UIB Open-Source Benchmark Suite: Evaluation Protocol, Reproducibility Guarantees, and Community Validation
The Universal Intelligence Benchmark (UIB) theoretical framework, dimensional taxonomy, and composite scoring formula have been developed across nine preceding articles in this series. This article completes the framework by presenting the UIB Open-Source Benchmark Suite — the concrete evaluation infrastructure that operationalizes those concepts for independent replication. We address three re...
The UIB Composite Score: Integration Across All Dimensions
The Universal Intelligence Benchmark (UIB) has systematically developed eight intelligence dimensions over the course of this series: causal reasoning, embodied grounding, temporal planning, social cognition, resource efficiency, linguistic reasoning, multimodal perception, and meta-learning. This article presents the mathematical framework for integrating these dimensions into a single UIB Com...
The Future of Intelligence Measurement: A 10-Year Projection
Intelligence measurement stands at a critical inflection point. The accelerating saturation of static benchmarks — with median time-to-saturation declining from five years in 2019 to under one year by 2025 — demands a fundamental rethinking of how we evaluate artificial intelligence. This article projects the evolution of AI evaluation paradigms over the next decade (2026-2035), analyzing three...
The UIB Open-Source Benchmark Suite: Architecture, Reproducibility Guarantees, and Community Validation Protocol
Open-source benchmark frameworks have become the backbone of AI model evaluation, yet none provides simultaneous coverage of multidimensional intelligence measurement, inference cost normalization, and cryptographic reproducibility certification. This article presents the architecture and design rationale for the Universal Intelligence Benchmark (UIB) open-source suite, a modular evaluation fra...
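The cryptographic reproducibility certification mentioned above can be illustrated with a minimal sketch: hash a canonical serialization of the run manifest so that two independent evaluators can compare fingerprints. The field names and hashing choices below are assumptions for illustration, not the UIB suite's actual schema or protocol.

```python
import hashlib
import json

def certify_run(manifest: dict) -> str:
    """Produce a deterministic SHA-256 fingerprint of an evaluation run.

    `manifest` bundles everything needed to replicate the run; the exact
    fields here (model, dataset_version, seed, scores) are illustrative.
    """
    # Canonical JSON (sorted keys, no extra whitespace) guarantees the
    # same manifest always serializes to the same byte string.
    canonical = json.dumps(manifest, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()

run = {
    "model": "example-model-v1",       # hypothetical model identifier
    "dataset_version": "uib-causal-0.3",  # hypothetical dataset tag
    "seed": 42,
    "scores": {"causal": 0.71, "temporal": 0.58},
}
print(certify_run(run))  # 64-character hex digest
```

Because the serialization is canonical, reordering the manifest's keys does not change the certificate, while changing any recorded value does.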
The UIB Composite Score: Integrating Eight Intelligence Dimensions into a Unified Benchmark
Current artificial intelligence benchmarks measure isolated capabilities — reasoning, coding, knowledge retrieval — yet no single metric captures the multidimensional nature of machine intelligence. This article presents the Universal Intelligence Benchmark (UIB) Composite Score, integrating eight previously defined intelligence dimensions (reasoning, causal, temporal, social, efficiency, trans...
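One plausible shape for such an integration, sketched here purely for illustration (the article defines the actual UIB formula), is a weighted geometric mean over the eight dimension scores: unlike an arithmetic mean, it rewards balanced capability profiles, since a near-zero score in any single dimension drags the composite toward zero. The dimension names follow the taxonomy given earlier in the series; the weights and scores are invented.

```python
import math

# Dimension taxonomy from the series; ordering is not significant.
DIMENSIONS = ["causal", "embodied", "temporal", "social",
              "efficiency", "linguistic", "multimodal", "meta_learning"]

def composite_score(scores, weights=None):
    """Weighted geometric mean of dimension scores in [0, 1].

    An illustrative aggregation, not necessarily the UIB formula.
    A floor of 1e-9 avoids log(0) when a dimension scores exactly zero.
    """
    weights = weights or {d: 1.0 for d in DIMENSIONS}
    total_w = sum(weights[d] for d in DIMENSIONS)
    log_sum = sum(weights[d] * math.log(max(scores[d], 1e-9))
                  for d in DIMENSIONS)
    return math.exp(log_sum / total_w)

balanced = {d: 0.8 for d in DIMENSIONS}
lopsided = dict(balanced, social=0.05)  # one weak dimension
print(composite_score(balanced), composite_score(lopsided))
```

With equal weights, a uniform 0.8 profile yields a composite of exactly 0.8, while the lopsided profile scores markedly lower despite a higher arithmetic mean than its weakest entry suggests.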
Efficiency as Intelligence: The Resource-Normalized Score for Universal Benchmarking
As large language models approach ceiling performance on standard benchmarks, the question shifts from "how smart is this model?" to "how smart is this model per unit of resource consumed?" This article proposes the UIB-Efficiency dimension — a resource-normalized intelligence score that integrates accuracy with computational cost, energy consumption, memory footprint, and inference latency. We...
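The shape of such a resource-normalized metric can be sketched as accuracy divided by a log-scaled composite cost over the four resource terms named above. This is an assumption-laden illustration only: the UIB-Efficiency formula itself is defined in the article, and log scaling is used here simply to keep any single very large resource term (e.g., FLOPs) from dominating the denominator.

```python
import math

def efficiency_score(accuracy, flops, energy_j, memory_gb, latency_s):
    """Accuracy per unit of log-scaled composite resource cost.

    Illustrative sketch, not the published UIB-Efficiency formula.
    log1p keeps each term finite at zero and compresses large values.
    """
    cost = (math.log1p(flops) + math.log1p(energy_j)
            + math.log1p(memory_gb) + math.log1p(latency_s))
    return accuracy / cost

# Two hypothetical models with equal accuracy: the cheaper one wins.
big   = efficiency_score(0.90, flops=1e12, energy_j=100.0,
                         memory_gb=8.0, latency_s=0.5)
small = efficiency_score(0.90, flops=1e10, energy_j=10.0,
                         memory_gb=2.0, latency_s=0.1)
print(big, small)
```

The design choice to divide rather than subtract makes the score scale-free in accuracy: doubling accuracy at fixed cost doubles the score.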
Social and Collaborative Intelligence as a UIB Dimension: Why Theory of Mind Remains the Hardest Benchmark
Current AI evaluation overwhelmingly focuses on individual cognitive tasks — reasoning, coding, mathematics — while neglecting the social and collaborative capabilities that define human intelligence in practice. This article introduces the UIB-Social dimension, a formal evaluation framework for measuring social intelligence in large language models across four sub-dimensions: Theory of Mind (T...
Temporal and Planning Intelligence as a UIB Dimension: Why Horizon Length Breaks Modern Reasoning Models
Temporal reasoning and long-horizon planning represent perhaps the most consequential gap between current large language models and human cognitive capability. While frontier models achieve near-human performance on short planning tasks (under 15 steps), their accuracy degrades catastrophically beyond 25 planning steps — a phenomenon we term the horizon collapse. This article examines three res...
Embodied Intelligence as a UIB Dimension: Why Physical Grounding Is the Missing Benchmark
Current intelligence benchmarks evaluate AI systems as disembodied reasoners operating on text, images, and symbolic tasks detached from physical reality. This article introduces Embodied Intelligence as a formal dimension within the Universal Intelligence Benchmark (UIB) framework, arguing that any comprehensive measure of machine intelligence must assess a system's capacity for sensorimotor g...