Skip to content

Stabilarity Hub

Menu
  • Home
  • Research
    • Healthcare & Life Sciences
      • Medical ML Diagnosis
    • Enterprise & Economics
      • AI Economics
      • Cost-Effective AI
      • Spec-Driven AI
    • Geopolitics & Strategy
      • Anticipatory Intelligence
      • Future of AI
      • Geopolitical Risk Intelligence
    • AI & Future Signals
      • Capability–Adoption Gap
      • AI Observability
      • AI Intelligence Architecture
      • AI Memory
      • Trusted Open Source
    • Data Science & Methods
      • HPF-P Framework
      • Intellectual Data Analysis
      • Reference Evaluation
    • Publications
      • External Publications
    • Robotics & Engineering
      • Open Humanoid
      • Open Starship
    • Benchmarks & Measurement
      • Universal Intelligence Benchmark
      • Shadow Economy Dynamics
      • Article Quality Science
  • Tools
    • Healthcare & Life Sciences
      • ScanLab
      • AI Data Readiness Assessment
    • Enterprise Strategy
      • AI Use Case Classifier
      • ROI Calculator
      • Risk Calculator
      • Reference Trust Analyzer
    • Portfolio & Analytics
      • HPF Portfolio Optimizer
      • Adoption Gap Monitor
      • Data Mining Method Selector
    • Geopolitics & Prediction
      • War Prediction Model
      • Ukraine Crisis Prediction
      • Gap Analyzer
      • Geopolitical Stability Dashboard
    • Technical & Observability
      • OTel AI Inspector
    • Robotics & Engineering
      • Humanoid Simulation
    • Benchmarks
      • UIB Benchmark Tool
    • Article Evaluator
    • Open Starship Simulation
  • API Gateway
  • About
    • Contributors
  • Contact
  • Join Community
  • Terms of Service
  • Login
  • Register
Menu

Category: Universal Intelligence Benchmark

Inference-agnostic intelligence measurement framework. Meta-research and novel benchmarks for AI beyond text generation.

Embodied Intelligence as a UIB Dimension: Measurement Framework and Evaluation Protocol

Posted on April 25, 2026 by
Benchmark Research
Benchmark Research by Oleh Ivchenko  ·  DOI: 10.5281/zenodo.19759259  68stabilfr·wdophcgmx
BadgeMetricValueStatusDescription
[s]Reviewed Sources13%○≥80% from editorially reviewed sources
[t]Trusted100%✓≥80% from verified, high-quality sources
[a]DOI94%✓≥80% have a Digital Object Identifier
[b]CrossRef13%○≥80% indexed in CrossRef
[i]Indexed19%○≥80% have metadata indexed
[l]Academic100%✓≥80% from journals/conferences/preprints
[f]Free Access100%✓≥80% are freely accessible
[r]References16 refs✓Minimum 10 references required
[w]Words [REQ]1,162✗Minimum 2,000 words for a full research article. Current: 1,162
[d]DOI [REQ]✓✓Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19759259
[o]ORCID [REQ]✓✓Author ORCID verified for academic identity
[p]Peer Reviewed [REQ]—✗Peer reviewed by an assigned reviewer
[h]Freshness [REQ]80%✓≥60% of references from 2025–2026. Current: 80%
[c]Data Charts0○Original data charts from reproducible analysis (min 2). Current: 0
[g]Code—○Source code available on GitHub
[m]Diagrams2✓Mermaid architecture/flow diagrams. Current: 2
[x]Cited by0○Referenced by 0 other hub article(s)
Score = Ref Trust (79 × 60%) + Required (3/5 × 30%) + Optional (1/4 × 10%)

The Universal Intelligence Benchmark (UIB) proposes an eight-dimensional, cost-normalized framework for measuring intelligence across diverse AI systems. This article operationalizes the second UIB dimension — Embodied Intelligence (Dembodied) — defining it as the capacity for intelligent behavior arising from physical interaction with an environment, encompassing spatial reasoning, physics und...

Show moreHide
Benchmark Research by Oleh Ivchenko DOI: 10.5281/zenodo.19759259 68stabilfr·wdophcgmx
BadgeMetricValueStatusDescription
[s]Reviewed Sources13%○≥80% from editorially reviewed sources
[t]Trusted100%✓≥80% from verified, high-quality sources
[a]DOI94%✓≥80% have a Digital Object Identifier
[b]CrossRef13%○≥80% indexed in CrossRef
[i]Indexed19%○≥80% have metadata indexed
[l]Academic100%✓≥80% from journals/conferences/preprints
[f]Free Access100%✓≥80% are freely accessible
[r]References16 refs✓Minimum 10 references required
[w]Words [REQ]1,162✗Minimum 2,000 words for a full research article. Current: 1,162
[d]DOI [REQ]✓✓Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19759259
[o]ORCID [REQ]✓✓Author ORCID verified for academic identity
[p]Peer Reviewed [REQ]—✗Peer reviewed by an assigned reviewer
[h]Freshness [REQ]80%✓≥60% of references from 2025–2026. Current: 80%
[c]Data Charts0○Original data charts from reproducible analysis (min 2). Current: 0
[g]Code—○Source code available on GitHub
[m]Diagrams2✓Mermaid architecture/flow diagrams. Current: 2
[x]Cited by0○Referenced by 0 other hub article(s)
Score = Ref Trust (79 × 60%) + Required (3/5 × 30%) + Optional (1/4 × 10%)
Universal Intellig…Read More
Read more

UIB Open-Source Benchmark Suite: Evaluation Protocol, Reproducibility Guarantees, and Community Validation

Posted on April 5, 2026April 5, 2026 by
Benchmark Research
Benchmark Research by Oleh Ivchenko  ·  DOI: 10.5281/zenodo.19425176  70stabilfr·wdophcgmx
BadgeMetricValueStatusDescription
[s]Reviewed Sources0%○≥80% from editorially reviewed sources
[t]Trusted93%✓≥80% from verified, high-quality sources
[a]DOI67%○≥80% have a Digital Object Identifier
[b]CrossRef0%○≥80% indexed in CrossRef
[i]Indexed20%○≥80% have metadata indexed
[l]Academic67%○≥80% from journals/conferences/preprints
[f]Free Access100%✓≥80% are freely accessible
[r]References15 refs✓Minimum 10 references required
[w]Words [REQ]2,146✓Minimum 2,000 words for a full research article. Current: 2,146
[d]DOI [REQ]✓✓Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19425176
[o]ORCID [REQ]✓✓Author ORCID verified for academic identity
[p]Peer Reviewed [REQ]—✗Peer reviewed by an assigned reviewer
[h]Freshness [REQ]92%✓≥60% of references from 2025–2026. Current: 92%
[c]Data Charts3✓Original data charts from reproducible analysis (min 2). Current: 3
[g]Code✓✓Source code available on GitHub
[m]Diagrams3✓Mermaid architecture/flow diagrams. Current: 3
[x]Cited by0○Referenced by 0 other hub article(s)
Score = Ref Trust (64 × 60%) + Required (4/5 × 30%) + Optional (3/4 × 10%)

The Universal Intelligence Benchmark (UIB) theoretical framework, dimensional taxonomy, and composite scoring formula have been developed across nine preceding articles in this series. This article completes the framework by presenting the UIB Open-Source Benchmark Suite — the concrete evaluation infrastructure that operationalizes those concepts for independent replication. We address three re...

Show moreHide
Benchmark Research by Oleh Ivchenko DOI: 10.5281/zenodo.19425176 70stabilfr·wdophcgmx
BadgeMetricValueStatusDescription
[s]Reviewed Sources0%○≥80% from editorially reviewed sources
[t]Trusted93%✓≥80% from verified, high-quality sources
[a]DOI67%○≥80% have a Digital Object Identifier
[b]CrossRef0%○≥80% indexed in CrossRef
[i]Indexed20%○≥80% have metadata indexed
[l]Academic67%○≥80% from journals/conferences/preprints
[f]Free Access100%✓≥80% are freely accessible
[r]References15 refs✓Minimum 10 references required
[w]Words [REQ]2,146✓Minimum 2,000 words for a full research article. Current: 2,146
[d]DOI [REQ]✓✓Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19425176
[o]ORCID [REQ]✓✓Author ORCID verified for academic identity
[p]Peer Reviewed [REQ]—✗Peer reviewed by an assigned reviewer
[h]Freshness [REQ]92%✓≥60% of references from 2025–2026. Current: 92%
[c]Data Charts3✓Original data charts from reproducible analysis (min 2). Current: 3
[g]Code✓✓Source code available on GitHub
[m]Diagrams3✓Mermaid architecture/flow diagrams. Current: 3
[x]Cited by0○Referenced by 0 other hub article(s)
Score = Ref Trust (64 × 60%) + Required (4/5 × 30%) + Optional (3/4 × 10%)
Universal Intellig…Read More
Read more

The UIB Composite Score: Integration Across All Dimensions

Posted on April 4, 2026 by
Benchmark Research
Benchmark Research by Oleh Ivchenko  ·  DOI: 10.5281/zenodo.19423466  68stabilfr·wdophcgmx
BadgeMetricValueStatusDescription
[s]Reviewed Sources0%○≥80% from editorially reviewed sources
[t]Trusted88%✓≥80% from verified, high-quality sources
[a]DOI50%○≥80% have a Digital Object Identifier
[b]CrossRef0%○≥80% indexed in CrossRef
[i]Indexed31%○≥80% have metadata indexed
[l]Academic75%○≥80% from journals/conferences/preprints
[f]Free Access100%✓≥80% are freely accessible
[r]References16 refs✓Minimum 10 references required
[w]Words [REQ]2,286✓Minimum 2,000 words for a full research article. Current: 2,286
[d]DOI [REQ]✓✓Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19423466
[o]ORCID [REQ]✓✓Author ORCID verified for academic identity
[p]Peer Reviewed [REQ]—✗Peer reviewed by an assigned reviewer
[h]Freshness [REQ]85%✓≥60% of references from 2025–2026. Current: 85%
[c]Data Charts3✓Original data charts from reproducible analysis (min 2). Current: 3
[g]Code✓✓Source code available on GitHub
[m]Diagrams3✓Mermaid architecture/flow diagrams. Current: 3
[x]Cited by0○Referenced by 0 other hub article(s)
Score = Ref Trust (61 × 60%) + Required (4/5 × 30%) + Optional (3/4 × 10%)

The Universal Intelligence Benchmark (UIB) has systematically developed eight intelligence dimensions over the course of this series: causal reasoning, embodied grounding, temporal planning, social cognition, resource efficiency, linguistic reasoning, multimodal perception, and meta-learning. This article presents the mathematical framework for integrating these dimensions into a single UIB Com...

Show moreHide
Benchmark Research by Oleh Ivchenko DOI: 10.5281/zenodo.19423466 68stabilfr·wdophcgmx
BadgeMetricValueStatusDescription
[s]Reviewed Sources0%○≥80% from editorially reviewed sources
[t]Trusted88%✓≥80% from verified, high-quality sources
[a]DOI50%○≥80% have a Digital Object Identifier
[b]CrossRef0%○≥80% indexed in CrossRef
[i]Indexed31%○≥80% have metadata indexed
[l]Academic75%○≥80% from journals/conferences/preprints
[f]Free Access100%✓≥80% are freely accessible
[r]References16 refs✓Minimum 10 references required
[w]Words [REQ]2,286✓Minimum 2,000 words for a full research article. Current: 2,286
[d]DOI [REQ]✓✓Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19423466
[o]ORCID [REQ]✓✓Author ORCID verified for academic identity
[p]Peer Reviewed [REQ]—✗Peer reviewed by an assigned reviewer
[h]Freshness [REQ]85%✓≥60% of references from 2025–2026. Current: 85%
[c]Data Charts3✓Original data charts from reproducible analysis (min 2). Current: 3
[g]Code✓✓Source code available on GitHub
[m]Diagrams3✓Mermaid architecture/flow diagrams. Current: 3
[x]Cited by0○Referenced by 0 other hub article(s)
Score = Ref Trust (61 × 60%) + Required (4/5 × 30%) + Optional (3/4 × 10%)
Universal Intellig…Read More
Read more

The Future of Intelligence Measurement: A 10-Year Projection

Posted on April 1, 2026 by
Benchmark Research
Benchmark Research by Oleh Ivchenko  ·  DOI: 10.5281/zenodo.19375898  81stabilfr·wdophcgmx
BadgeMetricValueStatusDescription
[s]Reviewed Sources10%○≥80% from editorially reviewed sources
[t]Trusted95%✓≥80% from verified, high-quality sources
[a]DOI70%○≥80% have a Digital Object Identifier
[b]CrossRef15%○≥80% indexed in CrossRef
[i]Indexed85%✓≥80% have metadata indexed
[l]Academic90%✓≥80% from journals/conferences/preprints
[f]Free Access95%✓≥80% are freely accessible
[r]References20 refs✓Minimum 10 references required
[w]Words [REQ]2,292✓Minimum 2,000 words for a full research article. Current: 2,292
[d]DOI [REQ]✓✓Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19375898
[o]ORCID [REQ]✓✓Author ORCID verified for academic identity
[p]Peer Reviewed [REQ]—✗Peer reviewed by an assigned reviewer
[h]Freshness [REQ]76%✓≥60% of references from 2025–2026. Current: 76%
[c]Data Charts5✓Original data charts from reproducible analysis (min 2). Current: 5
[g]Code✓✓Source code available on GitHub
[m]Diagrams3✓Mermaid architecture/flow diagrams. Current: 3
[x]Cited by0○Referenced by 0 other hub article(s)
Score = Ref Trust (83 × 60%) + Required (4/5 × 30%) + Optional (3/4 × 10%)

Intelligence measurement stands at a critical inflection point. The accelerating saturation of static benchmarks — with median time-to-saturation declining from five years in 2019 to under one year by 2025 — demands a fundamental rethinking of how we evaluate artificial intelligence. This article projects the evolution of AI evaluation paradigms over the next decade (2026-2035), analyzing three...

Show moreHide
Benchmark Research by Oleh Ivchenko DOI: 10.5281/zenodo.19375898 81stabilfr·wdophcgmx
BadgeMetricValueStatusDescription
[s]Reviewed Sources10%○≥80% from editorially reviewed sources
[t]Trusted95%✓≥80% from verified, high-quality sources
[a]DOI70%○≥80% have a Digital Object Identifier
[b]CrossRef15%○≥80% indexed in CrossRef
[i]Indexed85%✓≥80% have metadata indexed
[l]Academic90%✓≥80% from journals/conferences/preprints
[f]Free Access95%✓≥80% are freely accessible
[r]References20 refs✓Minimum 10 references required
[w]Words [REQ]2,292✓Minimum 2,000 words for a full research article. Current: 2,292
[d]DOI [REQ]✓✓Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19375898
[o]ORCID [REQ]✓✓Author ORCID verified for academic identity
[p]Peer Reviewed [REQ]—✗Peer reviewed by an assigned reviewer
[h]Freshness [REQ]76%✓≥60% of references from 2025–2026. Current: 76%
[c]Data Charts5✓Original data charts from reproducible analysis (min 2). Current: 5
[g]Code✓✓Source code available on GitHub
[m]Diagrams3✓Mermaid architecture/flow diagrams. Current: 3
[x]Cited by0○Referenced by 0 other hub article(s)
Score = Ref Trust (83 × 60%) + Required (4/5 × 30%) + Optional (3/4 × 10%)
Universal Intellig…Read More
Read more

The UIB Open-Source Benchmark Suite: Architecture, Reproducibility Guarantees, and Community Validation Protocol

Posted on March 27, 2026March 28, 2026 by
Benchmark Research
Benchmark Research by Oleh Ivchenko  ·  DOI: 10.5281/zenodo.19266345  78stabilfr·wdophcgmx
BadgeMetricValueStatusDescription
[s]Reviewed Sources0%○≥80% from editorially reviewed sources
[t]Trusted100%✓≥80% from verified, high-quality sources
[a]DOI85%✓≥80% have a Digital Object Identifier
[b]CrossRef0%○≥80% indexed in CrossRef
[i]Indexed92%✓≥80% have metadata indexed
[l]Academic85%✓≥80% from journals/conferences/preprints
[f]Free Access100%✓≥80% are freely accessible
[r]References13 refs✓Minimum 10 references required
[w]Words [REQ]2,650✓Minimum 2,000 words for a full research article. Current: 2,650
[d]DOI [REQ]✓✓Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19266345
[o]ORCID [REQ]✓✓Author ORCID verified for academic identity
[p]Peer Reviewed [REQ]—✗Peer reviewed by an assigned reviewer
[h]Freshness [REQ]20%✗≥60% of references from 2025–2026. Current: 20%
[c]Data Charts5✓Original data charts from reproducible analysis (min 2). Current: 5
[g]Code✓✓Source code available on GitHub
[m]Diagrams3✓Mermaid architecture/flow diagrams. Current: 3
[x]Cited by0○Referenced by 0 other hub article(s)
Score = Ref Trust (87 × 60%) + Required (3/5 × 30%) + Optional (3/4 × 10%)

Open-source benchmark frameworks have become the backbone of AI model evaluation, yet none provides simultaneous coverage of multidimensional intelligence measurement, inference cost normalization, and cryptographic reproducibility certification. This article presents the architecture and design rationale for the Universal Intelligence Benchmark (UIB) open-source suite, a modular evaluation fra...

Show moreHide
Benchmark Research by Oleh Ivchenko DOI: 10.5281/zenodo.19266345 78stabilfr·wdophcgmx
BadgeMetricValueStatusDescription
[s]Reviewed Sources0%○≥80% from editorially reviewed sources
[t]Trusted100%✓≥80% from verified, high-quality sources
[a]DOI85%✓≥80% have a Digital Object Identifier
[b]CrossRef0%○≥80% indexed in CrossRef
[i]Indexed92%✓≥80% have metadata indexed
[l]Academic85%✓≥80% from journals/conferences/preprints
[f]Free Access100%✓≥80% are freely accessible
[r]References13 refs✓Minimum 10 references required
[w]Words [REQ]2,650✓Minimum 2,000 words for a full research article. Current: 2,650
[d]DOI [REQ]✓✓Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19266345
[o]ORCID [REQ]✓✓Author ORCID verified for academic identity
[p]Peer Reviewed [REQ]—✗Peer reviewed by an assigned reviewer
[h]Freshness [REQ]20%✗≥60% of references from 2025–2026. Current: 20%
[c]Data Charts5✓Original data charts from reproducible analysis (min 2). Current: 5
[g]Code✓✓Source code available on GitHub
[m]Diagrams3✓Mermaid architecture/flow diagrams. Current: 3
[x]Cited by0○Referenced by 0 other hub article(s)
Score = Ref Trust (87 × 60%) + Required (3/5 × 30%) + Optional (3/4 × 10%)
Universal Intellig…Read More
Read more

The UIB Composite Score: Integrating Eight Intelligence Dimensions into a Unified Benchmark

Posted on March 26, 2026 by
Benchmark Research
Benchmark Research by Oleh Ivchenko  ·  DOI: 10.5281/zenodo.19238245  72stabilfr·wdophcgmx
BadgeMetricValueStatusDescription
[s]Reviewed Sources9%○≥80% from editorially reviewed sources
[t]Trusted100%✓≥80% from verified, high-quality sources
[a]DOI91%✓≥80% have a Digital Object Identifier
[b]CrossRef9%○≥80% indexed in CrossRef
[i]Indexed91%✓≥80% have metadata indexed
[l]Academic91%✓≥80% from journals/conferences/preprints
[f]Free Access100%✓≥80% are freely accessible
[r]References11 refs✓Minimum 10 references required
[w]Words [REQ]1,965✗Minimum 2,000 words for a full research article. Current: 1,965
[d]DOI [REQ]✓✓Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19238245
[o]ORCID [REQ]✓✓Author ORCID verified for academic identity
[p]Peer Reviewed [REQ]—✗Peer reviewed by an assigned reviewer
[h]Freshness [REQ]33%✗≥60% of references from 2025–2026. Current: 33%
[c]Data Charts5✓Original data charts from reproducible analysis (min 2). Current: 5
[g]Code—○Source code available on GitHub
[m]Diagrams3✓Mermaid architecture/flow diagrams. Current: 3
[x]Cited by0○Referenced by 0 other hub article(s)
Score = Ref Trust (91 × 60%) + Required (2/5 × 30%) + Optional (2/4 × 10%)

Current artificial intelligence benchmarks measure isolated capabilities — reasoning, coding, knowledge retrieval — yet no single metric captures the multidimensional nature of machine intelligence. This article presents the Universal Intelligence Benchmark (UIB) Composite Score, integrating eight previously defined intelligence dimensions (reasoning, causal, temporal, social, efficiency, trans...

Show moreHide
Benchmark Research by Oleh Ivchenko DOI: 10.5281/zenodo.19238245 72stabilfr·wdophcgmx
BadgeMetricValueStatusDescription
[s]Reviewed Sources9%○≥80% from editorially reviewed sources
[t]Trusted100%✓≥80% from verified, high-quality sources
[a]DOI91%✓≥80% have a Digital Object Identifier
[b]CrossRef9%○≥80% indexed in CrossRef
[i]Indexed91%✓≥80% have metadata indexed
[l]Academic91%✓≥80% from journals/conferences/preprints
[f]Free Access100%✓≥80% are freely accessible
[r]References11 refs✓Minimum 10 references required
[w]Words [REQ]1,965✗Minimum 2,000 words for a full research article. Current: 1,965
[d]DOI [REQ]✓✓Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19238245
[o]ORCID [REQ]✓✓Author ORCID verified for academic identity
[p]Peer Reviewed [REQ]—✗Peer reviewed by an assigned reviewer
[h]Freshness [REQ]33%✗≥60% of references from 2025–2026. Current: 33%
[c]Data Charts5✓Original data charts from reproducible analysis (min 2). Current: 5
[g]Code—○Source code available on GitHub
[m]Diagrams3✓Mermaid architecture/flow diagrams. Current: 3
[x]Cited by0○Referenced by 0 other hub article(s)
Score = Ref Trust (91 × 60%) + Required (2/5 × 30%) + Optional (2/4 × 10%)
Universal Intellig…Read More
Read more

Efficiency as Intelligence: The Resource-Normalized Score for Universal Benchmarking

Posted on March 25, 2026 by
Benchmark Research
Benchmark Research by Oleh Ivchenko  ·  DOI: 10.5281/zenodo.19223497  73stabilfr·wdophcgmx
BadgeMetricValueStatusDescription
[s]Reviewed Sources0%○≥80% from editorially reviewed sources
[t]Trusted80%✓≥80% from verified, high-quality sources
[a]DOI67%○≥80% have a Digital Object Identifier
[b]CrossRef7%○≥80% indexed in CrossRef
[i]Indexed87%✓≥80% have metadata indexed
[l]Academic67%○≥80% from journals/conferences/preprints
[f]Free Access87%✓≥80% are freely accessible
[r]References15 refs✓Minimum 10 references required
[w]Words [REQ]2,302✓Minimum 2,000 words for a full research article. Current: 2,302
[d]DOI [REQ]✓✓Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19223497
[o]ORCID [REQ]✓✓Author ORCID verified for academic identity
[p]Peer Reviewed [REQ]—✗Peer reviewed by an assigned reviewer
[h]Freshness [REQ]73%✓≥60% of references from 2025–2026. Current: 73%
[c]Data Charts4✓Original data charts from reproducible analysis (min 2). Current: 4
[g]Code—○Source code available on GitHub
[m]Diagrams3✓Mermaid architecture/flow diagrams. Current: 3
[x]Cited by0○Referenced by 0 other hub article(s)
Score = Ref Trust (74 × 60%) + Required (4/5 × 30%) + Optional (2/4 × 10%)

As large language models approach ceiling performance on standard benchmarks, the question shifts from "how smart is this model?" to "how smart is this model per unit of resource consumed?" This article proposes the UIB-Efficiency dimension — a resource-normalized intelligence score that integrates accuracy with computational cost, energy consumption, memory footprint, and inference latency. We...

Show moreHide
Benchmark Research by Oleh Ivchenko DOI: 10.5281/zenodo.19223497 73stabilfr·wdophcgmx
BadgeMetricValueStatusDescription
[s]Reviewed Sources0%○≥80% from editorially reviewed sources
[t]Trusted80%✓≥80% from verified, high-quality sources
[a]DOI67%○≥80% have a Digital Object Identifier
[b]CrossRef7%○≥80% indexed in CrossRef
[i]Indexed87%✓≥80% have metadata indexed
[l]Academic67%○≥80% from journals/conferences/preprints
[f]Free Access87%✓≥80% are freely accessible
[r]References15 refs✓Minimum 10 references required
[w]Words [REQ]2,302✓Minimum 2,000 words for a full research article. Current: 2,302
[d]DOI [REQ]✓✓Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19223497
[o]ORCID [REQ]✓✓Author ORCID verified for academic identity
[p]Peer Reviewed [REQ]—✗Peer reviewed by an assigned reviewer
[h]Freshness [REQ]73%✓≥60% of references from 2025–2026. Current: 73%
[c]Data Charts4✓Original data charts from reproducible analysis (min 2). Current: 4
[g]Code—○Source code available on GitHub
[m]Diagrams3✓Mermaid architecture/flow diagrams. Current: 3
[x]Cited by0○Referenced by 0 other hub article(s)
Score = Ref Trust (74 × 60%) + Required (4/5 × 30%) + Optional (2/4 × 10%)
Universal Intellig…Read More
Read more

Social and Collaborative Intelligence as a UIB Dimension: Why Theory of Mind Remains the Hardest Benchmark

Posted on March 24, 2026March 24, 2026 by
Benchmark Research
Benchmark Research by Oleh Ivchenko  ·  DOI: 10.5281/zenodo.19209792  67stabilfr·wdophcgmx
BadgeMetricValueStatusDescription
[s]Reviewed Sources7%○≥80% from editorially reviewed sources
[t]Trusted100%✓≥80% from verified, high-quality sources
[a]DOI20%○≥80% have a Digital Object Identifier
[b]CrossRef0%○≥80% indexed in CrossRef
[i]Indexed100%✓≥80% have metadata indexed
[l]Academic87%✓≥80% from journals/conferences/preprints
[f]Free Access100%✓≥80% are freely accessible
[r]References15 refs✓Minimum 10 references required
[w]Words [REQ]2,272✓Minimum 2,000 words for a full research article. Current: 2,272
[d]DOI [REQ]✓✓Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19209792
[o]ORCID [REQ]✓✓Author ORCID verified for academic identity
[p]Peer Reviewed [REQ]—✗Peer reviewed by an assigned reviewer
[h]Freshness [REQ]17%✗≥60% of references from 2025–2026. Current: 17%
[c]Data Charts4✓Original data charts from reproducible analysis (min 2). Current: 4
[g]Code—○Source code available on GitHub
[m]Diagrams3✓Mermaid architecture/flow diagrams. Current: 3
[x]Cited by0○Referenced by 0 other hub article(s)
Score = Ref Trust (73 × 60%) + Required (3/5 × 30%) + Optional (2/4 × 10%)

Current AI evaluation overwhelmingly focuses on individual cognitive tasks — reasoning, coding, mathematics — while neglecting the social and collaborative capabilities that define human intelligence in practice. This article introduces the UIB-Social dimension, a formal evaluation framework for measuring social intelligence in large language models across four sub-dimensions: Theory of Mind (T...

Show moreHide
Benchmark Research by Oleh Ivchenko DOI: 10.5281/zenodo.19209792 67stabilfr·wdophcgmx
BadgeMetricValueStatusDescription
[s]Reviewed Sources7%○≥80% from editorially reviewed sources
[t]Trusted100%✓≥80% from verified, high-quality sources
[a]DOI20%○≥80% have a Digital Object Identifier
[b]CrossRef0%○≥80% indexed in CrossRef
[i]Indexed100%✓≥80% have metadata indexed
[l]Academic87%✓≥80% from journals/conferences/preprints
[f]Free Access100%✓≥80% are freely accessible
[r]References15 refs✓Minimum 10 references required
[w]Words [REQ]2,272✓Minimum 2,000 words for a full research article. Current: 2,272
[d]DOI [REQ]✓✓Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19209792
[o]ORCID [REQ]✓✓Author ORCID verified for academic identity
[p]Peer Reviewed [REQ]—✗Peer reviewed by an assigned reviewer
[h]Freshness [REQ]17%✗≥60% of references from 2025–2026. Current: 17%
[c]Data Charts4✓Original data charts from reproducible analysis (min 2). Current: 4
[g]Code—○Source code available on GitHub
[m]Diagrams3✓Mermaid architecture/flow diagrams. Current: 3
[x]Cited by0○Referenced by 0 other hub article(s)
Score = Ref Trust (73 × 60%) + Required (3/5 × 30%) + Optional (2/4 × 10%)
Universal Intellig…Read More
Read more

Temporal and Planning Intelligence as a UIB Dimension: Why Horizon Length Breaks Modern Reasoning Models

Posted on March 24, 2026 by
Benchmark Research
Benchmark Research by Oleh Ivchenko  ·  DOI: 10.5281/zenodo.19207333  81stabilfr·wdophcgmx
BadgeMetricValueStatusDescription
[s]Reviewed Sources0%○≥80% from editorially reviewed sources
[t]Trusted100%✓≥80% from verified, high-quality sources
[a]DOI77%○≥80% have a Digital Object Identifier
[b]CrossRef0%○≥80% indexed in CrossRef
[i]Indexed100%✓≥80% have metadata indexed
[l]Academic85%✓≥80% from journals/conferences/preprints
[f]Free Access100%✓≥80% are freely accessible
[r]References13 refs✓Minimum 10 references required
[w]Words [REQ]2,339✓Minimum 2,000 words for a full research article. Current: 2,339
[d]DOI [REQ]✓✓Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19207333
[o]ORCID [REQ]✓✓Author ORCID verified for academic identity
[p]Peer Reviewed [REQ]—✗Peer reviewed by an assigned reviewer
[h]Freshness [REQ]80%✓≥60% of references from 2025–2026. Current: 80%
[c]Data Charts4✓Original data charts from reproducible analysis (min 2). Current: 4
[g]Code—○Source code available on GitHub
[m]Diagrams3✓Mermaid architecture/flow diagrams. Current: 3
[x]Cited by0○Referenced by 0 other hub article(s)
Score = Ref Trust (87 × 60%) + Required (4/5 × 30%) + Optional (2/4 × 10%)

Temporal reasoning and long-horizon planning represent perhaps the most consequential gap between current large language models and human cognitive capability. While frontier models achieve near-human performance on short planning tasks (under 15 steps), their accuracy degrades catastrophically beyond 25 planning steps — a phenomenon we term the horizon collapse. This article examines three res...

Show moreHide
Benchmark Research by Oleh Ivchenko DOI: 10.5281/zenodo.19207333 81stabilfr·wdophcgmx
BadgeMetricValueStatusDescription
[s]Reviewed Sources0%○≥80% from editorially reviewed sources
[t]Trusted100%✓≥80% from verified, high-quality sources
[a]DOI77%○≥80% have a Digital Object Identifier
[b]CrossRef0%○≥80% indexed in CrossRef
[i]Indexed100%✓≥80% have metadata indexed
[l]Academic85%✓≥80% from journals/conferences/preprints
[f]Free Access100%✓≥80% are freely accessible
[r]References13 refs✓Minimum 10 references required
[w]Words [REQ]2,339✓Minimum 2,000 words for a full research article. Current: 2,339
[d]DOI [REQ]✓✓Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19207333
[o]ORCID [REQ]✓✓Author ORCID verified for academic identity
[p]Peer Reviewed [REQ]—✗Peer reviewed by an assigned reviewer
[h]Freshness [REQ]80%✓≥60% of references from 2025–2026. Current: 80%
[c]Data Charts4✓Original data charts from reproducible analysis (min 2). Current: 4
[g]Code—○Source code available on GitHub
[m]Diagrams3✓Mermaid architecture/flow diagrams. Current: 3
[x]Cited by0○Referenced by 0 other hub article(s)
Score = Ref Trust (87 × 60%) + Required (4/5 × 30%) + Optional (2/4 × 10%)
Universal Intellig…Read More
Read more

Embodied Intelligence as a UIB Dimension: Why Physical Grounding Is the Missing Benchmark

Posted on March 20, 2026 by
Benchmark Research
Benchmark Research by Oleh Ivchenko  ·  DOI: 10.5281/zenodo.19135583  69stabilfr·wdophcgmx
BadgeMetricValueStatusDescription
[s]Reviewed Sources5%○≥80% from editorially reviewed sources
[t]Trusted100%✓≥80% from verified, high-quality sources
[a]DOI48%○≥80% have a Digital Object Identifier
[b]CrossRef0%○≥80% indexed in CrossRef
[i]Indexed100%✓≥80% have metadata indexed
[l]Academic90%✓≥80% from journals/conferences/preprints
[f]Free Access95%✓≥80% are freely accessible
[r]References21 refs✓Minimum 10 references required
[w]Words [REQ]2,982✓Minimum 2,000 words for a full research article. Current: 2,982
[d]DOI [REQ]✓✓Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19135583
[o]ORCID [REQ]✓✓Author ORCID verified for academic identity
[p]Peer Reviewed [REQ]—✗Peer reviewed by an assigned reviewer
[h]Freshness [REQ]22%✗≥60% of references from 2025–2026. Current: 22%
[c]Data Charts0○Original data charts from reproducible analysis (min 2). Current: 0
[g]Code—○Source code available on GitHub
[m]Diagrams3✓Mermaid architecture/flow diagrams. Current: 3
[x]Cited by0○Referenced by 0 other hub article(s)
Score = Ref Trust (81 × 60%) + Required (3/5 × 30%) + Optional (1/4 × 10%)

Current intelligence benchmarks evaluate AI systems as disembodied reasoners operating on text, images, and symbolic tasks detached from physical reality. This article introduces Embodied Intelligence as a formal dimension within the Universal Intelligence Benchmark (UIB) framework, arguing that any comprehensive measure of machine intelligence must assess a system's capacity for sensorimotor g...

Show moreHide
Benchmark Research by Oleh Ivchenko DOI: 10.5281/zenodo.19135583 69stabilfr·wdophcgmx
BadgeMetricValueStatusDescription
[s]Reviewed Sources5%○≥80% from editorially reviewed sources
[t]Trusted100%✓≥80% from verified, high-quality sources
[a]DOI48%○≥80% have a Digital Object Identifier
[b]CrossRef0%○≥80% indexed in CrossRef
[i]Indexed100%✓≥80% have metadata indexed
[l]Academic90%✓≥80% from journals/conferences/preprints
[f]Free Access95%✓≥80% are freely accessible
[r]References21 refs✓Minimum 10 references required
[w]Words [REQ]2,982✓Minimum 2,000 words for a full research article. Current: 2,982
[d]DOI [REQ]✓✓Zenodo DOI registered for persistent citation. DOI: 10.5281/zenodo.19135583
[o]ORCID [REQ]✓✓Author ORCID verified for academic identity
[p]Peer Reviewed [REQ]—✗Peer reviewed by an assigned reviewer
[h]Freshness [REQ]22%✗≥60% of references from 2025–2026. Current: 22%
[c]Data Charts0○Original data charts from reproducible analysis (min 2). Current: 0
[g]Code—○Source code available on GitHub
[m]Diagrams3✓Mermaid architecture/flow diagrams. Current: 3
[x]Cited by0○Referenced by 0 other hub article(s)
Score = Ref Trust (81 × 60%) + Required (3/5 × 30%) + Optional (1/4 × 10%)
Universal Intellig…Read More
Read more

Posts pagination

  • 1
  • 2
  • Next

Recent Posts

  • Interpretable Models vs Post-Hoc Explanations: True Cost Comparison for Enterprise AI
  • XAI Tool Economics: The Cost Structure of Explanation Generation
  • Transparent AI Sourcing: Build vs Buy Economics When Explanations Matter
  • XAI Observability: Monitoring Explainability Drift in Production Models
  • Manufacturing AI Observability: Monitoring Explanation Quality in Predictive Maintenance Systems

Research Index

Browse all articles — filter by score, badges, views, series →

Categories

  • ai
  • AI Economics
  • AI Memory
  • AI Observability & Monitoring
  • AI Portfolio Optimisation
  • Ancient IT History
  • Anticipatory Intelligence
  • Article Quality Science
  • Capability-Adoption Gap
  • Cost-Effective Enterprise AI
  • Future of AI
  • Geopolitical Risk Intelligence
  • hackathon
  • healthcare
  • HPF-P Framework
  • innovation
  • Intellectual Data Analysis
  • medai
  • Medical ML Diagnosis
  • Open Humanoid
  • Research
  • ScanLab
  • Shadow Economy Dynamics
  • Spec-Driven AI Development
  • Technology
  • Trusted Open Source
  • Uncategorized
  • Universal Intelligence Benchmark
  • War Prediction

About

Stabilarity Research Hub is dedicated to advancing the frontiers of AI, from Medical ML to Anticipatory Intelligence. Our mission is to build robust and efficient AI systems for a safer future.

Language

  • Medical ML Diagnosis
  • AI Economics
  • Cost-Effective AI
  • Anticipatory Intelligence
  • Data Mining
  • 🔑 API for Researchers

Connect

Facebook Group: Join

Telegram: @Y0man

Email: contact@stabilarity.com

© 2026 Stabilarity Research Hub

© 2026 Stabilarity Hub | Powered by Superbs Personal Blog theme
Stabilarity Research Hub

Open research platform for AI, machine learning, and enterprise technology. All articles are preprints with DOI registration via Zenodo.

185+
Articles
8
Series
DOI
Archived

Research Series

  • Medical ML Diagnosis
  • Anticipatory Intelligence
  • Intellectual Data Analysis
  • AI Economics
  • Cost-Effective AI
  • Spec-Driven AI

Community

  • Join Community
  • MedAI Hack
  • Zenodo Archive
  • Contact Us

Legal

  • Terms of Service
  • About Us
  • Contact
Operated by
Stabilarity OÜ
Registry: 17150040
Estonian Business Register →
© 2026 Stabilarity OÜ. Content licensed under CC BY 4.0
Terms About Contact
Language: 🇬🇧 EN 🇺🇦 UK 🇩🇪 DE 🇵🇱 PL 🇫🇷 FR
Display Settings
Theme
Light
Dark
Auto
Width
Default
Column
Wide
Text 100%

We use cookies to enhance your experience and analyze site traffic. By clicking "Accept All", you consent to our use of cookies. Read our Terms of Service for more information.