[agi] Pick the Bit and Competitive Computing Platform - Towards a New Benchmark for AGI System Performance

dissipate Sat, 07 Dec 2024 21:51:10 -0800

Pick the Bit and Competitive Computing Platform - Towards a New Benchmark for 
AGI System Performance
2024-12-07 Version 0.1.0
Steven W. Kane


1. The Pick the Bit Game
*1.1 Game Overview*
Pick the Bit (PtB) is a turn-based, multi-agent (minimum of 2 agents but 
theoretically an unlimited number of agents) game where agents compete by 
guessing a binary value—either 0 or 1—each round. The goal is to avoid picking 
the bit chosen by the majority of agents. Agents that pick the majority bit 
lose health (when their health goes to 0 or below, the agent 'dies' and is 
removed from the game), and the game continues until only one agent remains.
*1.2 Game Mechanics*
Health Dynamics:
 * Each agent starts with a fixed amount of health points (HP).
 * Agents that guess the majority bit lose health points equal to a 
predetermined loss value.
 * Agents that guess the minority bit retain their health.
 * Health loss scales asymptotically in later rounds, increasing the stakes 
over time. The reason for this is because earlier rounds of the game are more 
random and a loss should not incur as much health loss.
Random Noise Agents:
 * At a minimum, one random agent with infinite health is always present to 
enable tie breaks when there are only two agents left.
 * Additional random agents in addition to the single random agent can be added 
from the beginning to increase random noise and maintain unpredictability.
 * These random agents choose their bits pseudorandomly based on a 
cryptographically secure PRNG with a securely selected seed value.
Hidden Information:
 * The health levels of other agents and the number of agents choosing each bit 
are hidden, forcing agents to infer patterns and make strategic guesses.
The only four things that an agent receives as inputs each round are:
 * The current round number.
 * The majority and minority bits from the previous round.
 * The agent's own current health level.
 * The amount of health that will be lost for a loss of the next round (also, 
the health loss schedule will be passed to the agent at the beginning of the 
game at a minimum).
Incentivizing Monetary Rewards:
 * Each round, agents that survive collect tokens, representing an equal share 
of the health points lost by the defeated agents.
 * The total tokens accrued by an agent are not revealed to any of the agents 
at all (including the agent that is assigned the tokens), and do not give any 
advantage in the game.
 * The tokens an agent ends up with at the end of the game can be redeemed for 
monetary rewards by the team that owns the agent at the end of the game.
 * A percentage of the prize pool is reserved for the game winner, ensuring 
that strategic play and survival remain paramount.
Game Complexity:
 * PtB rewards non-random play by favoring agents that detect and exploit 
patterns in opponents' choices. Random strategies are penalized over time due 
to predictable health loss.
 * Under the token system, pure random play by an agent will strongly tend 
towards monetary loss since a percentage of the prize pool is pre-allocated to 
the winning agent.

2. Running PtB on C2P
*2.1 The Competitive Computing Platform (C2P)*
 * The Competitive Computing Platform (C2P) is an isolated, 
resource-constrained environment for executing AI-generated agents. It enforces 
standardization across competitions, ensuring fairness and reproducibility.
*2.2 Agent Constraints*
WASM WASI Modules:
 * All agents are submitted as WebAssembly (WASM) WASI modules, ensuring 
portability and security.
Resource Limits:
 * Memory: Limited to 4 GiB.
 * Fuel: Execution is capped using Wasmtime's fuel feature to ensure 
computational fairness.
 * No Networking Access: Agents are entirely sandboxed, removing external 
dependencies or external learning.
Game State Communication:
 * Agents receive game state updates via shared memory and submit their moves 
back through the same mechanism. No external communication is permitted, 
ensuring that all strategies are self-contained.
*2.3 C2P Architecture*
Broker and Hosts:
 * The game broker orchestrates competitions, communicating game state updates 
to agent hosts and logging outcomes.
Single-Node Execution:
 * For simplicity, C2P competitions can run on a single node with all 
components (broker, Kafka instance, WASM modules) co-located.
 * Turn-Based Execution:
 * Each turn, agents receive the game state and submit their moves 
asynchronously. The broker processes all moves, calculates health adjustments, 
and updates the game state for the next round.
*2.4 Benchmarking Independence*
 * PtB on C2P can benchmark any AGI system, independent of its architecture. 
The only requirement is that the AGI system generates a WASM WASI agent for the 
PtB game.
 * This architectural independence ensures that C2P provides a level playing 
field for all AGI systems, allowing researchers and developers to focus on 
algorithmic sophistication rather than hardware or language-specific 
implementations.

3. PtB and C2P as a Benchmark for AGI Performance
*3.1 Benchmarking AGI Through PtB*
Pick the Bit is designed to test core AGI capabilities:
Strategic Adaptation:
 * AGI systems must adapt to the shifting meta-game, learning and optimizing 
strategies with limited feedback.
 * Pattern Recognition:
 * Detecting and responding to subtle patterns in agent behavior and game state 
is critical for survival.
Robustness Under Constraints:
 * The WASM WASI sandbox ensures that agent performance is tied solely to its 
algorithmic sophistication, not hardware advantages.
*3.2 C2P as a Universal Standard*
Decoupling from Hardware:
 * By requiring agents to run on commodity hardware with standardized 
constraints, C2P removes externalities, enabling direct comparisons between AGI 
systems.
Interoperability:
 * WASM WASI ensures agents can be developed in any language that compiles to 
WASM, making C2P accessible to a wide range of researchers and organizations.
Transparent Competitions:
 * C2P logs all game state updates and agent moves, providing a fully auditable 
record of each competition.
*3.3 Meta-Learning and AGI Evaluation*
Dynamic Agent Generation:
 * PtB encourages the use of meta-learning systems that dynamically generate 
agents tailored to the game environment.
 * By iteratively refining agents through competitions, AGI systems can 
demonstrate their ability to generalize, adapt, and innovate.

4. Conclusion
Pick the Bit (PtB) and the Competitive Computing Platform (C2P) together 
represent a new frontier in AGI benchmarking. PtB's dynamic and evolving 
meta-game challenges agents to excel in adaptability, pattern recognition, and 
strategic thinking, while C2P provides a standardized, resource-constrained 
environment for fair competition. By isolating agent performance from hardware 
advantages and enabling reproducible evaluations, PtB and C2P offer a universal 
platform for AGI research and benchmarking, pushing the boundaries of what 
intelligent systems can achieve. Through these competitions, the AI community 
can foster innovation, collaboration, and progress toward truly general 
intelligence.

Software: 
https://github.com/Competitive-Computing-Network/c2n/tree/main/software (proof 
of concept is a work in progress)

------------------------------------------
Artificial General Intelligence List: AGI
Permalink: 
https://agi.topicbox.com/groups/agi/T705ed500a1a7e589-Me1ad0ce1aaceee9b152d2022
Delivery options: https://agi.topicbox.com/groups/agi/subscription

[agi] Pick the Bit and Competitive Computing Platform - Towards a New Benchmark for AGI System Performance

Reply via email to