Eucalculia v6.3

Eucalculia v6.3 Guide

What Eucalculia trains

Eucalculia trains numerical cognition through two currently active modes: Numerical Sense and Relational Reasoning. Numerical Sense trains rapid recognition of numerical value across visual and symbolic representations. It includes true subitizing, but also structured value recognition in formats such as abaci, coins, Roman numerals, clocks and digits. Relational Reasoning trains the ability to encode values, retain them briefly, evaluate relations between them, compute derived values, and compare or synthesize conclusions across multiple episodes.

The current v6 architecture is deliberately narrower than older multi-mode versions: it keeps Mode A and Mode D because they are the two most coherent strands of the program. Mode A targets fast value recognition across perceptual, structured and symbolic number formats. Mode D implements a Posner-style delayed-question paradigm and adds hierarchical relational integration: L0, L1, L2 and L3. In the current build, Relational Reasoning defaults to Auto integration: the app introduces L1, L2 and L3 according to performance, with low-frequency preview probes before full activation.

Main screen and controls

Training Mode: choose Numerical Sense or Relational Reasoning. Numerical Sense shows one represented value and asks for the number it encodes. Relational Reasoning shows two represented values, hides the question until after the flash, and asks for a relation or a derived conclusion.
Relational Integration Levels: available only in Relational Reasoning. Auto is the recommended setting: L1, L2 and L3 unlock from recent performance and remain adaptively dosed. Manual L0/L1/L2/L3 settings are still available for isolating a specific layer. Level 0 gives standard Posner-style YES/NO relational trials. Level 1 adds numeric integration after two L0 trials. Level 2 adds four-option semantic conclusion tasks after two L1 trials. Level 3 adds synthesis across two L2 questions.
Block Size: number of trials before the summary screen. Auto is recommended in Relational Reasoning: it uses 20 trials normally and 30 when Meta³ is active or in preview. Manual Meta³ still clamps to at least 30 trials because a full L3 setup requires two complete L2 questions.
RT Target: shows the current response-time target. It is not a countdown. It is the threshold used to classify responses as fast or satisfactory for the current mode and level.
Representations: selects which visual formats may appear. All means all non-digit visual formats. Core selects the fastest, most rapid-recognition-friendly formats. None blocks Start. Digits uses only Arabic numerals.
Start Block: starts a new block and applies menu changes. You can also press Space from the menu for a fast start. Resume Block appears when a block is paused and continues the current block without changing its configuration.

Auto relational integration

Auto is the recommended setting for Relational Reasoning. In Auto, the user does not manually decide whether the block should reach L1, L2 or L3. The app starts from L0 and unlocks higher integration layers from recent performance.

Each meta level has three possible states:

Locked: the prerequisite layer has not yet produced enough evidence.
Preview: the user is close enough to the threshold that the next level appears occasionally as a low-frequency probe. Preview prevents the user from being trapped just below a hard cutoff, but it is not a full unlock.
Active: the prerequisite layer is stable enough for the level to appear normally, although Auto can still dose it down if performance becomes shaky.

The HUD shows the current Auto ceiling subtly. For example, META • AUTO means L1 is currently available; META² • PREVIEW • AUTO means L2 is being probed at low frequency; META • REDUCED • AUTO means a previously available layer is being retained at lower density during gradual decay. When a new Auto layer first becomes available, the app opens a fresh memory window: it does not immediately ask a question based on episodes shown before the unlock notice. The menu gives the detailed status for L1, L2 and L3, including the reason for each state.

Manual L0/L1/L2/L3 settings remain available for targeted practice, debugging or deliberate isolation of one layer. Manual mode applies the chosen ceiling directly and does not use Preview gating.

Keyboard, touch and review navigation

Number answers: use the on-screen numpad or keyboard digits. In early Numerical Sense ranges the on-screen 0 key is hidden because values below 10 never need it; it reappears when the range or task can require 0, 10 or 20.
YES/NO answers: tap YES/NO, press Y/N, or use left arrow for YES and right arrow for NO.
Four-option answers: read the four options in the frame, then tap 1–4 or press number keys 1–4. A/B/C/D remain keyboard aliases for compatibility.
Pause: press Escape or tap Menu. If the guide is open, Escape closes the guide first.
Review: after an error, the REVIEW phase shows the correct answer or the selected-versus-correct option. It advances automatically after a level-specific delay. You can also continue early by pressing Space or Enter, or by tapping/clicking anywhere in the app, after a short guard delay. Answer keys (digits, Y/N, 1–4/A–D aliases, arrows) are intentionally ignored during REVIEW so a key intended for the next trial is not silently consumed.

Response-time interpretation

Response time is measured in milliseconds with performance.now(). Numerical Sense accepts answers only after the stimulus and mask have finished. Relational L0 also accepts answers only after the two represented values and the mask have finished; at that moment the delayed question, answer controls and RT start together. For L1, L2 and L3, RT starts as soon as the question and answer controls appear. Reading and comprehension are therefore part of the measured relational task.

⚡ <300 msVery fast / likely automatic recognition

✓ <600 msFast target for Numerical Sense

○ <1800 msGood target for L0 relational trials

L1 <3800 msGood target for numeric integration

L2 <6500 msGood target for semantic conclusion selection

L3 <9000 msGood target for synthesis across prior conclusions

Do not treat every slow answer as failure. At L1–L3, slower correct answers can still be useful because the task involves retention, computation and language. The adaptive system uses both accuracy and speed, but accuracy remains the first requirement.

Timing engine, exposure time and mask

ET means exposure time: how long the stimulus remains visible. Earlier versions measured this in display frames, which produced different durations on 60 Hz, 90 Hz and 120 Hz screens. v6 measures prep, flash, mask and review phases in wall-clock milliseconds.

Prep: a short preparation phase before the stimulus. Flash: the visual stimulus is shown. Mask: a brief procedural noise mask interrupts visual persistence so the user cannot rely on afterimages. Input: the app accepts a response. Review: after errors, the app shows corrective feedback.

During Input, a very subtle blue halo and bottom line mark that responses are currently accepted. This cue is intentionally peripheral and non-green, so it signals readiness without being confused with correct-answer feedback. The adaptive training focus is deliberately not shown during active trials, because semantic labels such as equality, comparison or threshold relations can bias encoding in a delayed-question task.

In Relational Reasoning, a memory cue appears only when a later meta-question has already been probabilistically scheduled. On mobile, the cue is centered near the top of the frame rather than compressed into the corner, so it is visible without covering either stimulus. When a new MEM state appears or advances, the cue gives a brief visual ping and then remains still. It uses the target level and chain position: 🧠¹ MEM 1/2 for a pair feeding META, 🧠² MEM 1/2 for a META result feeding META², and 🧠³ MEM 1/2 for a META² question feeding META³. If a marked prerequisite is answered incorrectly, the review does not cancel the chain; it shows a compact semantic repair card with the corrected unit to retain. If the nominal block limit is reached while a MEM chain is active or due, the block enters a short drain phase and continues until the promised META / META² / META³ question has been answered.

The mask is not part of the answer. It is a visual interruption. It is generated dynamically with fine-grain noise so it is less blocky and less unpleasant than earlier masks while still preventing the stimulus from lingering visually.

How to interpret visual stimuli

Every stimulus represents a number. The core rule is: answer or reason about the quantity represented, not about whether the pictures look identical, unless the question explicitly asks about format or representation.

Same number? means same numerical value. A die showing five and a tally showing five are the same number.
Same format? means same visual representation type. Two dice are the same format; a die and a tally are different formats, even if both represent the same number.
Left / right: in relational mode, the first quantity is displayed on the left side and the second on the right side. Some wordings reverse the perspective, such as “Right > Left?” or “Right is smaller than left?”. Read the wording carefully.
Larger / smaller: these refer to numerical size, not visual size on the screen. If values tie, categories that need a unique larger/smaller value are avoided or handled by the generator.
Difference / distance: these mean absolute numerical distance. The difference between 3 and 8 is 5.
Even/odd type: same type means both even or both odd. Mixed type means one even and one odd.
Prime: 2, 3, 5, 7, 11, 13, 17, 19… are prime; 1 is not prime.
Square: a perfect square in the displayed range, such as 1, 4, 9 or 16.

Visual representations

Dots

Discrete dots arranged as an ungrouped quantity field. Use this representation for direct numerosity recognition: the answer is the total number of visible dots.

Cluster

Dots separated into compact, color-coded chunks. The chunks are there to support groupitizing: perceiving a total through subgroups in the subitizing range (typically 2–4 dots per group, 2–4 groups). When all groups have the same size — for example 3+3+3 or 4+4+4 — they share a single color and shape, so the total can be read off as a multiplication (3×3, 3×4) rather than a sum. When groups differ in size, colors and shapes diverge so the additive structure (4+3+2) stays visible. The answer is always the total number of dots, not the number of clusters.

Frame

A ten-frame or twenty-frame layout. Filled cells represent the quantity. Useful for seeing numbers as structured parts of 10 or 20.

Dice

Standard pip patterns. These support rapid recognition through familiar spatial configurations.

Domino

Two grouped halves. Read the total across both halves unless a relational question asks you to compare two separate displayed quantities.

Tally

Vertical marks grouped in fives. Four vertical strokes plus a diagonal stroke represent five.

Fingers

Raised fingers represent the quantity. One hand usually covers 1–5; two hands can represent larger values.

Abacus

A simplified positional representation. Pink/red beads on the upper rod = 5; green beads on the lower rod = 1. Combine them to obtain the displayed value: for example, two upper beads and three lower beads represent 10+3 = 13.

Cards

Playing-card style pips. Interpret the total number of pips, not the card suit.

Cubes

Isometric blocks arranged in a spatial layout. The target is the count of cubes, even when perspective makes the display less flat.

Coins

Coins use visual differences to encode value classes. Copper = 1, silver = 2, and gold = 5. Silver and gold coins have a reeded rim, and denomination also receives a mild non-numeric size cue: copper is slightly smaller, gold slightly larger. Gold adds a single inner ring. Denominations are not printed as numbers; infer the values from color, rim and size coding and add them. For example, one gold, one silver and two copper coins represent 5+2+1+1 = 9.

Roman numerals

Roman notation such as I, IV, V, IX and X. These are symbolic rather than pictorial, but useful for transfer to real-world notations.

Clock

A clock face or hour-hand style display. Read the indicated hour/number.

Grid

Rectangular arrangements that support factor, area and multiplication intuitions. The value is the number of filled cells or units.

Digits

Arabic numerals. This option removes visual quantity decoding and lets the user focus on relational logic or symbolic magnitude.

Mode A — Numerical Sense

Mode A shows a single represented numerical value. Type the number it encodes. The ideal strategy is to recognize value from structure, pattern or notation, not count item by item. With dots, dice or ten-frames this often means subitizing or groupitizing; with abaci, coins, clocks, Roman numerals or digits it means fast value recognition from a learned numerical code.

The app tracks exposure time separately for value×representation combinations.
Fast correct answers reduce exposure time. Errors increase exposure time.
Less-mastered values and confusions can reappear more often.
Level-up uses a dynamic global criterion rather than requiring near-total mastery of every active value×representation cell. The first jumps are deliberately lighter, then the criterion becomes stricter as the range grows. Every active value still needs coverage, and the current top value must be mastered in a small sufficient core of representations. Difficult representations can lag behind their own soft frontier, so a new global value does not appear everywhere at once.

Mode D — Level 0 relational evaluation

L0 is the base Posner-style task. Two quantities flash side by side. The question is hidden during the flash, then appears after the mask. Because the question is delayed, you must encode both quantities abstractly before knowing which relation will be tested. Each side is capped at 10 in Relational Reasoning, so the task remains a two-sided relational encoding problem rather than a high-numerosity visual-counting problem.

Examples: “Are the two numbers the same?”, “Is the left number greater than the right number?”, “Are both numbers both even or both odd?”, “Do the two numbers add up to more than 10?”, “Is the larger number even?”, “Is the difference between the two numbers prime?”, “Can one number be divided exactly by the other?”. v6.3 adds explicit value-format relations (“same value, different format?”), quantifiers (“exactly one value is prime?”), range-half membership, one-step counterfactuals, proportional checks, dynamic-center distance relations and compact compound frame checks. This build also adds advanced transformation checks such as “if the smaller number increased by 2...” and two carefully limited compound-frame prompts that combine comparison with parity or gap conditions. Answer YES or NO.

Direct comparison

Same number / equal value / same quantity: ignores visual format unless the question says format.

Left greater / right greater / left smaller: compare numerical values by side.

Differ by 1 / differ by 2 / gap conditions: compare the absolute distance between values.

Same format: asks about representation type, not quantity.

Joint numerical properties

Same even/odd type, both even/odd, mixed parity: evaluate whether values are even or odd.

Both > 5, both ≤ 4, both prime, both square, both multiples of 3: both values must satisfy the property.

Sum = 10, sum > 10, sum even, sum prime: compute the total of the two displayed values.

Product > 20: multiply the two displayed values.

Role-binding and conditional relations

Larger value is even / square / closer to 10: first identify the larger value, then evaluate its property.

Smaller value is odd / closer to 10: first identify the smaller value.

Exact division: evaluate whether one value can be divided exactly by the other.

Difference equals smaller, larger = 2 × smaller + 1, difference > half the larger: evaluate a relation between a derived gap and a role-bound value.

L0 categories are not all equally likely from the beginning. The app uses soft relational frontiers: direct comparisons appear first, then joint numerical properties, role/property binding, metric and factor relations, and finally compound relations. The current value range still matters, but recent accuracy, response time, relational-frame weaknesses and recent sampling concentration also shape which categories are emphasized.

Level 1 — numeric integration across two L0 trials

L1 appears after two L0 trials. It asks for a numeric answer computed from the two retained pairs. You must use the values from both previous pairs, not just the last pair.

Typical L1 operations include: sum of all four values; largest or smallest of all four; second-largest or second-smallest; sum of the larger value from each pair; sum of the smaller value from each pair; difference between pair totals; left-column and right-column totals; range across all four; side-transfer operations such as “use the side of pair 2 that matched a role in pair 1”; filters such as “add only values that appeared once” or “add the prime values”; property counts; and conditional numeric operations such as “use the higher-total pair” or “use the pair with the smaller maximum”. This build also adds contextual role-transfer operations: for example, identifying the side of the larger or smaller number in pair 1 and then entering the value from that same side, or from the opposite side, in pair 2. Related transfer prompts can use the unique even value or the side closer to the current range center. L1 remains a numeric-answer level: it does not use arbitrary selector codes.

All L1 answers are non-negative. Difference questions are framed as gaps or distances unless the wording explicitly defines an order. Type the numeric result on the numpad.

Level 2 — semantic conclusion selection

L2 appears after two L1 trials. Instead of typing a number, you select the only fully correct conclusion from four options. The options refer to the two L1 answers and sometimes to how those answers were obtained.

Pure answer relations ask whether answer 1 was greater than answer 2, whether both answers were equal, whether they had the same parity, or whether their combined total had a property. Threshold relations ask whether the answers were close enough, whether their sum exceeded a calibrated limit, whether both answers exceeded a calibrated limit, or whether both answers stayed at most at a calibrated limit. v6.3 also adds exact-double, less-than-half, multiple-of-3 and shared-factor relations.

More advanced L2 variants bind answer relations to the previous calculation type. For example, the correct conclusion may involve both the size of the answers and whether a prior calculation used a role-based operation, a threshold, a gap, a position rule, a side-transfer rule or a conditional rule. Distractors are not random: they are controlled flips of comparison, parity, threshold or calculation provenance.

Level 3 — synthesis across two L2 questions

L3 appears after two L2 trials. It presents four compact statements or rules about the two previous META² questions. Exactly one statement is correct.

Each retained META² question stores only public information: the corrected L2 conclusion, the difference between the two META answers, the visible kind of question, whether a number limit was used, whether the question referred to how the META answers were calculated, and the relevant answer pattern. If you selected the wrong L2 option, L3 still uses the corrected L2 conclusion. The REVIEW screen gives that corrected conclusion so you can update memory before L3 appears.

L3 is therefore not a memory test of your own mistakes. It is a synthesis task over corrected semantic questions. During L3, the frame no longer shows an auxiliary summary strip of the previous META² questions; the user must rely on the memory chain and select among the four visible statements. Current templates use public labels such as “number limit”, “how the META answers were calculated”, “left/right position” and “difference between the two META answers”, while hidden implementation tags and opaque internal task-type comparisons are suppressed.

Relational progression in Auto

Auto uses the previous layer as evidence for the next layer. L1 depends on L0 readiness; L2 depends on L1 readiness; L3 depends on L2 readiness.

L1 Active: recent L0 performance is stable: enough trials, good accuracy, acceptable response time and coverage of multiple L0 families.
L1 Preview: the user has accumulated extended L0 practice and is near the target accuracy/RT, so occasional L1 probes appear.
L2 Active: L1 has enough successful numeric-integration evidence across several operation families.
L2 Preview: L1 is not fully stable yet, but the user is close enough that occasional Meta² conclusion tasks can test readiness.
L3 Active: L2 conclusion selection is stable enough across at least two Meta² families, so synthesis is not based on one repeated conclusion type.
L3 Preview: L2 performance is close to readiness and has covered at least two Meta² families, so rare Meta³ probes may appear. Blocks of 30 or more trials remain better for Meta³ because a full L3 cycle requires two complete L2 questions.

Preview states are intentionally conservative. They are not rewards and not diagnosis; they are adaptive probes. If the user performs well, the level can become Active. If performance is weak, the level remains rare while the app reinforces prerequisite layers. v6.2.43 uses gradual Auto decay: once META, META² or META³ has appeared, it can pass through reduced-presence retention and decaying Preview before locking again, so isolated errors or rolling-window noise do not make levels flicker.

How reviews work

Correct answers usually advance quickly. Errors enter REVIEW. The review screen shows the correct answer, or for four-option tasks, your selected option against the correct one. The current delays are intentionally generous: approximately 1.5 s for L0, 2.6 s for L1, 4.0 s for L2 and 5.0 s for L3. These delays are meant to keep corrective feedback readable, especially in relational and memory-repair reviews.

On small touch devices, you can now tap anywhere during REVIEW to continue early. This does not affect scoring; it only shortens the visible review time after you have seen enough.

Adaptive system

Numerical Sense: the app adapts exposure time by value and representation. Fast correct responses shorten future exposure; errors increase it. Very fast responses (<300 ms) reduce exposure more than merely correct responses. For cell mastery, the strict baseline remains demanding, but a cell can earn limited extra tolerance from repeated fast evidence. Errors degrade this tolerance gradually. Global level-up is now separated from local representation frontiers: the range can open earlier, while weaker representations continue to train lower values and receive only probabilistic previews above their current frontier.

Relational mode: L0 uses a protected global exposure time rather than separate exposure times for each value. This avoids making the display too fast before new relational categories are learnable. Novel categories receive temporary exposure floors.

Auto integration: in Relational Reasoning, Auto computes a current ceiling from L0/L1/L2 readiness. Higher layers are not simply on or off: they can be Locked, Preview, Reduced or Active. Preview gives low-frequency probes just below the full unlock threshold. If a layer loses support after appearing, it now decays through reduced-presence retention and decaying Preview before locking.

Relational frontiers: L0 categories, L1 operations, L2 conclusion types and L3 synthesis dimensions are staged by difficulty. The system does not permanently exclude harder families, but it gives easier families more probability until the evidence supports the next stage.

Sampling balance: the generator balances YES and NO structurally. It first chooses the intended truth value, then builds a pair that satisfies it. Categories are filtered by value range, weighted by recent accuracy and speed, and softly adjusted by relational-frame weaknesses such as coordination, opposition, comparison, reversal, role-binding, conditionality, metric reasoning and threshold reasoning.

Anti-concentration safeguards: adaptive weights are compressed with a temperature transform and capped relative to the median weight so one category cannot become a probability black hole. A recent-window concentration monitor checks whether one family or category is dominating L0; if dominance is excessive, it now applies a temporary corrective bias against that family or category and reports the correction in the browser console.

Block summary and training focus

At the end of a block, the summary shows accuracy, average RT, fast responses and best streak. In Relational Reasoning, the training-focus panel summarizes what the app emphasized using human-readable labels and a short explanation.

Mode A summaries report visible progress even when no level-up occurs: strengthened cells, newly mastered cells and newly global-ready cells. Mode D summaries now use a matching compact card for relational momentum: relation types practiced, stable relation types, trained layers, current focus and next focus.

The final summary is not a diagnostic report. Detailed family, frame and layer breakdowns are kept in the Training Log so the block-complete screen stays readable.

Progress indicator, log and sound feedback

The progress indicator is now a pre-block orientation cue. At the start of each block, before the first stimulus appears, it shows labelled progress toward the next range or relational tier. In Relational Reasoning, the adaptive focus for the block appears only during this pre-block orientation phase and then disappears before the first trial. During this readable intro, the game frame and three small lights move through a red/amber/green sequence, like a restrained F1-style start signal. Each color has a very short local sound cue when sound feedback is enabled. This tells the user to wait, read the progress bar, register the focus if present, and prepare. Once the first trial begins, both the progress bar and the focus cue disappear so they do not add visual or semantic noise during stimulus encoding.

Sound feedback is continuous and stratified. Correct responses receive a very short confirmation cue; fast responses receive a brighter cue; very short automatic responses receive a small mastery cue; errors receive a brief low cue. Larger achievements still have distinct sounds: level-ups, Auto unlocks, streak milestones, milestone unlocks and strong block completion. The pre-block red/amber/green semaphore also has three brief preparation cues. Transition cues are now lightly serialized, so clicking Continue before a new block cannot mask the red semaphore cue. Streak sounds are subtly graded by streak length: every five correct responses can add a capped higher harmonic, so longer streaks feel different without becoming intrusive.

Continue Training and Take a Break also use different short cues. Continue has a small celebratory lift; Take a Break uses a calmer disconnect sound. Pressing Escape during an active block now uses this same calmer break cue before opening the menu, so deliberate exits from play are sonically consistent. All sounds are generated locally through the Web Audio API; no external audio files are loaded.

Use Sound feedback in the menu to switch sounds on or off. The setting is saved with progress export/import. The block summary, collapsible Training Log, Progress History chart and Milestones panel use lightweight visual structure to make progress easier to read without turning the app into a diagnostic dashboard. The Training Log is hidden by default and can be opened when detailed adaptation information is useful. Qualitative end-of-block comments, milestone text and detailed next-range status are kept in the Training Log; the summary card itself stays focused on metrics, compact progress momentum and the dedicated milestone popup.

Long-term progress and milestones

The menu now includes a collapsed Progress History panel. It stores only compact block summaries, not trial-by-trial data. The chart can show accuracy by block, average response time, and the reached value range in Numerical Sense or active/reached meta-level in Relational Reasoning. The panel now opens and closes with a short, non-intrusive animation so the menu feels less abrupt.

The RT view is mode-aware. In Numerical Sense it shows average block RT. In Relational Reasoning it can draw separate lines for L0, L1, L2 and L3 when those levels were present in saved blocks. The chart now includes restrained reference lines and can show the exact saved block under the pointer or touch position.

The menu also includes optional Milestones. They are checked only after a block ends and are stored in the local profile. They do not change scoring, level-up criteria, exposure time, sampling weights or Auto progression. They are displayed as a compact icon grid, with short-, medium- and long-term goals such as block volume, perfect blocks, long automatic-response totals, representation choices, range milestones and reaching Meta/Meta²/Meta³. Locked milestones show partial progress where it is meaningful, so the panel can act as a quiet medium-term target list rather than only as a trophy shelf. Newly unlocked milestones also appear as a dedicated popup in the block summary, then remain visible in the Milestones panel. Like Progress History, this panel opens and closes with a brief animation.

Practical training recommendations

Use short blocks: 10–20 trials are usually enough for focused Numerical Sense practice. In Relational Reasoning, Block Size Auto is recommended: it keeps ordinary blocks at 20 trials and expands to 30 when Meta³ can appear. If a marked MEM chain reaches the nominal end of the block, the block may extend briefly so the pending meta-question is actually asked.
Leave Relational Integration on Auto for ordinary training: manual L0/L1/L2/L3 is best used when you intentionally want to isolate a layer.
Prioritize accuracy first: speed is only meaningful once the representation or relation is understood.
Do not count in Mode A if you can avoid it: the goal is fast pattern recognition.
In L0, encode both values before reading the question: the hidden question is designed to force abstract encoding rather than question-specific scanning. Individual relational stimuli are capped at 10; because two values are shown, sums and derived META answers can still naturally reach 20 or more.
In L1, retain pairs as ordered pairs: remember pair 1 and pair 2, and left and right positions, because later questions may use position or pair-of-origin.
In L2 and L3, read the options in the frame: the lower buttons are only selectors 1–4; distractors are deliberately close to the correct answer.
Use the review: after a wrong L2, read the correct conclusion because it may be needed for L3.
Mix representations once stable: Core is good for speed; All is better for flexible transfer.
Use Digits when isolating logic: if visual decoding is interfering with relational reasoning, train the logic separately with digits.

Reference Appendix — technical catalogue

This appendix is closed by default because it is meant for audit, debugging and advanced users. The main guide above explains how to play; this section documents the core catalogue and operating parameters. v6.3 extends the relational banks with additional value-format, quantifier, counterfactual, dynamic-center, filter, transformation and public-L3 templates; the in-code catalogue is the authoritative source for the full expanded set.

Visual representations and limits

Representation	Allowed value range	Interpretation
dots	1–20	Ungrouped dot field. Answer the total dot count.
cluster	1–15	Separated color-coded chunks using varied 2–5 sized partitions where possible. Answer the total dot count, not the number of groups.
ten / frame	1–30	Ten-frame / twenty-frame / thirty-frame grid depending on value.
dice	1–24	Up to four dice; each die contributes 1–6 pips.
domino	1–36	Up to three dominoes; each half ranges 0–6.
tally	1–20+	Tally marks grouped by fives; no explicit upper filter in allowedReps.
finger	1–10	One or two hands; raised fingers encode the value.
abacus	1–20+	Upper pink/red beads = 5 each; lower green beads = 1 each; no explicit upper filter in allowedReps.
card	1–39	Up to three playing cards; A=1, J=11, Q=12, K=13.
cube	1–20+	Isometric cube array; no explicit upper filter in allowedReps.
coins	1–20+	Copper = 1, silver = 2, gold = 5; denominations encoded by color, mild size, reeded rim and gold inner ring.
roman	1–20+	Roman numerals; symbolic, not a pure visual quantity.
clock	1–12	Clock hand points to the value; numerals are not printed on the clock.
grid	1–40	Dynamic rectangular grid of filled cells.
digit	1–20+	Arabic numeral; use this to isolate relational logic from visual decoding.

For representations without an explicit upper filter, the practical upper value is normally bounded by the current training level and by the global value ladder. Mode A can progress through 4, 5, 6, 7, 8, 9, 10, 12, 15, 18 and 20. Relational Reasoning caps each displayed side at 10, because two simultaneous values already create a natural combined range up to 20 before META operations add further load.

Timing, response targets and review delays

Parameter	Value	Meaning
prepMs	250 ms	Preparation phase before the stimulus.
maskMs	150 ms	Procedural mask after the stimulus.
reviewMsByLevel[0]	1500 ms	L0 / Numerical Sense review delay.
reviewMsByLevel[1]	2600 ms	L1 review delay.
reviewMsByLevel[2]	4000 ms	L2 review delay.
reviewMsByLevel[3]	5000 ms	L3 review delay.
reviewSkipGuardMs	100 ms	Minimum delay before touch/keyboard can close REVIEW early.
autoMs	250 ms	Short automatic transition delay.
blockIntroMs	1650 ms	Readable pre-block orientation phase before the first stimulus. The progress indicator and red/amber/green frame semaphore appear only during this phase; each color can trigger one short local audio cue when sound is enabled. Start cues are serialized behind transition sounds such as Continue so they remain audible.
metaUnlockNoticeMs	1200 ms	Readable transition pause after Auto META / META² / META³ unlock notices. A new Auto layer also opens a fresh memory window, so the first question at that layer uses newly encoded prerequisite items rather than material shown before the unlock.
metaUnlockSummaryMs	1400 ms	Summary delay when a block ends immediately after a level or Auto meta transition, so the transition remains readable.
relational input onset	Immediate after mask for L0; immediate on question render for L1–L3	No unscored reading phase. The question, answer controls and RT start together. Input remains blocked during PREP, FLASH and MASK.
memory cue	🧠¹/²/³ MEM 1/2 → 2/2	Memory cue shown only after the app has already scheduled a future meta question. On mobile it is centered near the top of the frame; on larger screens it remains compact. It gives one brief onset ping when the retained item changes, then stays still. The superscript marks the target meta layer and the fraction marks progress through the memory chain. Marked errors produce a semantic repair card rather than cancelling the chain.
startETByMode.A	750 ms	Initial exposure time in Numerical Sense.
startETByMode.D	1330 ms	Initial exposure time in Relational Reasoning.
minETByMode.A	70 ms	Exposure floor for Numerical Sense.
minETByMode.D	600 ms	Protected exposure floor for L0 relational encoding.
speedStepByMode.A	34 ms	Exposure reduction after strong Numerical Sense performance; this is just above two 60 Hz frames.
Numerical Sense mastery threshold	Adaptive: base minET + 2 speed steps = 138 ms; max functional tolerance minET + 6 speed steps = 274 ms	A value×representation cell counts as mastered if its ET reaches the strict base threshold, or if repeated fast evidence earns limited extra tolerance. Slow correct answers add a small amount of confidence; errors reduce it gradually.
Numerical Sense level-up rule	Dynamic: ≥45% first tier / ≥60% early / ≥75% middle / ≥85% late globally-ready active cells + every active value covered + current top value ready in a small core of 2–3 representations	Unlocks the next value tier as a motivational global range increase. Local per-representation frontiers still decide which formats may show the new value normally; lagging formats continue lower values and receive only staged previews.
speedStepByMode.D	17 ms	Slower exposure reduction for relational mode.
penalty	170 ms (damped by cell evidence)	Exposure increase after errors. In Mode A, cells with high accumulated score receive up to 40% damping so a single error on a near-mastered cell does not erase several blocks of progress.
dNovelCatETFloors	930 ms first 4 trials; 770 ms first 8 trials	Temporary exposure floor for newly introduced L0 categories.
RT targets	300 / 600 / 900 / 1800 / 3800 / 6500 / 9000 ms	Numerical Sense automatic, Numerical Sense fast, fast L0, good L0, good L1, good L2, good L3.

Controls, block flow and navigation

Control / state	Meaning
Numpad / digits	Number entry for Numerical Sense and L1. The on-screen 0 key is hidden in early Numerical Sense ranges and restored when the active range or task can require it.
YES / NO	Touch buttons, Y/N keys, or left/right arrows for L0.
1–4	Four-option tasks show all answers in the frame. Use touch buttons 1–4 or number keys 1–4; A–D remain aliases for compatibility.
Escape	Pause during gameplay with the Take-a-Break cue; closes the guide first if the guide is open; from the block summary, triggers Take a Break.
Tap during REVIEW	Skips the remaining review delay without changing scoring.
Start Block / Space	Applies menu changes and starts a new block. Space works from the menu when the guide is closed.
Block Size Auto	Recommended for Relational Reasoning. Uses 20 trials by default and 30 when Meta³ is active or in preview, so upper meta levels are not accidentally blocked by 10-trial blocks. If the nominal limit is reached while a MEM chain is active or due, the block drains that pending chain before showing the summary.
Resume Block	Continues the paused block without applying menu changes.
Reset All	Removes local saved state, including adaptive progress, progress history and milestones, and reloads the app.
Training Log	Collapsed detailed menu log with current settings, recent performance, adaptive status, pending cells/categories, historical/milestone counts and the last block note. Qualitative comments are kept here instead of occupying the block-complete report.
Progress History	Collapsed menu chart showing saved block summaries for accuracy, RT and level/range. It opens/closes with a brief animation and is informational only.
Milestones	Collapsed menu collection of optional achievement markers. Newly unlocked milestones also appear as a block-summary popup. The panel never affects progression or trial selection.

Meta sequence	Trigger	Meaning
L0	Every base relational trial	Two quantities flash; a delayed relation is answered YES/NO.
L1	After 2 L0 trials	Combines both retained L0 pairs into a numeric answer.
L2	After 2 L1 trials	Selects the only valid semantic conclusion about the two L1 answers.
L3	After 2 L2 trials	Synthesizes two previous META² questions.
Full Meta³ cycle	15 trials	L0, L0, L1 repeated twice → L2; repeated twice → L3. A nominal block can extend beyond its selected size only to discharge already-marked memory chains.

Level 0 relational question families

Each category has several wording variants. The table lists the internal ID, the first canonical wording, its unlock tier and its functional family.

Tier	ID	Canonical prompt	Family	RFT tags
1	identical	Same number?	direct	coordination
1	left_gt	Is the left number greater than the right number?	direct	comparison, reversal
1	diff_one	Are the two numbers exactly 1 apart?	direct	comparison, metric_relation
1	same_rep	Same format?	direct	coordination, source_relation
2	same_parity	Are both numbers both even or both odd?	joint	coordination, opposition
2	both_gt_5	Are both numbers greater than 5?	joint	comparison, threshold_relation
2	both_small	Are both numbers 4 or less?	joint	comparison, threshold_relation
2	sum_gt_10	Do the two numbers add up to more than 10?	joint	comparison, threshold_relation
2	sum_eq_10	Do the two numbers add up to exactly 10?	joint	coordination, metric_relation
3	diff_gt_3	Is the difference between the numbers greater than 3?	joint	comparison, threshold_relation, metric_relation
3	sum_even	Sum is even?	joint	metric_relation
3	left_double	Is the left number exactly twice the right number?	role	comparison, reversal
3	both_prime	Both prime?	joint	coordination, metric_relation
3	product_gt_20	Do the two numbers multiply to more than 20?	joint	comparison, threshold_relation
3	diff_even	Is the difference between the two numbers even?	joint	metric_relation
3	both_mult3	Both multiples of 3?	joint	coordination, metric_relation
2	larger_even	Larger value is even?	role	role_binding, metric_relation
2	smaller_odd	Smaller value is odd?	role	role_binding, metric_relation
2	diff_two	Differ by 2?	role	coordination, metric_relation
2	gap_prime	Is the difference between the two numbers prime?	role	metric_relation
2	smaller_divides_larger	Can the larger number be divided exactly by the smaller number?	role	role_binding, metric_relation
2	larger_square	Is the larger number a square number?	role	role_binding, metric_relation
2	left_closer_5	Is the left number closer to 5 than the right number?	role	comparison, reversal, metric_relation
3	larger_closer_10	Is the larger number closer to 10 than the smaller number?	role	role_binding, metric_relation
3	smaller_closer_10	Is the smaller number closer to 10 than the larger number?	role	role_binding, metric_relation
4	gap_equals_smaller	Is the difference equal to the smaller number?	role	role_binding, metric_relation
3	larger_one_more_double_smaller	Is the larger number one more than twice the smaller number?	role	role_binding, comparison, metric_relation
3	closer10_even	Is the number closer to 10 even?	role	role_binding, metric_relation
3	larger_prime_smaller_even	Is the larger number prime and the smaller number even?	role	role_binding, metric_relation
3	one_divides	Can one number be divided exactly by the other?	role	metric_relation
3	left_divides_right	Can the right number be divided exactly by the left number?	role	comparison, reversal, metric_relation
4	sum_prime	Do the two numbers add up to a prime number?	joint	metric_relation
4	xor_even	Exactly one even?	joint	opposition, metric_relation
4	diff_gt_half	Is the difference greater than half of the larger number?	role	comparison, threshold_relation, metric_relation
4	both_square	Both perfect squares?	joint	coordination, metric_relation
4	left_triple	Is the left number at least three times the right number?	role	comparison, reversal, threshold_relation
4	smaller_plus2_reaches_larger	If the smaller number increased by 2, would it reach or pass the larger number?	role	counterfactual_relation, transformation_relation, role_binding, comparison, threshold_relation
4	double_smaller_gt_larger	If the smaller number were doubled, would it be greater than the larger number?	role	counterfactual_relation, transformation_relation, role_binding, comparison, proportional_relation
4	larger_minus1_still_gt	If the larger number went down by 1, would it still be greater?	role	counterfactual_relation, transformation_relation, role_binding, comparison
4	left_gt_and_both_even	Is the left number greater, and are both numbers even?	compound	comparison, coordination, metric_relation, quantifier_relation
4	larger_even_and_gap_gt2	Is the larger number even AND is the gap more than 2?	compound	role_binding, comparison, metric_relation, threshold_relation, coordination

Level 1 numeric operations

L1 appears after two L0 trials and computes one numeric answer from the four retained values. The minLevel column is the current value-level gate used by the generator.

ID	Family	minLevel	Canonical prompt
sum_all	sum4	0	Sum of all four values?
max_all	global_extreme	0	Largest of all four values?
min_all	global_extreme	0	Smallest of all four values?
sum_maxima	role_sum	0	Add the larger of each pair?
sum_minima	role_sum	0	Add the smaller of each pair?
sum_diff	directional_diff	0	Difference between the two pair totals?
max_diff	directional_diff	0	Gap between the two pair maxima?
min_diff	directional_diff	0	Gap between the two pair minima?
same_side_as_p1_max	role_transfer	3	Which value in pair 2 was on the same side as pair 1's larger value?
same_side_as_p1_min	role_transfer	3	Which value in pair 2 was on the same side as pair 1's smaller value?
opposite_side_from_p1_min	role_transfer	3	Which value in pair 2 was on the opposite side from pair 1's smaller value?
same_side_as_p1_even	property_position_transfer	3	Which value in pair 2 was on the same side as pair 1's even value?
opposite_side_from_p1_even	property_position_transfer	3	Which value in pair 2 was on the opposite side from pair 1's even value?
same_side_as_p1_closer10	anchor_role_transfer	4	Which value in pair 2 was on the same side as pair 1's value closer to 10?
same_side_as_p1_closer_center	anchor_role_transfer	4	Which value in pair 2 was on the same side as pair 1's value closer to the current range center?
opposite_side_from_p1_closer_center	anchor_role_transfer	4	Which value in pair 2 was on the opposite side from pair 1's value closer to the current range center?
min1_plus_max2	cross_role	1	Smaller of pair 1 plus larger of pair 2?
max1_plus_min2	cross_role	1	Larger of pair 1 plus smaller of pair 2?
cross_role_gap_1	cross_role_gap	2	Difference between pair 1's smaller value and pair 2's larger value?
cross_role_gap_2	cross_role_gap	2	Difference between pair 1's larger value and pair 2's smaller value?
gap_sum	gap	1	Add the two within-pair differences?
gap_diff	gap_diff	1	Difference between the two within-pair differences?
max_gap	gap	2	Larger of the two within-pair differences?
min_gap	gap	2	Smaller of the two within-pair differences?
left_total	position	1	Add the two left-side values?
right_total	position	1	Add the two right-side values?
side_total_diff	position	2	Difference between left-side and right-side totals?
left_values_gap	position_gap	2	Difference between the two left-side values?
right_values_gap	position_gap	2	Difference between the two right-side values?
range_all	global_order	1	Largest minus smallest across all four?
extremes_sum	global_order	2	Largest plus smallest across all four?
middle_sum	global_order	2	Add the two middle values?
middle_gap	global_order	3	Difference between the two middle values?
minima_distance_10	base10	2	How far is the sum of the two smaller values from 10?
maxima_distance_10	base10	2	How far is the sum of the two larger values from 10?
left_total_distance_10	base10_position	3	How far is the left-side total from 10?
right_total_distance_10	base10_position	3	How far is the right-side total from 10?
min_from_higher_total	conditional	3	Smaller value from the higher-total pair? (tie: pair 1)
max_from_lower_total	conditional	3	Larger value from the lower-total pair? (tie: pair 1)
gap_from_higher_total	conditional	3	Gap from the higher-total pair? (tie: pair 1)
total_from_closer_pair	conditional	4	Total of the closer-together pair? (tie: pair 1)
max_from_wider_gap	conditional	4	Larger value from the wider-gap pair? (tie: pair 1)
count_even	property_count	2	How many of the four values are even?
count_prime	property_count	2	How many of the four values are prime?
count_gt5	property_count	2	How many of the four values are greater than 5?
count_distinct	property_count	3	How many distinct values were shown?
count_same_parity_pairs	property_count	3	How many pairs had same-parity values?
double_min_higher_total	composition	4	Double the smaller value from the higher-total pair. (tie: pair 1)
gap1_plus_min2	composition	4	Pair 1 gap plus pair 2's smaller value?
gap2_plus_min1	composition	4	Pair 2 gap plus pair 1's smaller value?

Level 2 conclusion families

L2 compares two previous L1 answers. Pure categories use direct logical predicates; threshold categories compute a calibrated threshold before rendering the question.

ID	Family	minLevel	Kind	Canonical prompt
l2_gt	answer	0	pure	Was the first result greater than the second result?
l2_eq	answer	0	pure	Were the two results equal?
l2_parity	answer	0	pure	Were both results the same type: both even or both odd?
l2_sum_even	answer	1	pure	Do the two results add up to an even number?
l2_close	threshold	0	threshold	Are the two results at most {N} apart?
l2_sum_gt	threshold	1	threshold	Do the two results add up to more than {N}?
l2_both_large	threshold	1	threshold	Are both results greater than {N}?
l2_gap_even	metric	1	pure	Is the difference between the two results even?
l2_gap_prime	metric	2	pure	Is the difference between the two results prime?
l2_larger_double	metric	2	pure	Is the larger result at least twice the smaller result?
l2_strict_factor	metric	2	pure	Can the larger result be divided exactly by the smaller result?
l2_same_distance_10	metric	2	pure	Are both results equally far from 10?
l2_closer_10	metric	3	pure	Is the first result closer to 10 than the second result?
l2_same_op_group	source	3	pure	Were the last two META questions the same kind of task?
l2_same_gap_dependency	source	3	pure	Did both last META questions use a difference within a pair?
l2_same_position_dependency	source	3	pure	Did both last META questions use left/right position?
l2_gap_rule_larger	source	4	pure	Did the calculation using a difference within a pair produce the larger answer?
l2_position_rule_larger	source	4	pure	Did the left/right-position task produce the larger result?
l2_same_transfer_dependency	source	4	pure	Did both previous META questions use pair 1 to choose a side in pair 2?
l2_transfer_task_larger	source	4	pure	Did the side-transfer META question produce the larger answer?

Level 3 synthesis families

L3 is generated dynamically from two prior META² questions. It does not use a fixed question bank; it builds four-option summaries from the features below and then inserts controlled distractors.

Family	Feature ID / pattern	Meaning
question fact	same_winner	Whether the larger META answer appeared in the same position in both META² questions.
question fact	same_gap_parity	Whether the two answer differences had the same even/odd type.
question fact	episode1_larger_gap	Whether the first META² question had the larger answer difference.
structure	same_family	Whether both META² questions asked the same kind of visible question.
structure	same_threshold_use	Whether both matched in number-limit use.
structure	same_source_use	Whether both matched in asking how the META answers were calculated.
bound synthesis	family_gap_bind	Binds visible question kind to larger/smaller answer difference.
bound synthesis	threshold_gap_bind	Binds number-limit use to larger/smaller answer difference.
bound synthesis	source_gap_bind	Binds calculation-source use to larger/smaller answer difference.
bound synthesis	relation_gap_bind	Binds what the META² question was about to larger/smaller answer difference.

Relational-frame tags used by adaptation

Tag	Meaning
coordination	same/equal/matching relations.
opposition	not/different/exactly-one relations.
comparison	greater/less/larger/smaller relations.
reversal	same relation expressed from the opposite side.
role_binding	relations bound to larger, smaller, closer or selected values.
conditional	choose a pair by one rule, then compute another property.
source_relation	relations involving representation, calculation type or provenance.
metric_relation	difference, distance, even/odd, prime/square/divisibility properties.
threshold_relation	relations involving number limits such as >N or ≤N.
sequence_relation	pair-to-pair change relations, such as how a role-bound value changed from pair 1 to pair 2.

These tags are training metadata. Recent errors and slow correct answers can increase the sampling weight of categories carrying the weaker tags.

Auto relational integration thresholds

Layer	Active condition	Preview condition	Presence in Auto
L1 / META	Recent L0: ≥28 trials, ≥84% accuracy, RT ≤1900 ms, ≥2 families.	Extended L0: ≥40 trials, ≥78% accuracy, RT ≤1.25×L0 target, ≥2 families.	Preview ≈16%; Active ranges from reduced presence to full presence depending on L0 stability.
L2 / META²	L1: ≥12 trials, ≥80% accuracy, RT ≤4300 ms, ≥3 operation families.	L1: ≥10 trials, ≥72% accuracy, RT ≤1.30×L1 target.	Preview ≈11%; Active is dosed down if L1 is not strongly stable.
L3 / META³	L2: ≥8 trials, ≥76% accuracy, RT ≤7200 ms.	L2: ≥6 trials, ≥68% accuracy, RT ≤1.30×L2 target.	Preview ≈7%; Active remains lower-density than L1/L2 because full Meta³ cycles are long.

The precise numbers are training heuristics, not diagnostic thresholds. They exist to regulate progression and prevent both premature overload and indefinite L0 purgatory.

Adaptive timing, progression and balancing

Mode A adaptation: exposure time is tracked by representation × value, so mastery of one format does not automatically accelerate another format. The current build separates global range progression from per-representation frontiers: the user can level up from early global-ready evidence, while difficult formats lag behind the global max and receive 35% / 12% / 4% staged previews one, two or three steps above their frontier when adjacent evidence supports it. It also adds a 1.5% far-frontier exposure at the current global top so a representation more than three steps behind does not fossilize.
Mode A error penalty: the 170 ms exposure-time penalty after an error is damped by up to 40% on cells with high accumulated evidence (score). A single error on a near-mastered cell therefore does not erase several blocks of progress, while cells with little evidence still receive the full penalty.
Mode D exposure: L0 uses a protected global relational exposure time. This prevents the stimulus from becoming too brief before new relational categories have been learned. D does not use Mode-A-style representation × value frontiers.
Auto integration states: each meta unlock is staged. L1, L2 and L3 can be Locked, Preview, Reduced or Active. Preview introduces low-frequency probes; Active allows normal but still adaptive presence. A gradual decay layer protects newly available meta levels from immediate disappearance and suppresses repeated unlock announcements after brief dips.
Auto thresholds: L1 depends on recent L0 accuracy/RT/family coverage; L2 depends on L1 trials, accuracy, RT and operation-family evidence; L3 depends on L2 trials, accuracy and RT. Preview thresholds are lower than Active thresholds and use reduced presence probabilities.
L0 truth balancing: the generator chooses the intended truth value first, then builds a pair satisfying it. This reduces YES/NO base-rate bias.
L0 category weighting: categories with lower recent accuracy or slower recent RT receive more sampling weight, subject to value feasibility, relational stage, family weighting, relational-frame weakness and recent concentration penalties.
Mode A mastered-cell weighting: cells already meeting the functional mastery condition remain possible but receive reduced sampling weight, so new and weaker cells are trained more efficiently.
Weight stabilization: Posner family/category weights use temperature compression and a median-ratio cap to reduce extreme probability ratios after multiple adaptive multipliers are combined.
Concentration monitor: recent L0 family/category share and entropy are checked internally. Excessive concentration applies a temporary corrective bias to the over-dominant family or category; the browser console reports the correction for audit.
L1 operation weighting: weaker operations and undertrained operation families are sampled more often, but the generator still respects minLevel, value feasibility and staged operational frontiers.
L2/L3 balancing: conclusion and synthesis tasks are built category-first with controlled distractors so the correct option cannot be inferred from natural base rates. L3 also uses a complexity budget so synthesis dimensions do not all arrive at once.
Four-option review: selected and correct choices are marked after the response. The review can auto-complete or be skipped by touch during REVIEW.

Saved state, reset and diagnostics

Area	Stored / affected fields	Meaning
localStorage key	eucalculia_v6_2_27	Current browser storage key, retained for compatibility. Current saved payload uses schemaVersion 17.
Saved configuration	mode, reps, metaLevel, metaAuto, autoMetaStable, blockSize, blockSizeAuto	Restores the active setup. In Relational Reasoning, metaAuto stores whether the integration ceiling is automatic or manually capped; autoMetaStable keeps the gradual-decay buffer for Auto levels; blockSizeAuto stores the mode-aware block-size setting.
Saved progression	levelByMode, etByMode, etByModeAndValue	Keeps adaptive levels and exposure times.
Saved performance	stats, statsByMode, posnerCatStats, frameStats, meta stats	Preserves long-term adaptation and menu diagnostics.
Saved gamification layer	progressHistory, achievements	Stores compact block summaries for the menu chart and optional milestone unlocks. No trial-by-trial history is stored in this layer.
Capped dictionaries	confusion, problemValues	Pruned to prevent unbounded localStorage growth.
Reset behavior	disable persistence + removeItem + reload	Clears adaptive state, progress history and milestones, then restarts from defaults. Persistence is disabled before reload so beforeunload/visibilitychange cannot re-save the old in-memory state.

Developed by Alberto Flaño Lombardo

linkedin.com/in/alberto-flaño-lombardo-762618259

Based on research in numerical cognition, subitizing, Relational Frame Theory, the Posner paradigm, and Halford's relational complexity theory.

EUCALCULIA 6.3