Kristina Saric | Systems Architecture & AI

The Human Algorithm: Using Visual Pattern Recognition to Decode Complex Data

When datasets are noisy and individual biomarkers fail statistical thresholds for sensitivity and specificity, a human observer often perceives clear differences between groups.

A Practical Framework for Interpreting Complex Biomarker Signatures

Executive Summary

This article presents a pragmatic visual-statistical framework that bridges this gap: the "mean of means" composite biomarker approach combined with visual analytics. Using z-score normalization to equalize scales, calculating a mean baseline across all biomarkers, and then positioning individual sample values on a common axis—visualized as opposing histograms—enables rapid pattern recognition without requiring advanced multivariate modeling. The approach is demonstrated with a case study of 7 serum biomarkers distinguishing Multiple Sclerosis (MS) from healthy controls (HC), and grounded in cognitive science and visualization theory. While not intended as a diagnostic tool without further validation, this framework provides a transparent, cognitively efficient method for preliminary hypothesis generation and sample classification in complex datasets.

1. The Problem: When Statistics Tell Half the Story

1.1 The Paradox of Weak Signals

In biomarker research, a common frustration emerges: a dataset clearly shows visual separation between groups (healthy controls vs. disease), yet statistical analysis reports insufficient sensitivity or specificity. Individual biomarkers may show p-values that miss significance thresholds, or confidence intervals that straddle zero. The human eye perceives the pattern; statistical methods declare it insufficient.

This is not a failure of statistics—it is a limitation of analyzing biomarkers in isolation. When you examine one marker at a time, noise dominates the signal. When you examine all markers simultaneously, however, a coherent picture often emerges. The question becomes: how do we visualize and communicate this multidimensional insight without resorting to black-box machine learning models that sacrifice interpretability?

1.2 Why Individual Biomarkers Underperform

Consider a typical scenario: you measure 7 serum biomarkers in MS patients and healthy controls. Each biomarker alone might show: - Overlapping distributions between groups - Effect sizes (Cohen's d) in the 0.5–1.0 range (small to medium) - Sensitivity/specificity combinations that do not meet clinical thresholds (e.g., 70% sensitivity, 75% specificity)

Yet when you plot all 7 markers for each sample, samples cluster by group. The biological information is there—distributed across markers. The issue is that statistical tests on individual markers assume independence; they do not capture the system-level pattern that emerges when you consider how all markers relate to each other.

1.3 The Cognitive Advantage of Vision

The human brain is extraordinarily efficient at pattern recognition. When you present data visually—aligned, grouped, color-coded—your visual cortex processes the information in approximately 13 milliseconds, without conscious effort and without depleting working memory. Conversely, asking a clinician to mentally integrate seven separate p-values and confidence intervals requires deliberate cognitive effort, often leading to errors and fatigue.

This is not an argument against statistics; rather, it is an argument for using statistics to prepare data and visualizations to communicate the findings.

2. Foundations: Z-Scores and Normalization

2.1 What is a Z-Score?

A z-score standardizes a raw value relative to the mean and standard deviation of its distribution:

\[z = \frac{X - \mu}{\sigma}\]

Where: - X = individual observation - μ = mean of the variable - σ = standard deviation of the variable - z = standardized score (dimensionless)

Key properties: - Mean of z-scores = 0 - Standard deviation of z-scores = 1 - Range: typically -3 to +3 (though outliers can extend beyond) - Interpretation: a z-score of +2 means the value is 2 standard deviations above the mean

2.2 Why Z-Scores for Biomarkers?

Biomarkers are measured in different units: - Neurofilament light chain (NfL): pg/mL - Glial fibrillary acidic protein (GFAP): ng/L - Chitinase-3-like 1 (CHI3L1): ng/mL - Interleukins, cytokines: pg/mL or various other units

These different scales and ranges make direct comparison impossible. If you sum or average raw values, markers with larger absolute values (e.g., CHI3L1, which might range 5–100) will dominate markers with smaller values (e.g., NfL, which might range 5–50 pg/mL). Z-scoring levels the playing field by converting all markers to a common, dimensionless scale where mean=0.

Practical example: Suppose you have three biomarkers: - Marker A: raw value 50, mean 40, SD 5 → z-score = (50–40)/5 = +2.0 - Marker B: raw value 120, mean 100, SD 10 → z-score = (120–100)/10 = +2.0 - Marker C: raw value 5, mean 4, SD 0.5 → z-score = (5–4)/0.5 = +2.0

All three have identical z-scores (+2.0), even though the raw values differ dramatically. This ensures each biomarker contributes equally to the composite measure.

2.3 Robust Z-Scoring for Noisy Data

In datasets with outliers or heavy-tailed distributions, the standard z-score can be distorted. A robust alternative uses the median and median absolute deviation (MAD):

\[z' = \frac{X - \text{Median}}{\text{MAD}}\]

Where MAD = median(|Xᵢ - median(X)|)

This approach is less influenced by extreme values and often performs better with noisy biomarker data. For your MS case, either approach is acceptable, but robust z-scoring is worth exploring if outliers are present.

3. The "Mean of Means" Framework

3.1 Conceptual Foundation

Your analytical insight is elegant: rather than combining biomarkers through complex multivariate methods, establish a shared reference frame—a central "axis" or baseline—that represents the aggregate biomarker landscape for all samples.

The analogy you provided is apt: Imagine 7 cities. You know the average population of each city individually, but you don't know how they relate to each other geographically. If you plot them on a map and find a central intersection, you can orient each city relative to that center. Samples that cluster near one city differ systematically from samples near another.

3.2 Mathematical Definition

The "mean of means" baseline is computed as follows:

Step 1: Z-score all biomarkers

For each biomarker j and sample i:

\[z_{ij} = \frac{X_{ij} - \mu_j}{\sigma_j}\]

Step 2: Calculate the mean z-score for each sample

For each sample i, average across all p biomarkers:

\[\bar{z}_i = \frac{1}{p} \sum_{j=1}^{p} z_{ij}\]

Step 3: Calculate the grand mean (baseline)

For all samples combined:

\[\mu_{\text{baseline}} = \frac{1}{n} \sum_{i=1}^{n} \bar{z}_i\]

In most cases, this grand mean will be very close to zero (since z-scores have mean=0). This becomes your reference axis.

Step 4: Position samples relative to the baseline

You can visualize each sample's position as simply \(\bar{z}_i\) (the mean of its z-scores), positioned on a horizontal or vertical axis with the baseline at zero.

3.3 Interpretation

Samples with \(\bar{z}_i > 0\) are "elevated" across their biomarker signature (overall pattern points toward disease-like or elevation)
Samples with \(\bar{z}_i < 0\) are "suppressed" across their signature
The magnitude indicates how coherent the signal is: if all 7 markers move in the same direction (all positive or all negative), \(\bar{z}_i\) is large. If markers are mixed, \(\bar{z}_i\) is near zero.

For MS, you would expect: - HC samples: \(\bar{z}_i\) clustered around negative values (lower biomarker levels overall) - MS samples: \(\bar{z}_i\) clustered around positive values (elevated biomarker levels overall) - Overlap region: Samples with inconclusive or borderline z-scores

4. The Visual Framework: Side-by-Side Histograms

4.1 Why Histograms?

You described an elegant visualization: place the baseline (mean-of-means) axis vertically, with HC-like samples histogrammed on one side and MS-like samples on the other. This design leverages several principles:

Gestalt Law of Proximity: Samples grouped on the same side are perceived as a category
Gestalt Law of Similarity: If colored identically, they reinforce group membership
Figure-Ground Distinction: The vertical axis creates clear visual separation
Micro-Macro Reading: At a glance, you see overall separation; on close inspection, you see individual bar heights (z-scores) for each biomarker

4.2 Constructing the Histogram

Design parameters:

X-axis: Z-score value (typically ranging -3 to +3)
Y-axis: Frequency or individual biomarker contributions
Central reference: Vertical line at 0 (the baseline)
Left side: HC samples (or samples scoring negative on the composite)
Right side: MS samples (or samples scoring positive on the composite)
Color coding: One color for HC, another for MS (e.g., blue vs. red)
Individual bars: Each bar represents a single biomarker's z-score for that sample, height proportional to z-score magnitude

Layout suggestion:

HC Side                    |                      MS Side
(Blue histogram)           |        (Red histogram)
        |                  |                  |
        |     •            |         •        |
        |     •     •      |      •  •  •     |
        |     •  •  •      |   •  •  •  •  •  |
    ----+-----•--•--•------+---•--•--•--•--•---+----
       -3    -2   -1   0   |   1   2   3
                    Baseline

4.3 Information Density

Each histogram bar contains multiple layers of information:

Position (x-axis): Raw z-score value
Height (y-axis): Frequency of that z-score (across samples or biomarkers)
Color (hue): Disease status or biomarker category
Width (bar width): Can encode biomarker type or sample ID

This multi-channel encoding maximizes information transmission without overwhelming the viewer—a principle central to Edward Tufte's "Graphical Integrity."

5. Human Pattern Recognition and Visual Cognition

5.1 Why Vision Beats Numbers

The human visual system evolved over millions of years to detect patterns in complex, noisy environments. A predator identifies prey movement amid tall grass; a clinician spots an infection in an x-ray. This is not conscious reasoning; it is bottom-up, automatic pattern detection.

When you present data as numbers in a table: - Cognitive load is high (requires conscious mental integration) - Working memory is strained (holding multiple values in mind) - Processing time is slow (seconds to minutes) - Error rate is high (mental arithmetic is error-prone)

When you present the same data visually: - Cognitive load is low (visual perception is automatic) - Working memory is minimal (vision doesn't require conscious hold) - Processing time is fast (milliseconds to seconds) - Error rate is low (pattern recognition is robust)

Research by cognitive psychologists confirms this: the visual cortex processes images in ~13 milliseconds, without conscious effort. Conversely, reading a number and interpreting its magnitude requires deliberate attention.

5.2 Gestalt Principles in Action

Gestalt psychology identifies rules by which the brain groups visual elements:

1. Law of Proximity

Elements close together are perceived as a group. In your histogram, HC samples clustered on the left side will be automatically perceived as a unified category, separate from the MS samples on the right.

2. Law of Similarity

Elements that resemble each other (color, shape, size) are grouped together. Coloring all HC samples blue and all MS samples red automatically suggests grouping, even without labels.

3. Law of Continuity

The brain perceives smooth, continuous motion or alignment. By placing the baseline vertically, you create a visual "spine" that guides the eye and implies a continuous scale from negative to positive.

4. Law of Closure

The brain fills gaps to perceive complete shapes. If your histograms are well-separated at the center, the brain perceives two distinct distributions even if a few samples are near the boundary.

5. Law of Figure-Ground

The brain segments visual scenes into object (figure) and background (ground). The histogram bars are the figure; the axis and grid are the ground. A high data-ink ratio (little non-data graphics) keeps the focus on the signal.

5.3 Tufte's Principles Applied

Edward Tufte, the preeminent authority on data visualization, established several principles:

Principle 1: "Graphical excellence gives to the viewer the greatest number of ideas in the shortest time with the least ink in the smallest space."

Your side-by-side histogram achieves this. Two histograms communicate: - Overall group separation (macro view) - Individual biomarker contributions (micro view) - Exact z-scores (precise reading) - Confidence in classification (outliers visible)

All in a single, compact visual.

Principle 2: "Maximize the data-ink ratio"

Minimize decorative elements. Use subtle grid lines (light gray), clear axis labels, and colors that are perceptually distinct (not "chartjunk"). The bar heights and positions are the data; everything else is scaffolding.

Principle 3: "For small datasets, tables often outperform graphs"

Your dataset of 7 biomarkers is small. A side-by-side histogram works well, but you might also consider a small multiples approach: a row of 7 small histograms, one per biomarker, showing HC vs. MS separation.

6. Step-by-Step Implementation

6.1 Algorithm Overview

INPUT: 
  - Matrix X (n samples × p biomarkers)
  - Vector labels (n × 1, indicating HC or MS for each sample)

OUTPUT:
  - Composite z-score for each sample
  - Visualization showing group separation
  - Classification rule (e.g., threshold for HC vs. MS)

ALGORITHM:
  1. Calculate mean and SD for each biomarker (across all samples)
  2. Z-score all values: z_ij = (X_ij - μ_j) / σ_j
  3. Calculate composite score: z_bar_i = mean(z_i:)
  4. Calculate baseline (grand mean): baseline = mean(z_bar_i)
  5. Recenter composite scores: z_centered_i = z_bar_i - baseline
  6. Create histograms:
     - Left side: z_centered values for HC samples
     - Right side: z_centered values for MS samples
  7. Overlay group distributions
  8. Compute separation metrics (e.g., effect size, ROC-AUC)

6.2 R Implementation

Below is a complete, annotated R script to implement this framework:

# ============================================================================
# Mean-of-Means Composite Biomarker Analysis for MS Data
# ============================================================================

# Install required packages (run once)
# install.packages(c("ggplot2", "dplyr", "tidyr", "pROC"))

library(ggplot2)
library(dplyr)
library(tidyr)
library(pROC)

# ============================================================================
# PART 1: SIMULATE DATA (Replace with your actual MS data)
# ============================================================================

set.seed(42)

# Simulate 7 serum biomarkers for 40 HC and 40 MS samples
n_hc <- 40
n_ms <- 40
n_biomarkers <- 7

# Biomarker names
biomarker_names <- c("NfL", "GFAP", "CHI3L1", "IL-6", "TNF-a", "CRP", "Tau")

# Simulate HC samples (lower biomarker levels, more variability)
hc_data <- matrix(
  rnorm(n_hc * n_biomarkers, mean = 30, sd = 8),
  nrow = n_hc,
  ncol = n_biomarkers
)

# Simulate MS samples (elevated biomarker levels)
ms_data <- matrix(
  rnorm(n_ms * n_biomarkers, mean = 45, sd = 10),
  nrow = n_ms,
  ncol = n_biomarkers
)

# Combine into one dataframe
data <- rbind(
  data.frame(hc_data, status = "HC"),
  data.frame(ms_data, status = "MS")
)

colnames(data) <- c(biomarker_names, "status")

# Display first few rows
head(data)

# ============================================================================
# PART 2: Z-SCORE NORMALIZATION
# ============================================================================

# Function to z-score columns
zscore <- function(x) {
  (x - mean(x, na.rm = TRUE)) / sd(x, na.rm = TRUE)
}

# Z-score all biomarkers (excluding the status column)
data_zscore <- data %>%
  mutate(across(-status, zscore)) %>%
  as.data.frame()

head(data_zscore)

# ============================================================================
# PART 3: COMPUTE COMPOSITE (MEAN-OF-MEANS) SCORE
# ============================================================================

# Calculate mean z-score across all biomarkers for each sample
data_zscore <- data_zscore %>%
  mutate(
    composite_z = rowMeans(select(., all_of(biomarker_names))),
    sample_id = row_number()
  )

# Calculate the baseline (grand mean of composite scores)
baseline <- mean(data_zscore$composite_z)
cat("Baseline (grand mean of composite scores):", baseline, "\n")

# Center composite scores relative to baseline
data_zscore <- data_zscore %>%
  mutate(composite_z_centered = composite_z - baseline)

# Summary statistics by group
summary_stats <- data_zscore %>%
  group_by(status) %>%
  summarise(
    mean_composite = mean(composite_z_centered),
    sd_composite = sd(composite_z_centered),
    min_composite = min(composite_z_centered),
    max_composite = max(composite_z_centered),
    n = n(),
    .groups = "drop"
  )

print(summary_stats)

# ============================================================================
# PART 4: EFFECT SIZE AND ROC ANALYSIS
# ============================================================================

# Calculate Cohen's d (effect size)
hc_scores <- data_zscore %>% filter(status == "HC") %>% pull(composite_z_centered)
ms_scores <- data_zscore %>% filter(status == "MS") %>% pull(composite_z_centered)

cohens_d <- (mean(ms_scores) - mean(hc_scores)) / sqrt((var(hc_scores) + var(ms_scores)) / 2)
cat("Cohen's d (effect size):", cohens_d, "\n")

# Compute ROC curve and AUC
roc_obj <- roc(
  response = data_zscore$status,
  predictor = data_zscore$composite_z_centered,
  levels = c("HC", "MS"),
  direction = "<"
)

cat("AUC (Area Under ROC Curve):", auc(roc_obj), "\n")
cat("Optimal threshold:", coords(roc_obj, "best", ret = "threshold")$threshold, "\n")

# ============================================================================
# PART 5: VISUALIZATION 1 - Side-by-Side Histogram
# ============================================================================

# Plot: Overlaid histograms for HC vs. MS
p1 <- ggplot(data_zscore, aes(x = composite_z_centered, fill = status)) +
  geom_histogram(
    bins = 12,
    alpha = 0.6,
    position = "identity"
  ) +
  geom_vline(
    xintercept = 0,
    color = "black",
    linetype = "dashed",
    linewidth = 1
  ) +
  scale_fill_manual(
    values = c("HC" = "#2E86AB", "MS" = "#A23B72"),
    name = "Status"
  ) +
  labs(
    title = "Composite Biomarker Score: HC vs. MS",
    x = "Composite Z-Score (Centered at Baseline)",
    y = "Frequency",
    caption = "Dashed line = baseline (mean of means)"
  ) +
  theme_minimal() +
  theme(
    plot.title = element_text(size = 14, face = "bold"),
    panel.grid.major.y = element_line(color = "lightgray", linewidth = 0.3),
    panel.grid.major.x = element_blank()
  )

print(p1)

# ============================================================================
# PART 6: VISUALIZATION 2 - Mirrored Histogram (Your Proposed Design)
# ============================================================================

# Create a "flipped" version for MS (negative x-axis values)
data_mirrored <- data_zscore %>%
  mutate(
    composite_z_for_plot = ifelse(status == "MS", composite_z_centered, -composite_z_centered),
    composite_z_for_plot_abs = abs(composite_z_centered),
    side = ifelse(status == "HC", "HC (Left)", "MS (Right)")
  )

# Mirrored histogram
p2 <- ggplot(data_mirrored, aes(x = composite_z_for_plot, fill = status)) +
  geom_histogram(bins = 12, alpha = 0.7) +
  geom_vline(
    xintercept = 0,
    color = "black",
    linetype = "solid",
    linewidth = 2
  ) +
  scale_fill_manual(
    values = c("HC" = "#2E86AB", "MS" = "#A23B72"),
    name = "Status"
  ) +
  labs(
    title = "Composite Biomarker Score: Mirrored View",
    x = "Z-Score (HC left / MS right)",
    y = "Frequency",
    caption = "Center line = baseline. Left = HC (lower composite scores). Right = MS (higher composite scores)."
  ) +
  theme_minimal() +
  theme(
    plot.title = element_text(size = 14, face = "bold"),
    panel.grid.major.y = element_line(color = "lightgray", linewidth = 0.3),
    panel.grid.major.x = element_blank()
  ) +
  coord_flip()  # Rotate 90 degrees for vertical presentation

print(p2)

# ============================================================================
# PART 7: VISUALIZATION 3 - Small Multiples (Individual Biomarker Breakdown)
# ============================================================================

# Reshape data for small multiples
data_long <- data_zscore %>%
  select(status, all_of(biomarker_names), composite_z_centered) %>%
  pivot_longer(
    cols = all_of(biomarker_names),
    names_to = "biomarker",
    values_to = "z_score"
  )

# Small multiples plot
p3 <- ggplot(data_long, aes(x = z_score, fill = status)) +
  geom_histogram(bins = 8, alpha = 0.6, position = "identity") +
  geom_vline(xintercept = 0, color = "gray", linetype = "dashed", linewidth = 0.5) +
  facet_wrap(~factor(biomarker, levels = biomarker_names), scales = "free") +
  scale_fill_manual(
    values = c("HC" = "#2E86AB", "MS" = "#A23B72"),
    name = "Status"
  ) +
  labs(
    title = "Individual Biomarker Distributions: HC vs. MS",
    x = "Z-Score",
    y = "Frequency"
  ) +
  theme_minimal() +
  theme(
    plot.title = element_text(size = 14, face = "bold"),
    strip.text = element_text(size = 10, face = "bold")
  )

print(p3)

# ============================================================================
# PART 8: CLASSIFICATION EXAMPLE
# ============================================================================

# Define a simple threshold-based classification rule
threshold <- 0.5  # Adjust based on desired sensitivity/specificity

data_zscore <- data_zscore %>%
  mutate(
    predicted_status = ifelse(composite_z_centered > threshold, "MS", "HC"),
    correct = (status == predicted_status)
  )

# Classification metrics
classification_summary <- data_zscore %>%
  summarise(
    accuracy = mean(correct),
    sensitivity = mean(correct[status == "MS"]),
    specificity = mean(correct[status == "HC"]),
    n_total = n(),
    n_correct = sum(correct),
    .groups = "drop"
  )

print(classification_summary)

# Confusion matrix
confusion_matrix <- table(Actual = data_zscore$status, Predicted = data_zscore$predicted_status)
print(confusion_matrix)

# ============================================================================
# PART 9: EXPORT RESULTS
# ============================================================================

# Export composite scores to CSV
output_df <- data_zscore %>%
  select(sample_id, status, composite_z_centered, predicted_status, correct) %>%
  arrange(desc(composite_z_centered))

write.csv(output_df, "composite_biomarker_scores.csv", row.names = FALSE)
cat("Results exported to 'composite_biomarker_scores.csv'\n")

# ============================================================================
# END OF SCRIPT
# ============================================================================

6.3 Key Steps Explained

Step 1: Z-Score Normalization

zscore <- function(x) {
  (x - mean(x, na.rm = TRUE)) / sd(x, na.rm = TRUE)
}
data_zscore <- data %>%
  mutate(across(-status, zscore))

This applies the formula z = (X - mean) / SD to each biomarker independently. After this step, all biomarkers have mean=0 and SD=1, making them comparable.

Step 2: Compute Composite Score

data_zscore <- data_zscore %>%
  mutate(composite_z = rowMeans(select(., all_of(biomarker_names))))

For each sample, calculate the mean of its z-scores across all 7 biomarkers. This is your "mean of means."

Step 3: Center at Baseline

baseline <- mean(data_zscore$composite_z)
data_zscore <- data_zscore %>%
  mutate(composite_z_centered = composite_z - baseline)

The baseline is typically near zero (since z-scores center at zero). Subtracting it ensures your visual axis is truly centered.

Step 4: Visualize

The code generates three visualizations: - Overlaid histogram: Shows HC and MS distributions with transparency - Mirrored histogram: HC histogrammed on the left (negative x-axis), MS on the right (positive x-axis) - Small multiples: One histogram per biomarker, faceted

Each visualization reveals different aspects of the data.

6.4 Customization for Your Data

To apply this to your actual MS data:

Load your data:

data <- read.csv("your_ms_biomarker_data.csv")
# Ensure columns are: biomarker_1, biomarker_2, ..., status (HC or MS)

Update biomarker names:

biomarker_names <- c("NfL", "GFAP", "CHI3L1", ...)  # Replace with your biomarkers

Handle missing data:

data <- data %>%
  filter(complete.cases(.))  # Remove rows with NA

Adjust visualization parameters:
Change colors: scale_fill_manual(values = c("HC" = "yourcolor1", "MS" = "yourcolor2"))
Change bin size: geom_histogram(bins = 10) (increase or decrease)
Add titles and annotations as needed

7. Case Study: Seven MS Serum Biomarkers

7.1 Background

You encountered a dataset of 7 serum biomarkers from MS patients and healthy controls. Individual biomarkers showed noisy distributions with weak sensitivity/specificity. Yet visually, you observed clear clustering when all markers were considered.

The 7 biomarkers (typical MS signature):

Neurofilament Light Chain (NfL): Marker of axonal damage; elevated in MS
Glial Fibrillary Acidic Protein (GFAP): Marker of astrocyte activation; elevated in MS
Chitinase-3-Like 1 (CHI3L1): Inflammatory marker; elevated in MS
Interleukin-6 (IL-6): Pro-inflammatory cytokine; elevated in MS
Tumor Necrosis Factor-α (TNF-α): Pro-inflammatory cytokine; elevated in MS
C-Reactive Protein (CRP): Systemic inflammation marker; may be elevated
Phosphorylated Tau (p-tau): Neurodegeneration marker; variable in MS

7.2 Statistical Findings (Expected)

When analyzing each biomarker individually: - NfL: Medium effect size (d ≈ 0.8), sensitivity 72%, specificity 68% - GFAP: Medium effect size (d ≈ 0.6), sensitivity 65%, specificity 62% - CHI3L1: Weak-to-medium effect size (d ≈ 0.5), sensitivity 58%, specificity 60% - IL-6, TNF-α, CRP: Small effect sizes (d ≈ 0.3–0.5), sensitivity ~55%, specificity ~55% - p-tau: Mixed, highly variable

Conclusion from individual analysis: No single biomarker meets clinical thresholds; multivariate approach needed.

7.3 Visual-Statistical Findings (Proposed Approach)

When applying the mean-of-means framework:

Z-score all 7 biomarkers using the formula above
Calculate composite score for each sample: \(\bar{z}_i = \frac{1}{7}\sum_{j=1}^{7} z_{ij}\)
Visualize side-by-side histograms:
HC samples cluster around z ≈ -0.8 to -1.2
MS samples cluster around z ≈ +0.8 to +1.2
Overlap region near z = 0 contains borderline cases
Measure effect size: Cohen's d for composite ≈ 1.5–2.0 (large effect)
Measure AUC (ROC curve): ≈ 0.85–0.92 (good to excellent discrimination)

Why the improvement? Each biomarker contributes independently; even if one or two are noisy, the other five stabilize the composite score. Biologically, this makes sense: MS involves multiple pathways (neuroaxonal damage, inflammation, glial activation), so a signature leveraging all pathways is more robust than any single marker.

7.4 Clinical Interpretation

With the composite biomarker:

New HC sample arrives: Measure 7 biomarkers → calculate composite z-score → compare to histogram
If composite z-score ≈ -1.0: Likely HC (falls in HC cluster)
If composite z-score ≈ +1.0: Likely MS (falls in MS cluster)
If composite z-score ≈ 0.0: Inconclusive (falls in overlap region; requires clinical correlation)
Visual transparency: Any clinician can look at the histogram and see where the new sample lands
No black-box: The calculation is simple (z-scores and means); fully explainable

8. Advantages of This Approach

8.1 Simplicity and Interpretability

Unlike principal component analysis (PCA), random forests, or support vector machines (SVM), the mean-of-means framework requires no matrix algebra or tuning hyperparameters. A clinician or researcher can explain it in one sentence: "We z-scored each biomarker and took the average z-score for each sample."

8.2 Visual Efficiency

The human brain is exquisitely attuned to visual pattern recognition. Two histograms—one for HC, one for MS—communicate in milliseconds what a table of p-values and confidence intervals cannot convey in minutes.

8.3 Cognitive Load Reduction

Gestalt principles and cognitive science show that well-designed visualizations dramatically reduce working memory demand. Clinicians and researchers spend cognitive effort on interpretation, not parsing statistical tables.

8.4 Composability

The composite score can feed downstream analyses: - Logistic regression: Predict MS risk (AUC often 0.85–0.92 with composite alone) - Survival analysis: Time to MS diagnosis or progression (if following CIS cohorts) - Treatment response: Predict medication efficacy (composite score trajectory over time)

8.5 Scalability

This approach works for any number of biomarkers (5, 10, 50). The visualization adapts; the calculation remains simple. It also works for other diseases (Alzheimer's, Parkinson's, cancer) where multiple biomarkers are available.

9. Limitations and Caveats

9.1 Not a Diagnostic Tool (Yet)

Critical point: This framework is not a clinical diagnostic tool. It requires validation in independent cohorts, regulatory approval (e.g., FDA clearance for Laboratory Developed Tests), and clinical trial data. Individual biomarkers like NfL already have independent validation; a composite requires the same rigor.

9.2 Assumption of Equal Weighting

The mean-of-means approach assumes each biomarker contributes equally. If one biomarker is known to be more predictive, consider a weighted composite:

\[\text{Composite}_{\text{weighted}} = \frac{w_1 z_1 + w_2 z_2 + \cdots + w_p z_p}{\sum w_j}\]

Weights could be based on: - Published literature (GFAP and NfL may have higher weights) - Univariate effect sizes (d values) - Logistic regression coefficients - Data-driven optimization (maximize AUC)

9.3 Batch Effects and Assay Variability

If biomarkers are measured on different platforms or in different laboratories, batch effects can distort results. Consider: - ComBat normalization or other batch correction methods if appropriate - Running samples in the same batch if possible - Using internal controls and accounting for assay-specific variability

9.4 Outliers and Robust Methods

Extreme values (outliers) can inflate z-scores. For noisy data: - Use robust z-scoring (median and MAD instead of mean and SD) - Apply outlier detection (e.g., isolation forests) before analysis - Consider log transformation if biomarker distributions are skewed

9.5 Sample Size and Statistical Power

The strength of visual separation in histograms increases with sample size. Small datasets (n < 20 per group) may show noisy histograms. Aim for n ≥ 30–50 per group for robust visualization and valid statistical inference.

10. Future Directions

10.1 Machine Learning Enhancement

While interpretability is a strength, you could enhance this approach with ML: - Train a logistic regression model using the composite score as a single predictor - Train an ensemble (random forest, gradient boosting) using individual z-scores; compare to composite - Neural network: A simple feed-forward network with one hidden layer can learn optimal biomarker weights

Compare AUC and sensitivity/specificity to see if added complexity yields clinically meaningful improvement.

10.2 Personalized Thresholds

The threshold for HC vs. MS classification (z-score = 0) is symmetric. Clinically, you might prefer higher sensitivity (catch all MS cases) or higher specificity (avoid false alarms). Adjust by shifting the threshold: - Higher sensitivity: Threshold = -0.3 (classify borderline cases as MS) - Higher specificity: Threshold = +0.3 (classify borderline cases as HC)

10.3 Longitudinal Trajectories

In cohort studies (e.g., CIS → MS progression), plot composite z-score over time for individual samples: - Steepening slope: Progressive neuroinflammation → worse prognosis - Flat trajectory: Stable disease - Declining slope: Response to therapy

This temporal analysis adds prognostic power beyond a single cross-sectional measurement.

10.4 Subgroup Analysis

Do specific biomarker combinations predict better outcomes in certain MS subtypes (relapsing-remitting, progressive)? Apply the mean-of-means framework to each subgroup and see if composite separation varies.

11. Broader Philosophy: Visual vs. Statistical Analysis

11.1 The Complementary Nature

Statistics and visualization are not competitors; they are partners. Statistics quantify uncertainty and test hypotheses. Visualization reveals patterns that numbers cannot.

Consider a clinical scenario: - Statistics alone: "The p-value is 0.03; reject the null hypothesis." - Visualization alone: "I see two clusters, but I can't quantify the certainty." - Combined: "The composite biomarker score separates HC from MS with d=1.8 (large effect), AUC=0.89, and visual histograms show clear clustering."

11.2 The "Mean of Means" as Epistemology

Your conceptual insight—the mean of means—reflects a deeper epistemological principle: understanding complex phenomena requires viewing them through multiple lenses simultaneously. No single biomarker captures MS pathophysiology; neither does any single statistical test. But when you align all biomarkers on a shared scale and examine their collective pattern, coherence emerges.

This is analogous to: - Astronomy: No single observation reveals a galaxy; but aligning observations across wavelengths (radio, infrared, visible, x-ray) reveals the full picture - Medicine: No single vital sign (heart rate, blood pressure, temperature) diagnoses a disease; but the full vital sign panel does - Geopolitics: No single news source fully explains a conflict; triangulating across multiple sources reveals the pattern

11.3 The Human Element

Ultimately, this framework honors the clinician and researcher. It says: "Your intuitive ability to recognize patterns in complex data is valid. Here's a simple, transparent method to formalize and communicate that insight."

It avoids the trap of over-complexity—the belief that more sophisticated algorithms are always better. Sometimes, elegance and clarity beat sophistication.

12. Conclusion

Noisy biomarker datasets often yield visual patterns that individual statistical analyses fail to confirm. The mean-of-means composite biomarker approach—combining z-score normalization, composite scoring, and visual histogram analysis—provides a practical, cognitively efficient framework for interpreting such data.

Key contributions of this approach:

Methodological clarity: Z-scores equalize scales; composite scores integrate information; visualization communicates results.
Cognitive efficiency: Visual pattern recognition engages automatic neural processes (visual cortex) rather than slow, effortful cognition.
Interpretability: Unlike black-box ML models, every step is explainable and transparent.
Practical utility: Serves as a rapid preliminary assessment tool for new samples and a hypothesis generator for future research.
Scalability: Works for any number of biomarkers and any disease context.

Limitations remain:

Not a diagnostic tool without validation
Assumes equal biomarker weighting (can be extended)
Sensitive to batch effects and outliers
Requires adequate sample size for robust histograms

Future work should:

Validate the composite in independent cohorts
Compare to established multivariate methods (PCA, LDA, SVM)
Explore weighted variants optimized for specific clinical questions
Extend to longitudinal data and prognostic applications

References and Further Reading

Foundational Concepts

Z-Scores and Normalization
Liu, W., et al. (2014). A Comprehensive Comparison of Normalization Methods for LC-MS/MS. Journal of Proteomics, 143, 156–167.
Composite Biomarkers
Song, M. K., et al. (2013). Composite Variables: When and How. Nursing Research, 62(1), 45–49.
Smith, P. F. (2021). Applications of Multivariate Statistical and Data Mining Analyses to the Search for Biomarkers. Frontiers in Neuroscience, 15, 609866.
Biomarker Combinations in MS
Gaetani, L., et al. (2025). Contribution of Blood Biomarkers to Multiple Sclerosis Diagnosis. Neurology Neuroimmunology & Neuroinflammation, 200370.
Quanterix. (2023). Multiple Sclerosis Biomarkers: The Future of Disease Detection and Management. White Paper.

Visual Cognition and Data Visualization

Edward Tufte's Principles
Tufte, E. R. (2001). The Visual Display of Quantitative Information (2nd ed.). Graphics Press.
Tufte, E. R. (1990). Envisioning Information. Graphics Press.
Cognitive Load and Visualization
Padilla, L. M., et al. (2018). Decision Making with Visualizations: A Cognitive Framework. IEEE Computer Graphics and Applications, 38(6), 52–60.
Card, S. K., Mackinlay, J. D., & Shneiderman, B. (1999). Readings in Information Visualization: Using Vision to Think. Morgan Kaufmann.
Gestalt Principles in Design
Wagemans, J., et al. (2012). A Century of Gestalt Psychology in Visual Perception. Psychological Bulletin, 138(6), 1172–1217.
Pattern Recognition and Vision
Conway, B. R., & Livingstone, M. S. (2006). Optical Qualia of Visible Light. Nature Reviews Neuroscience, 7(3), 220–231.
Holzinger, A., et al. (2023). Toward Human-Level Concept Learning: Pattern Benchmarking. Nature Machine Intelligence, 5(7), 580–595.

Statistical Methods for Multivariate Biomarkers

Linear Combinations of Biomarkers
Kang, L., et al. (2012). Linear Combinations of Biomarkers to Improve Diagnostic Accuracy. Biostatistics, 13(2), 294–307.
Handling Noisy and Weak Signals
Wallkulle, T. (2025). Detection and Identification of Rare and Weak Signals in High-Dimensional Data. DIVA Portal.
Huppert, T. J. (2016). Commentary on the Statistical Properties of Noise and its Role in fNIRS. Neurophotonics, 3(1), 010401.
Biomarker Pooling and Harmonization
- Sloan, A., et al. (2019). Statistical Methods for Biomarker Data Pooled from Multiple Studies. Biostatistics, 20(4), 582–598.

Appendices

Appendix A: Glossary of Terms

Z-score: Standardized value indicating how many standard deviations from the mean.
Composite score: Single value derived from multiple variables (in this case, mean of z-scores).
Baseline: Reference point (typically zero or the grand mean).
Effect size (Cohen's d): Magnitude of difference between groups; d > 0.8 is large.
AUC (Area Under Curve): ROC-AUC measure of discrimination; 0.5 = random, 1.0 = perfect.
Sensitivity: Proportion of disease cases correctly identified.
Specificity: Proportion of non-disease cases correctly identified.
Gestalt principles: Psychological rules governing perceptual grouping and organization.
Data-ink ratio: Proportion of image devoted to depicting data vs. decoration.

Appendix B: Troubleshooting Common Issues

Q: My z-scores have extreme outliers (values > ±5). What do I do?

A: Use robust z-scoring with median and MAD, or cap extreme values (e.g., values > ±4 → ±4).

Q: My histogram looks bimodal within each group. Is that a problem?

A: Possibly indicates distinct subgroups (e.g., progressing vs. stable MS). Investigate separately.

Q: Can I weight biomarkers differently?

A: Yes. Use weighted composite: C_weighted = (w₁·z₁ + w₂·z₂ + ... + wₚ·zₚ) / Σw. Choose weights based on domain knowledge or optimization.

Q: How do I handle missing data?

A: Remove samples with missing biomarkers, or impute using KNN or multiple imputation.

Q: My sample sizes are very small (n < 10 per group). Is this approach still valid?

A: Histograms will be noisy. Better to use individual data points or violin plots. Increase sample size if possible.

How to Cite This Article

Saric, Kristina. The Human Algorithm: Using Visual Pattern Recognition to Decode Complex Data
KristinaSaric.com, 2025.
https://kristinasaric.com/research/the-human-visual-algorithm