Batch and Plate Confounding

How Licklider detects when batch or plate structure is confounded with experimental groups, and what options are available when confounding is found, including where the current detection has limits.

When samples are processed in batches or on plates, the batch or plate can introduce systematic variation that is unrelated to the experimental treatment. If all samples from one group were processed in one batch and all samples from another group in a different batch, the measured difference between groups may reflect batch effects as much as — or more than — the actual treatment effect.

This is called confounding: the batch and the group are so closely linked that their effects cannot be separated from the data alone.


What Licklider checks

When a batch ID column, plate ID column, or run order column is present in the dataset, Licklider checks whether the batch structure is confounded with the experimental groups.

This means the check depends on the structure that is actually encoded in the dataset. If batch, plate, lane, day, operator, instrument, or run-order information is missing from the uploaded table, Licklider cannot infer that hidden structure automatically from group values alone. In that situation, a real confound can remain undetected and the group difference may still reflect processing structure rather than biology.

Batch and plate columns Licklider examines how batch or plate membership overlaps with group membership. A complete confound occurs when every sample in a group belongs to the same batch and every sample in another group belongs to a different batch. A partial confound occurs when the overlap is substantial but not total.

Run order columns Licklider checks whether the median run order differs systematically across groups. A suspicious run order pattern is flagged as a potential source of systematic bias but does not on its own indicate confounding at the same severity as batch or plate overlap.

Run order is treated more cautiously than batch or plate overlap because order trends can suggest drift without proving that the group effect is fully inseparable from processing order.


Severity levels

The confounding check assigns a severity level based on how complete the overlap is:

  • Complete confound — all samples in each group belong to a single distinct batch or plate. The batch and group effects are fully inseparable.
  • Partial confound — there is substantial overlap between batch and group, but some cross-batch representation exists.
  • Suspicious run order — run order differs across groups in a way that warrants attention, but does not constitute a categorical confound.

What you are asked to confirm

When confounding is detected, Licklider presents a confirmation step with the following options:

Disclose the batch or plate structure and continue Acknowledge that the batch structure exists and include a disclosure in the figure output. This is the appropriate path when the confounding is unavoidable but the result is still informative, and when the limitation will be reported.

Keep batch structure visible in the figure Encode the batch or plate as a visual element — for example, as a shape, color, or facet — so that the structure is transparent to the reader. This approach makes the confound explicit in the visualization rather than only in the disclosure text.

Show a descriptive figure only The confounding is unresolved and the figure will be treated as descriptive only, not eligible for inferential claims.


What Licklider does not do

Licklider detects and discloses batch confounding but does not automatically apply a statistical correction for batch effects. Approaches such as batch correction before analysis, including batch as a covariate in the model, or using a mixed model that accounts for the batch structure are not applied automatically.

That boundary is deliberate. Automatic correction can hide an important design problem, make the confound look more resolved than it really is, and obscure which assumptions were introduced to separate biology from processing structure. Licklider therefore surfaces the problem first and asks you to choose an explicit disclosure or modeling path instead of silently fixing it in the background.

If your data has a substantial batch confound and you want to correct for it, address the structure in the Prep panel or specify a model in the Chat that includes batch as a covariate.

Licklider also does not guarantee that every harmful processing structure is visible to this check. If the true source of bias is stored only in lab notes, instrument logs, sample handling history, or another metadata source outside the uploaded table, the current check cannot detect it automatically.


Effect on export

When batch confounding is detected and unresolved, the Inspector will indicate that confirmation is required. Claim-bearing export is blocked until one of the three options above is selected.


What this page does not cover


Design Rationale & References

Licklider's design choices

Licklider treats batch and plate confounding as a design and interpretation problem before it treats it as a modeling problem. The goal of the check is to stop researchers from mistaking a processing artifact for a biological effect when the two are too entangled to be separated cleanly from the observed data alone. That is why the product emphasizes detection, disclosure, visual transparency, and claim-bearing export control instead of silent automatic correction.

The confirmation step exists because not every confound has the same consequence. Some datasets remain useful as descriptive evidence if the batch structure is made visible. Others may still support a model-based analysis if batch is handled explicitly upstream. The design therefore asks the reader to acknowledge the limitation rather than letting the software imply that confounding has been solved automatically.

Methodological foundations

  1. Leek, J. T., Scharpf, R. B., Corrada Bravo, H., Simcha, D., Langmead, B., Johnson, W. E., Geman, D., Baggerly, K., & Irizarry, R. A. (2010). Tackling the widespread and critical impact of batch effects in high-throughput data. Nature Reviews Genetics, 11(10), 733-739.

    → A standard reference for why batch effects can dominate apparent biological signal and why they must be identified and handled explicitly.

Implementation boundaries

  • Licklider can only evaluate batch, plate, and run-order structure that is present in the uploaded dataset or mapped metadata.
  • If the processing structure is missing, mislabeled, or stored outside the table being analyzed, Licklider cannot automatically detect that hidden confound.
  • The current check detects and discloses confounding, but it does not automatically perform batch correction or choose a corrected inferential model for you.
  • A flagged run-order pattern is a warning about possible systematic bias, not proof that the group effect is fully explained by order alone.