SUCCESSIVE NORMALIZATION OF RECTANGULAR ARRAYS.

Ann Stat

Depts. of Health Research and Policy, Electrical Engineering, and Statistics, Stanford, CA 94305-5405, U.S.A.

Published: June 2010

Standard statistical techniques often require transforming data to have mean 0 and standard deviation 1. Typically, this process of "standardization" or "normalization" is applied across subjects when each subject produces a single number. High-throughput genomic and financial data often come as rectangular arrays, where each coordinate in one direction concerns subjects, who might have different status (case or control, say), and each coordinate in the other designates "outcome" for a specific feature, for example "gene," "polymorphic site," or some aspect of financial profile. When analyzing data that arrive as a rectangular array, one may require both the subjects and the features to be "on the same footing." Thus, there may be a need to standardize across both rows and columns of the rectangular matrix. The question then arises of how to achieve this double normalization. We propose and investigate the convergence of what seems to us a natural approach to successive normalization, which we learned from our colleague Bradley Efron. We also study the implementation of the method on simulated data and on data that arose from scientific experimentation.
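
The abstract does not spell out the procedure, so the following is only a minimal sketch of one plausible reading of "successive normalization": standardize every row, then every column, and repeat until the rows stay standardized after a column pass. The function names, the use of the population standard deviation, the tolerance, and the stopping rule are all assumptions made for illustration, not details taken from the paper.

```python
import math
import random


def standardize(vec):
    """Shift vec to mean 0 and scale it to population standard deviation 1."""
    n = len(vec)
    mean = sum(vec) / n
    sd = math.sqrt(sum((v - mean) ** 2 for v in vec) / n)
    if sd == 0.0:
        raise ValueError("cannot standardize a constant row or column")
    return [(v - mean) / sd for v in vec]


def transpose(rows):
    """Swap rows and columns of a list-of-lists array."""
    return [list(col) for col in zip(*rows)]


def row_deviation(rows):
    """Largest departure of any row from mean 0 or standard deviation 1."""
    worst = 0.0
    for row in rows:
        n = len(row)
        mean = sum(row) / n
        sd = math.sqrt(sum((v - mean) ** 2 for v in row) / n)
        worst = max(worst, abs(mean), abs(sd - 1.0))
    return worst


def successive_normalize(x, tol=1e-8, max_sweeps=10_000):
    """Alternate row and column standardization (assumed stopping rule).

    Returns the normalized array and the number of row-then-column sweeps used.
    """
    current = [list(row) for row in x]
    for sweep in range(1, max_sweeps + 1):
        by_rows = [standardize(row) for row in current]
        # Columns are standardized by working on the transposed array.
        current = transpose([standardize(col) for col in transpose(by_rows)])
        # Columns are now standardized; stop once the rows are as well.
        if row_deviation(current) < tol:
            return current, sweep
    raise RuntimeError("no convergence within max_sweeps")


if __name__ == "__main__":
    random.seed(0)
    data = [[random.gauss(0.0, 1.0) for _ in range(8)] for _ in range(5)]
    normalized, sweeps = successive_normalize(data)
    print("sweeps used:", sweeps)
    print("largest row deviation:", row_deviation(normalized))
    print("largest column deviation:", row_deviation(transpose(normalized)))
```

The stopping rule checks the rows only because each sweep ends with a column pass, so the columns are standardized up to rounding error at that point. Whether and how fast such an iteration converges is exactly the question the paper investigates, so the sketch raises an error rather than assuming success.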

http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2868388	PMC
http://dx.doi.org/10.1214/09-AOS743	DOI Listing
