Proc ACM SIGACT SIGMOD SIGART Symp Princ Database Syst
June 2021
Given an × dimensional dataset , a projection query specifies a subset ⊆ [] of columns which yields a new × || array. We study the space complexity of computing data analysis functions over such subspaces, including heavy hitters and norms, when the subspaces are revealed only after observing the data. We show that this important class of problems is typically hard: for many problems, we show 2 lower bounds.
View Article and Find Full Text PDF