Big data is becoming more prevalent in psychology and the behavioral sciences, and so are the methodological and statistical issues that arise from its use. Psychologists need to be equipped to deal with these. Big data can be generated in experimental studies where, for example, participants' physiological and psychological responses are tracked over time or where human brain imaging is employed. Observational data from websites such as Facebook, Twitter, and Google is also of increasing interest to psychologists. These sometimes huge data sets, which are often too large for standard computers and can also contain multiple types of data, bring with them challenging questions about data quality and the generalizability of the results as well as which statistical tools are suitable for analyzing them.
The contributions in this volume explore these challenges, looking at the potential of applying machine learning techniques to big data in psychology as well as the split/analyze/meta-analyze (SAM) approach, which allows big data to be split up into smaller datasets so they can be analyzed with conventional multivariate techniques on standard computers. The issues of replicability, prediction accuracy, and combining types of data are also investigated.