Balancing Type I Error and Power in Linear Mixed Models

Linear mixed-effects models have increasingly replaced mixed-model analyses of variance for statistical inference in factorial psycholinguistic experiments. The advantages of LMMs over ANOVAs, however, come at a cost: Setting up an LMM is not as straightforward as running an ANOVA. One simple option, when numerically possible, is to fit the full variance-covariance structure of random effects (the maximal model; Barr et al., 2013), presumably to keep Type I error down to the nominal $\alpha$ in the presence of random effects. Although it is true that fitting a model with only random intercepts may lead to higher Type I error, fitting a maximal model also has a cost: it can lead to a significant loss of power. We demonstrate this with simulations and suggest that for typical psychological and psycholinguistic data, models with a random effect structure that is supported by the data have optimal Type I error and power properties.
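
To make the contrast between the two random-effect structures concrete, the following is a sketch under an assumed design that the abstract does not specify: a single two-level predictor $x_{si}$ varied within both subjects $s$ and items $i$. All symbols ($\beta_0$, $\beta_1$, $S$, $I$, $\varepsilon$, $\Sigma_S$, $\Sigma_I$) are illustrative notation, not the authors'.

```latex
% Random-intercepts-only model: subjects and items shift the mean only.
y_{si} = \beta_0 + S_{0s} + I_{0i} + \beta_1 x_{si} + \varepsilon_{si},
\qquad S_{0s} \sim \mathcal{N}(0, \tau_S^2),\;
       I_{0i} \sim \mathcal{N}(0, \tau_I^2),\;
       \varepsilon_{si} \sim \mathcal{N}(0, \sigma^2)

% Maximal model: the effect of x also varies by subject and by item,
% and intercept and slope deviations may be correlated.
y_{si} = \beta_0 + S_{0s} + I_{0i}
       + (\beta_1 + S_{1s} + I_{1i})\, x_{si} + \varepsilon_{si},
\qquad (S_{0s}, S_{1s})^\top \sim \mathcal{N}(0, \Sigma_S),\;
       (I_{0i}, I_{1i})^\top \sim \mathcal{N}(0, \Sigma_I)
```

In this assumed design the maximal model estimates two full $2 \times 2$ covariance matrices (six variance-covariance parameters) where the intercepts-only model estimates two variances. A structure "supported by the data" would lie between the two, retaining only those variance components for which the data provide evidence.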


Survival and lifetime data analysis with a flexible class of distributions

We introduce a general class of continuous univariate distributions with positive support obtained by transforming the class of two-piece distributions. We show that this class of distributions is very flexible, easy to implement, and contains members that can capture different tail behaviours and shapes, producing also a variety of hazard functions. The proposed distributions represent a flexible alternative to the classical choices such as the log-normal, Gamma, and Weibull distributions. We investigate empirically the inferential properties of the proposed models through an extensive simulation study. We present some applications using real data in the contexts of time-to-event and accelerated failure time models. In the second kind of application, we explore the use of these models in the estimation of the distribution of the individual remaining life.


A note on the evaluation of generative models

Probabilistic generative models can be used for compression, denoising, inpainting, texture synthesis, semi-supervised learning, unsupervised feature learning, and other tasks. Given this wide range of applications, it is not surprising that a lot of heterogeneity exists in the way image models are formulated, trained, and evaluated. As a consequence, direct comparison between image models is often difficult. This article reviews mostly known but often underappreciated properties relating to the evaluation and interpretation of generative models. In particular, we show that three of the currently most commonly used criteria—average log-likelihood, Parzen window estimates, and visual fidelity of samples—are largely independent of each other when images are high-dimensional. Good performance with respect to one criterion therefore need not imply good performance with respect to the other criteria. Our results show that extrapolation from one criterion to another is not warranted and generative models need to be evaluated directly with respect to the application(s) they were intended for. In addition, we provide examples demonstrating that Parzen window estimates should generally be avoided.
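
For readers unfamiliar with the second criterion, the following is a minimal sketch of a Parzen window log-likelihood estimate, assuming an isotropic Gaussian kernel. The function name `parzen_log_likelihood` and its parameters are hypothetical and are not taken from the article.

```python
import math


def parzen_log_likelihood(samples, test_points, bandwidth):
    """Mean log-density of test_points under a Gaussian Parzen window.

    samples     -- list of d-dimensional points drawn from the model
    test_points -- list of d-dimensional held-out points
    bandwidth   -- kernel standard deviation (assumed isotropic)
    """
    n = len(samples)
    d = len(samples[0])
    # Log of the normalising constant: n kernels, each a d-dimensional Gaussian.
    log_norm = math.log(n) + 0.5 * d * math.log(2.0 * math.pi * bandwidth ** 2)

    total = 0.0
    for x in test_points:
        # Unnormalised log-kernel value of x under each sample-centred Gaussian.
        log_kernels = [
            -sum((xi - si) ** 2 for xi, si in zip(x, s)) / (2.0 * bandwidth ** 2)
            for s in samples
        ]
        # Log-sum-exp keeps the sum stable when distances are large.
        m = max(log_kernels)
        log_sum = m + math.log(sum(math.exp(lk - m) for lk in log_kernels))
        total += log_sum - log_norm
    return total / len(test_points)


if __name__ == "__main__":
    model_samples = [[-1.0], [1.0]]
    held_out = [[0.0]]
    print(parzen_log_likelihood(model_samples, held_out, bandwidth=1.0))
```

The estimate depends only on distances between held-out points and a finite set of model samples, together with the choice of bandwidth, which gives some intuition for why it can diverge from the model's true log-likelihood in high dimensions.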


Model Based Clustering for Mixed Data: clustMD

A model based clustering procedure for data of mixed type, clustMD, is developed using a latent variable model. It is proposed that a latent variable, following a mixture of Gaussian distributions, generates the observed data of mixed type. The observed data may be any combination of continuous, binary, ordinal or nominal variables. clustMD employs a parsimonious covariance structure for the latent variables, leading to a suite of six clustering models that vary in complexity and provide an elegant and unified approach to clustering mixed data. An expectation maximisation (EM) algorithm is used to estimate clustMD; in the presence of nominal data a Monte Carlo EM algorithm is required. The clustMD model is illustrated by clustering simulated mixed type data and prostate cancer patients, on whom mixed data have been recorded.


Likelihood Component Analysis

Independent component analysis (ICA) is popular in many applications, including cognitive neuroscience and signal processing. Due to computational constraints, principal component analysis is used for dimension reduction prior to ICA (PCA+ICA), which could remove important information. The problem is that interesting independent components (ICs) could be mixed in several principal components that are discarded and then these ICs cannot be recovered. To address this issue, we propose likelihood component analysis (LCA), a novel methodology in which dimension reduction and latent variable estimation are achieved simultaneously by maximizing a likelihood with Gaussian and non-Gaussian components. We present a parametric LCA model using the logistic density and a semi-parametric LCA model using tilted Gaussians with cubic B-splines. We implement an algorithm scalable to datasets common in applications (e.g., hundreds of thousands of observations across hundreds of variables with dozens of latent components). In simulations, our methods recover latent components that are discarded by PCA+ICA methods. We apply our method to dependent multivariate data and demonstrate that LCA is a useful data visualization and dimension reduction tool that reveals features not apparent from PCA or PCA+ICA. We also apply our method to an experiment from the Human Connectome Project with state-of-the-art temporal and spatial resolution and identify an artifact using LCA that was missed by PCA+ICA. We present theoretical results on identifiability of the LCA model and consistency of our estimator.


A new approach to optimal designs for correlated observations

Speed of Vertex reinforced jump process on Galton-Watson trees

Theory of the strongly disordered Weyl semimetal

Thoughts on Massively Scalable Gaussian Processes

Convolutional Neural Network for Stereotypical Motor Movement Detection in Autism

Can parametric statistical methods be trusted for fMRI based group studies?

Modeling trend progression through an extension of the Polya Urn Process

Autoregressive Model for Individual Consumption Data – LASSO Selection and Significance Test

Sparse approximation by greedy algorithms

Fault-Tolerant Distributed Optimization (Part IV): Constrained Optimization with Arbitrary Directed Networks

An Abstract Model for Branching and its Application to Mixed Integer Programming

Percolation and Random Walks

Needles and straw in a haystack: empirical Bayes confidence for possibly sparse sequences

Explosive Percolation: Novel critical and supercritical phenomena

Optimality gaps in asymptotic dimensioning of many-server systems

Queueing Analysis of Unicast IPTV With User Mobility and Adaptive Modulation and Coding in Wireless Cellular Networks

Maximizing Kirchhoff index of unicyclic graphs with fixed maximum degree

On the Cost of Concurrency in Transactional Memory

Computational Intractability of Dictionary Learning for Sparse Representation

The distance-dependent two-point function of triangulations: a new derivation from old results

Pattern matching in $(213,231)$-avoiding permutations

Discrete Rényi Classifiers

On real growth and run-off companies in insurance ruin theory

Markovian and product quantization of an $\mathbb{R}^d$-valued Euler scheme of a diffusion process with applications to finance

‘Pale as death’ or ‘pâle comme la mort’: Frozen similes used as literary clichés

Symmetry-invariant optimization in deep networks

Exponential inequalities for unbounded functions of geometrically ergodic Markov chains. Applications to quantitative error bounds for regenerative Metropolis algorithms

Invariance principle for local time by quasi-compactness

The monoid algebra of all relations on a finite set

Critical multi-type Galton-Watson trees conditioned to be large

Computable bounds of ${\ell}^2$-spectral gap for discrete Markov chains with band transition matrices

Adaptive information-theoretic bounded rational decision-making with parametric priors

Getting started with particle Metropolis-Hastings for inference in nonlinear dynamical models

On Low Rank Approximation of Binary Matrices

Exploratory Analysis of Multivariate Longitudinal Child Education Data

The Assessment of Performance of Correlation Estimates in Discrete Bivariate Distributions Using Bootstrap Methodology

Heat kernels in the context of Kato potentials on arbitrary manifolds

Variations on a theme – the evolution of hydrocarbon solids: I. Compositional and spectral modelling – the eRCN and DG models

Comparing Writing Styles using Word Embedding and Dynamic Time Warping

An Empirical Study on Sentiment Classification of Chinese Review using Word Embedding

Stochastic Proximal Gradient Descent for Nuclear Norm Regularization

The characterisation of irregularly-shaped particles: a re-consideration of finite-sized, porous and fractal grains

Note on Perfect Forests in Digraphs

A Bayesian approach to the evaluation of risk-based microbiological criteria for Campylobacter in broiler meat

Total dominator chromatic number of specific graphs

Interpretable classifiers using rules and Bayesian analysis: Building a better stroke prediction model

Quantile regression for mixed models with an application to examine blood pressure trends in China

Computing sets of graded attribute implications with witnessed non-redundancy

A Bayesian spatiotemporal model for reconstructing climate from multiple pollen records

An Active-Sensing Approach to Channel Vector Subspace Estimation in mm-Wave Massive MIMO Systems

On martingale tail sums in affine two-color urn models with multiple drawings

A central limit theorem for stochastic heat equations in random environment

Tests for High-Dimensional Covariance Matrices Using Random Matrix Projection

A Simple Approach to Optimal CUR Decomposition

Robust data assimilation using $L_1$ and Huber norms

Multinomial Loss on Held-out Data for the Sparse Non-negative Matrix Language Model

Strong Scaling for Numerical Weather Prediction at Petascale with the Atmospheric Model NUMA

Color Aesthetics and Social Networks in Complete Tang Poems: Explorations and Discoveries

Horton Law in Self-Similar Trees

Mining Local Gazetteers of Literary Chinese with CRF and Pattern based Methods for Biographical Information in Chinese History

Regularization and Bayesian Learning in Dynamical Systems: Past, Present and Future

Sparse movement data can reveal social influences on individual travel decisions

Mean-field inference of Hawkes point processes