Superheat: Supervised heatmaps for visualizing complex data

Technological advancements of the modern era have enabled the collection of huge amounts of data in science and beyond. Accordingly, computationally intensive statistical and machine learning algorithms are being used to seek answers to increasingly complex questions. Although visualization has the potential to be a powerful aid to the modern information extraction process, visualizing high-dimensional data is an ongoing challenge. In this paper we introduce the supervised heatmap, called superheat, which is a new graph that builds upon existing clustered heatmaps that are widely used in fields such as bioinformatics. Supervised heatmaps have two primary aims: to provide a means of visual extraction of the information contained within high-dimensional datasets, and to provide a visual assessment of the performance of model fits to these datasets. We will use two case studies to demonstrate the practicality and usefulness of supervised heatmaps in achieving these goals. The first will examine crime in US communities for which we will use the supervised heatmaps to gain an in-depth understanding of the information contained within the data, the clarity of which is unparalleled by existing visualization methods. The second case study will explore neural activity in the visual cortex where we will use supervised heatmaps to guide an exploration of the suitability of a Lasso-based linear model in predicting brain activity. Supervised heatmaps are implemented via the superheat package written in the R programming software.

Embarrassingly Parallel Sequential Markov-chain Monte Carlo for Large Sets of Time Series

Bayesian computation crucially relies on Markov chain Monte Carlo (MCMC) algorithms. In the case of massive data sets, running the Metropolis-Hastings sampler to draw from the posterior distribution becomes prohibitive due to the large number of likelihood terms that need to be calculated at each iteration. In order to perform Bayesian inference for a large set of time series, we consider an algorithm that combines ‘divide and conquer’ ideas previously used to design MCMC algorithms for big data with a sequential MCMC strategy. The performance of the method is illustrated using a large set of financial data.

Topic segmentation via community detection in complex networks

Many real systems have been modelled in terms of network concepts, and written texts are a particular example of information networks. In recent years, the use of network methods to analyze language has allowed the discovery of several interesting findings, including the proposition of novel models to explain the emergence of fundamental universal patterns. While syntactical networks, one of the most prevalent networked models of written texts, display both scale-free and small-world properties, such representation fails in capturing other textual features, such as the organization in topics or subjects. In this context, we propose a novel network representation whose main purpose is to capture the semantical relationships of words in a simple way. To do so, we link all words co-occurring in the same semantic context, which is defined in a threefold way. We show that the proposed representations favours the emergence of communities of semantically related words, and this feature may be used to identify relevant topics. The proposed methodology to detect topics was applied to segment selected Wikipedia articles. We have found that, in general, our methods outperform traditional bag-of-words representations, which suggests that a high-level textual representation may be useful to study semantical features of texts.

Averaged extreme regression quantile

Various events in the nature, economics and in other areas force us to combine the study of extremes with regression and other methods. A useful tool for reducing the role of nuisance regression, while we are interested in the shape or tails of the basic distribution, is provided by the averaged regression quantile and namely by the average extreme regression quantile. Both are weighted means of regression quantile components, with weights depending on the regressors. Our primary interest is the averaged extreme regression quantile (AERQ), its structure, qualities and its applications, e.g. in investigation of a conditional loss given a value exogenous economic and market variables. AERQ has several interesting equivalent forms: While it is originally defined as an optimal solution of a specific linear programming problem, hence is a weighted mean of responses corresponding to the optimal base of the pertaining linear program, we give another equivalent form as a maximum residual of responses from a specific R-estimator of the slope components of regression parameter. The latter form shows that while AERQ equals to the maximum of some residuals of the responses, it has minimal possible perturbation by the regressors. Notice that these finite-sample results are true even for non-identically distributed model errors, e.g. under heteroscedasticity. Moreover, the representations are formally true even when the errors are dependent – this all provokes a question of the right interpretation and of other possible applications.

Proposition of a Theoretical Model for Missing Data Imputation using Deep Learning and Evolutionary Algorithms

In the last couple of decades, there has been major advancements in the domain of missing data imputation. The techniques in the domain include amongst others: Expectation Maximization, Neural Networks with Evolutionary Algorithms or optimization techniques and K-Nearest Neighbor approaches to solve the problem. The presence of missing data entries in databases render the tasks of decision-making and data analysis nontrivial. As a result this area has attracted a lot of research interest with the aim being to yield accurate and time efficient and sensitive missing data imputation techniques especially when time sensitive applications are concerned like power plants and winding processes. In this article, considering arbitrary and monotone missing data patterns, we hypothesize that the use of deep neural networks built using autoencoders and denoising autoencoders in conjunction with genetic algorithms, swarm intelligence and maximum likelihood estimator methods as novel data imputation techniques will lead to better imputed values than existing techniques. Also considered are the missing at random, missing completely at random and missing not at random missing data mechanisms. We also intend to use fuzzy logic in tandem with deep neural networks to perform the missing data imputation tasks, as well as different building blocks for the deep neural networks like Stacked Restricted Boltzmann Machines and Deep Belief Networks to test our hypothesis. The motivation behind this article is the need for missing data imputation techniques that lead to better imputed values than existing methods with higher accuracies and lower errors.

Reuse of Neural Modules for General Video Game Playing

Emergence and coherence of oscillations in star networks of stochastic excitable elements

Learning the Semantics of Manipulation Action

Spatial Scaling of Land Cover Networks

Matroid invariants and counting graph homomorphisms

On the Min-cost Traveling Salesman Problem with Drone

Concentration of information content for convex measures

Sweeping up Zeta

Lace expansion for dummies

Robust estimators of accelerated failure time regression with generalized log-gamma errors

Random Tensor models: Combinatorics, Geometry, Quantum Gravity and Integrability

Controlling Statistical Moments of Stochastic Dynamical Networks

Game Distinguishing Numbers of Cartesian Products of Graphs

On the lattice of subracks of the rack of a finite group

Central limit theorems for the real eigenvalues of large Gaussian random matrices

Maximum Rank and Asymptotic Rank of Finite Dynamical Systems

Stanley’s nonunimodal Gorenstein h-vector is optimal

What Makes it Difficult to Understand a Scientific Literature?

Neuron’s Eye View: Inferring Features of Complex Stimuli from Neural Responses

Max-Pooling Dropout for Regularization of Convolutional Neural Networks

On connected simple graphs and their degree sequences

Locally Adaptive Translation for Knowledge Graph Embedding

MCMC convergence diagnosis using geometry of Bayesian LASSO

On the possible values of the entropy of undirected graphs

Neural Generative Question Answering

On the Rectilinear Crossing Number of Complete Uniform Hypergraphs

Laplacian Coefficient, Matching Polynomial and Incidence Energy of of Trees with Described Maximum Degree

Q-Networks for Binary Vector Actions

Toward a Taxonomy and Computational Models of Abnormalities in Images

An algorithm for finding Hamiltonian Cycles in Cubic Planar Graphs

Fixed Point Performance Analysis of Recurrent Neural Networks

Isotropy of Frequencies and Weak Chimeras With Broken Symmetry

An Online Unsupervised Structural Plasticity Algorithm for Spiking Neural Networks

A Central Limit Theorem for Non-Stationary Strongly Mixing Random Fields

Efficient simulation of Schrödinger equation with piecewise constant positive potential

Cell-probe Lower Bounds for Dynamic Problems via a New Communication Model

Predicting psychological attributions from face photographs with a deep neural network

Adjusting for Chance Clustering Comparison Measures

Predicting the top and bottom ranks of billboard songs using Machine Learning

An Investigation into Graph Curvature’s Ability to Measure Congestion in Network Flow

Spreading, Nonergodicity, and Selftrapping: a puzzle of interacting disordered lattice waves

On Polynomial Bounds of Convergence for the Availability Factor

MXNet: A Flexible and Efficient Machine Learning Library for Heterogeneous Distributed Systems

CrossCat: A Fully Bayesian Nonparametric Method for Analyzing Heterogeneous, High Dimensional Data

Optimal design, financial and risk modelling with stochastic processes having semicontinuous covariances

Reconstruction of Real depth-3 Circuits with top fan-in 2

MERLiN: Mixture Effect Recovery in Linear Networks

Estimating sparse precision matrices