Superheat: Supervised heatmaps for visualizing complex data
Technological advancements of the modern era have enabled the collection of huge amounts of data in science and beyond. Accordingly, computationally intensive statistical and machine learning algorithms are being used to seek answers to increasingly complex questions. Although visualization has the potential to be a powerful aid to the modern information extraction process, visualizing high-dimensional data is an ongoing challenge. In this paper we introduce the supervised heatmap, called superheat, which is a new graph that builds upon existing clustered heatmaps that are widely used in fields such as bioinformatics. Supervised heatmaps have two primary aims: to provide a means of visual extraction of the information contained within high-dimensional datasets, and to provide a visual assessment of the performance of model fits to these datasets. We will use two case studies to demonstrate the practicality and usefulness of supervised heatmaps in achieving these goals. The first will examine crime in US communities for which we will use the supervised heatmaps to gain an in-depth understanding of the information contained within the data, the clarity of which is unparalleled by existing visualization methods. The second case study will explore neural activity in the visual cortex where we will use supervised heatmaps to guide an exploration of the suitability of a Lasso-based linear model in predicting brain activity. Supervised heatmaps are implemented via the superheat package written in the R programming software.
Embarrassingly Parallel Sequential Markov-chain Monte Carlo for Large Sets of Time Series
Bayesian computation crucially relies on Markov chain Monte Carlo (MCMC) algorithms. In the case of massive data sets, running the Metropolis-Hastings sampler to draw from the posterior distribution becomes prohibitive due to the large number of likelihood terms that need to be calculated at each iteration. In order to perform Bayesian inference for a large set of time series, we consider an algorithm that combines ‘divide and conquer’ ideas previously used to design MCMC algorithms for big data with a sequential MCMC strategy. The performance of the method is illustrated using a large set of financial data.
Topic segmentation via community detection in complex networks
Many real systems have been modelled in terms of network concepts, and written texts are a particular example of information networks. In recent years, the use of network methods to analyze language has allowed the discovery of several interesting findings, including the proposition of novel models to explain the emergence of fundamental universal patterns. While syntactical networks, one of the most prevalent networked models of written texts, display both scale-free and small-world properties, such representation fails in capturing other textual features, such as the organization in topics or subjects. In this context, we propose a novel network representation whose main purpose is to capture the semantical relationships of words in a simple way. To do so, we link all words co-occurring in the same semantic context, which is defined in a threefold way. We show that the proposed representations favours the emergence of communities of semantically related words, and this feature may be used to identify relevant topics. The proposed methodology to detect topics was applied to segment selected Wikipedia articles. We have found that, in general, our methods outperform traditional bag-of-words representations, which suggests that a high-level textual representation may be useful to study semantical features of texts.
Averaged extreme regression quantile
Various events in the nature, economics and in other areas force us to combine the study of extremes with regression and other methods. A useful tool for reducing the role of nuisance regression, while we are interested in the shape or tails of the basic distribution, is provided by the averaged regression quantile and namely by the average extreme regression quantile. Both are weighted means of regression quantile components, with weights depending on the regressors. Our primary interest is the averaged extreme regression quantile (AERQ), its structure, qualities and its applications, e.g. in investigation of a conditional loss given a value exogenous economic and market variables. AERQ has several interesting equivalent forms: While it is originally defined as an optimal solution of a specific linear programming problem, hence is a weighted mean of responses corresponding to the optimal base of the pertaining linear program, we give another equivalent form as a maximum residual of responses from a specific R-estimator of the slope components of regression parameter. The latter form shows that while AERQ equals to the maximum of some residuals of the responses, it has minimal possible perturbation by the regressors. Notice that these finite-sample results are true even for non-identically distributed model errors, e.g. under heteroscedasticity. Moreover, the representations are formally true even when the errors are dependent – this all provokes a question of the right interpretation and of other possible applications.
Proposition of a Theoretical Model for Missing Data Imputation using Deep Learning and Evolutionary Algorithms
In the last couple of decades, there has been major advancements in the domain of missing data imputation. The techniques in the domain include amongst others: Expectation Maximization, Neural Networks with Evolutionary Algorithms or optimization techniques and K-Nearest Neighbor approaches to solve the problem. The presence of missing data entries in databases render the tasks of decision-making and data analysis nontrivial. As a result this area has attracted a lot of research interest with the aim being to yield accurate and time efficient and sensitive missing data imputation techniques especially when time sensitive applications are concerned like power plants and winding processes. In this article, considering arbitrary and monotone missing data patterns, we hypothesize that the use of deep neural networks built using autoencoders and denoising autoencoders in conjunction with genetic algorithms, swarm intelligence and maximum likelihood estimator methods as novel data imputation techniques will lead to better imputed values than existing techniques. Also considered are the missing at random, missing completely at random and missing not at random missing data mechanisms. We also intend to use fuzzy logic in tandem with deep neural networks to perform the missing data imputation tasks, as well as different building blocks for the deep neural networks like Stacked Restricted Boltzmann Machines and Deep Belief Networks to test our hypothesis. The motivation behind this article is the need for missing data imputation techniques that lead to better imputed values than existing methods with higher accuracies and lower errors.
• Reuse of Neural Modules for General Video Game Playing
• Emergence and coherence of oscillations in star networks of stochastic excitable elements
• Learning the Semantics of Manipulation Action
• Spatial Scaling of Land Cover Networks
• Matroid invariants and counting graph homomorphisms
• On the Min-cost Traveling Salesman Problem with Drone
• Concentration of information content for convex measures
• Sweeping up Zeta
• Lace expansion for dummies
• Robust estimators of accelerated failure time regression with generalized log-gamma errors
• Random Tensor models: Combinatorics, Geometry, Quantum Gravity and Integrability
• Controlling Statistical Moments of Stochastic Dynamical Networks
• Game Distinguishing Numbers of Cartesian Products of Graphs
• On the lattice of subracks of the rack of a finite group
• Central limit theorems for the real eigenvalues of large Gaussian random matrices
• Maximum Rank and Asymptotic Rank of Finite Dynamical Systems
• Stanley’s nonunimodal Gorenstein h-vector is optimal
• What Makes it Difficult to Understand a Scientific Literature?
• Neuron’s Eye View: Inferring Features of Complex Stimuli from Neural Responses
• Max-Pooling Dropout for Regularization of Convolutional Neural Networks
• On connected simple graphs and their degree sequences
• Locally Adaptive Translation for Knowledge Graph Embedding
• MCMC convergence diagnosis using geometry of Bayesian LASSO
• On the possible values of the entropy of undirected graphs
• Neural Generative Question Answering
• On the Rectilinear Crossing Number of Complete Uniform Hypergraphs
• Laplacian Coefficient, Matching Polynomial and Incidence Energy of of Trees with Described Maximum Degree
• Q-Networks for Binary Vector Actions
• Toward a Taxonomy and Computational Models of Abnormalities in Images
• An algorithm for finding Hamiltonian Cycles in Cubic Planar Graphs
• Fixed Point Performance Analysis of Recurrent Neural Networks
• Isotropy of Frequencies and Weak Chimeras With Broken Symmetry
• An Online Unsupervised Structural Plasticity Algorithm for Spiking Neural Networks
• A Central Limit Theorem for Non-Stationary Strongly Mixing Random Fields
• Efficient simulation of Schrödinger equation with piecewise constant positive potential
• Cell-probe Lower Bounds for Dynamic Problems via a New Communication Model
• Predicting psychological attributions from face photographs with a deep neural network
• Adjusting for Chance Clustering Comparison Measures
• Predicting the top and bottom ranks of billboard songs using Machine Learning
• An Investigation into Graph Curvature’s Ability to Measure Congestion in Network Flow
• Spreading, Nonergodicity, and Selftrapping: a puzzle of interacting disordered lattice waves
• On Polynomial Bounds of Convergence for the Availability Factor
• MXNet: A Flexible and Efficient Machine Learning Library for Heterogeneous Distributed Systems
• CrossCat: A Fully Bayesian Nonparametric Method for Analyzing Heterogeneous, High Dimensional Data
• Optimal design, financial and risk modelling with stochastic processes having semicontinuous covariances
• Reconstruction of Real depth-3 Circuits with top fan-in 2
• MERLiN: Mixture Effect Recovery in Linear Networks
• Estimating sparse precision matrices
Like this:
Like Loading...