What is Information?

Information is a precise concept that can be defined mathematically, but its relationship to what we call ‘knowledge’ is not always made clear. Furthermore, the concepts ‘entropy’ and ‘information’, while deeply related, are distinct and must be used with care, something that is not always achieved in the literature. In this elementary introduction, the concepts of entropy and information are laid out one by one, explained intuitively, but defined rigorously. I argue that a proper understanding of information in terms of prediction is key to a number of disciplines beyond engineering, such as physics and biology.
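
The abstract above does not give the definitions themselves. As a minimal sketch, assuming the standard Shannon definitions are the ones intended, entropy measures uncertainty about a variable, and information (here, mutual information) measures how much that uncertainty drops once another variable is known. The function names and the example distributions are illustrative and not taken from the paper.

```python
import math


def entropy(probs):
    """Shannon entropy, in bits, of a discrete probability distribution."""
    return -sum(p * math.log2(p) for p in probs if p > 0)


def mutual_information(joint):
    """I(X;Y) = H(X) + H(Y) - H(X,Y) for a joint table with joint[x][y] = P(x, y)."""
    px = [sum(row) for row in joint]          # marginal distribution of X
    py = [sum(col) for col in zip(*joint)]    # marginal distribution of Y
    pxy = [p for row in joint for p in row]   # flattened joint distribution
    return entropy(px) + entropy(py) - entropy(pxy)


# A fair coin carries one bit of uncertainty.
h_coin = entropy([0.5, 0.5])

# If Y is an exact copy of X, knowing Y predicts X perfectly.
i_copy = mutual_information([[0.5, 0.0], [0.0, 0.5]])

# If X and Y are independent, Y says nothing about X.
i_indep = mutual_information([[0.25, 0.25], [0.25, 0.25]])
```

In this toy example the entropy of the coin is the same in both cases, while the information shared between the two variables differs. That is the distinction between the two concepts the abstract stresses.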


Nested Orthogonal Arrays

Orthogonal Arrays allow us to test various levels of each factor and balance the different factors so that we can estimate interactions as well as first order effects. There is a trade-off between how well we can sample different levels of each factor and how many interactions we are able to estimate. This paper describes one method to mitigate this trade-off. This method will allow us, with n observations, to sample n levels of each factor and minimize the correlation between the estimates of first order terms and their interactions.
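
The property of sampling n levels of each factor with n observations can be illustrated with a plain Latin hypercube design. This is not the paper's nested orthogonal array construction and does nothing to control correlation with interaction terms. It is only a sketch of the level-coverage property, and the function name and parameters are hypothetical.

```python
import random


def latin_hypercube(n, k, seed=0):
    """Return n runs over k factors; each factor takes every level 0..n-1 exactly once."""
    rng = random.Random(seed)
    columns = []
    for _ in range(k):
        levels = list(range(n))
        rng.shuffle(levels)  # independent random permutation per factor
        columns.append(levels)
    # Row i of the design takes the i-th entry of every factor's permutation.
    return [tuple(col[i] for col in columns) for i in range(n)]


design = latin_hypercube(n=5, k=3)
```

Each column of `design` is a permutation of the n levels, so every level of every factor is observed once.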


Quegel: A General-Purpose Query-Centric Framework for Querying Big Graphs

Pioneered by Google’s Pregel, many distributed systems have been developed for large-scale graph analytics. These systems expose the user-friendly ‘think like a vertex’ programming interface to users, and exhibit good horizontal scalability. However, these systems are designed for tasks where the majority of graph vertices participate in computation, but are not suitable for processing light-workload graph queries where only a small fraction of vertices need to be accessed. The programming paradigm adopted by these systems can seriously under-utilize the resources in a cluster for graph query processing. In this work, we develop a new open-source system, called Quegel, for querying big graphs, which treats queries as first-class citizens in the design of its computing model. Users only need to specify the Pregel-like algorithm for a generic query, and Quegel processes light-workload graph queries on demand using a novel superstep-sharing execution model to effectively utilize the cluster resources. Quegel further provides a convenient interface for constructing graph indexes, which significantly improve query performance but are not supported by existing graph-parallel systems. Our experiments verified that Quegel is highly efficient in answering various types of graph queries and is up to orders of magnitude faster than existing systems.


Is swarm intelligence able to create mazes?

This paper presents the idea of applying Computational Intelligence to the creation of board games, in particular mazes. The proposed idea is examined for two different algorithms. The results of the experiments are shown and discussed to present the advantages and disadvantages.


Expected Similarity Estimation for Large-Scale Batch and Streaming Anomaly Detection

We present a novel algorithm for anomaly detection on very large datasets and data streams. The method, named EXPected Similarity Estimation (EXPoSE), is kernel-based and able to efficiently compute the similarity between new data points and the distribution of regular data. The estimator is formulated as an inner product with a reproducing kernel Hilbert space embedding and makes no assumption about the type or shape of the underlying data distribution. We show that offline (batch) learning with EXPoSE can be done in linear time and online (incremental) learning takes constant time per instance and model update. Furthermore, EXPoSE can make predictions in constant time, while it requires only constant memory. In addition, we propose different methodologies for concept drift adaptation on evolving data streams. On several real datasets we demonstrate that our approach can compete with state-of-the-art algorithms for anomaly detection while being significantly faster than techniques with the same discriminant power.
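
The inner-product formulation described above can be sketched as follows: keep a running mean of feature vectors of the regular data, and score a new point by its inner product with that mean. The abstract does not say how the feature map is obtained. This sketch assumes a Gaussian kernel approximated with random Fourier features, and the class name, parameters and toy data are all hypothetical.

```python
import math
import random


class ExpectedSimilaritySketch:
    """Score = <phi(y), mean of phi(x_i)>, with phi an approximate kernel feature map."""

    def __init__(self, dim, n_features=256, gamma=1.0, seed=0):
        rng = random.Random(seed)
        # Random Fourier features for k(x, y) = exp(-gamma * ||x - y||^2):
        # frequencies w ~ N(0, 2 * gamma * I), phases b ~ Uniform[0, 2 * pi].
        std = math.sqrt(2.0 * gamma)
        self.w = [[rng.gauss(0.0, std) for _ in range(dim)] for _ in range(n_features)]
        self.b = [rng.uniform(0.0, 2.0 * math.pi) for _ in range(n_features)]
        self.scale = math.sqrt(2.0 / n_features)
        self.mean = [0.0] * n_features  # fixed-size model, independent of stream length
        self.count = 0

    def _features(self, x):
        return [
            self.scale * math.cos(sum(wi * xi for wi, xi in zip(w, x)) + b)
            for w, b in zip(self.w, self.b)
        ]

    def update(self, x):
        """Incremental running-mean update; cost does not depend on points seen so far."""
        self.count += 1
        phi = self._features(x)
        self.mean = [m + (p - m) / self.count for m, p in zip(self.mean, phi)]

    def score(self, y):
        """Approximate expected similarity of y to the regular data (higher = more normal)."""
        return sum(p * m for p, m in zip(self._features(y), self.mean))


# Toy stream: regular data clustered tightly around the origin.
data_rng = random.Random(1)
model = ExpectedSimilaritySketch(dim=2)
for _ in range(200):
    model.update([data_rng.gauss(0.0, 0.1), data_rng.gauss(0.0, 0.1)])

near_score = model.score([0.0, 0.0])  # close to the regular data
far_score = model.score([5.0, 5.0])   # far from the regular data
```

Because the model is a single fixed-length vector, each update and each prediction touches only `n_features` numbers, which is how the constant time and memory claims can be read in this sketch.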


A Taxonomy of Deep Convolutional Neural Nets for Computer Vision

Traditional architectures for solving computer vision problems and the degree of success they enjoyed have been heavily reliant on hand-crafted features. However, of late, deep learning techniques have offered a compelling alternative: that of automatically learning problem-specific features. With this new paradigm, every problem in computer vision is now being re-examined from a deep learning perspective. Therefore, it has become important to understand what kinds of deep networks are suitable for a given problem. Although general surveys of this fast-moving paradigm (i.e., deep networks) exist, a survey specific to computer vision is missing. We specifically consider one form of deep network widely used in computer vision: convolutional neural networks (CNNs). We start with ‘AlexNet’ as our base CNN and then examine the broad variations proposed over time to suit different applications. We hope that our recipe-style survey will serve as a guide, particularly for novice practitioners intending to use deep-learning techniques for computer vision.


Pixel Recurrent Neural Networks

Modeling the distribution of natural images is a landmark problem in unsupervised learning. This task requires an image model that is at once expressive, tractable and scalable. We present a deep neural network that sequentially predicts the pixels in an image along the two spatial dimensions. Our method models the discrete probability of the raw pixel values and encodes the complete set of dependencies in the image. Architectural novelties include fast two-dimensional recurrent layers and an effective use of residual connections in deep recurrent networks. We achieve log-likelihood scores on natural images that are considerably better than the previous state of the art. Our main results also provide benchmarks on the diverse ImageNet dataset. Samples generated from the model appear crisp, varied and globally coherent.


On the Latent Variable Interpretation in Sum-Product Networks

Superfluid–Insulator Transition in Strongly Disordered One-dimensional Systems

Branching Rules for Symmetric Hypergeometric Polynomials

Universal Collaboration Strategies for Signal Detection: A Sparse Learning Approach

Rectified Gaussian Scale Mixtures and the Sparse Non-Negative Least Squares Problem

Nonparametric Heterogeneity Testing For Massive Data

Large Deviation Principle For Finite-State Mean Field Interacting Particle Systems

Precise Error Analysis of Regularized M-estimators in High-dimensions

A Mixed-effects Model for Incomplete Data With Batch-Level Abundance-Dependent Missing-Data Mechanism

Divide and Conquer Local Average Regression

Artificial Persuasion in Pedagogical Games

Automatic recognition of element classes and boundaries in the birdsong with variable sequences

A proof of the Square Paths Conjecture

Minimax Lower Bounds for Linear Independence Testing

Ergodicity of inhomogeneous Markov chains through asymptotic pseudotrajectories

Large deviation principle of occupation measures for Non-linear monotone SPDEs

Smooth densities of the laws of perturbed diffusion processes

Fortuitous sequences of flips of the top of a stack of n burnt pancakes for all n>24

Using an Instrumental Variable to Test for Unmeasured Confounding

A Characterization for the Existence of Connected $f$-Factors of $\textit{Large}$ Minimum Degree

Mediation Analysis for Count and Zero-Inflated Count Data without Sequential Ignorability

$LS$-Category of Moment-Angle Manifolds and a Generalization of the Golod Property

Undecidability of the Lambek calculus with a relevant modality

Change-Sensitive Algorithms for Maintaining Maximal Cliques in a Dynamic Graph

Exit Laws of Isotropic Diffusions in Random Environment from Large Domains

An upper bound on the size of diamond-free families of sets

Coexistence and Exclusion of Stochastic Competitive Lotka-Volterra Models

Fast Binary Embedding via Circulant Downsampled Matrix — A Data-Independent Approach

Inference of Latent Network Features via Co-Intersection Representations of Graphs

Interval Estimation for Conditional Failure Rates of Transmission Lines with Limited Samples

Product of Non-Hermitian Random Matrices

Domino Tilings of the Torus

On the conjecture by Demyanov-Ryabova in converting finite exhausters

Crystallizing the hypoplactic monoid: from quasi-Kashiwara operators to the Robinson–Schensted-type correspondence for quasi-ribbon tableaux

Randomly juggling backwards

On detecting and quantification of randomness for one-sided sequences

Discrete quantitative nodal theorem

Synthesis of Gaussian Trees with Correlation Sign Ambiguity: An Information Theoretic Approach

A survey of Elekes-Rónyai-type problems

Stochastic Quantization for the fractional Edwards Measure

On generalized Gaussian free fields and stochastic homogenization

A New Information Theoretical Concept: Information-Weighted Heavy-tailed Distributions

Pairing of Zeros and Critical Points for Random Polynomials

Explicit moments of decision times for single- and double-threshold drift-diffusion processes

Accelerated Nonparametric Maximum Likelihood Density Deconvolution Using Bernstein Polynomial

Sequential Hypothesis Test with Online Usage-Constrained Sensor Selection

Analysis of centrality in sublinear preferential attachment trees via the CMJ branching process

On universal partial words over binary alphabets

Strata of discriminantal arrangements

A new correlation clustering method for cancer mutation analysis

Congruences and recursions for the cubic partitions

Lightweight Fault Tolerance in Large-Scale Distributed Graph Processing

The peak statistics on simsun permutations

On multiplier processes under weak moment assumptions

Meta-analysis of few small studies in orphan diseases

Discrete analogues of Macdonald-Mehta integrals

Finite sample properties of the mean occupancy counts and probabilities

Robust Influence Maximization

On Rényi Entropy Power Inequalities

From survival to extinction of the contact process by the removal of a single edge

Towards Resolving Unidentifiability in Inverse Reinforcement Learning

A Kernel Independence Test for Geographical Language Variation

Character-Level Incremental Speech Recognition with Recurrent Neural Networks

Size Ramsey numbers of stars versus cliques

On the rate of convergence in de Finetti’s representation theorem

Generalizing Prototype Theory: A Formal Quantum Framework

Bayesian Estimation of Bipartite Matchings for Record Linkage

Time-Varying Gaussian Process Bandit Optimization

Testing for Causality in Continuous Time Bayesian Network Models of High-Frequency Data

Pricing Vehicle Sharing with Proximity Information

Conditional distribution variability measures for causality detection

Clustering from Sparse Pairwise Measurements

Catalan triangle numbers and binomial coefficients

A Novel Graph-based Approach for Determining Molecular Similarity

On a Hypergraph Approach to Multistage Group Testing Problems

Second order analysis of geometric functionals of Boolean models

Optimal designs for comparing regression models with correlated observations

Concept Generation in Language Evolution

Long Short-Term Memory-Networks for Machine Reading

A Label Semantics Approach to Linguistic Hedges

On the probability that two elements of a finite semigroup have the same right matrix

On estimating causal controlled direct and mediator effects for count outcomes without assuming sequential ignorability

Empirical Bayes formulation of the elastic net and mixed-norm models: application to the EEG inverse problem

A Robust UCB Scheme for Active Learning in Regression from Strategic Crowds

The Utility of Hedged Assertions in the Emergence of Shared Categorical Labels

Emerging Dimension Weights in a Conceptual Spaces Model of Concept Combination