Whats new on arXiv

What is Information?

Information is a precise concept that can be defined mathematically, but its relationship to what we call ‘knowledge’ is not always made clear. Furthermore, the concepts ‘entropy’ and ‘information’, while deeply related, are distinct and must be used with care, something that is not always achieved in the literature. In this elementary introduction, the concepts of entropy and information are laid out one by one, explained intuitively, but defined rigorously. I argue that a proper understanding of information in terms of prediction is key to a number of disciplines beyond engineering, such as physics and biology.

Nested Orthogonal Arrays

Orthogonal Arrays allow us to test various levels of each factor and balance the different factors so that we can estimate interactions as well as first order effects. There is a trade-off between how well we can sample different levels of each factor and how many interactions we are able to estimate. This paper describes one method to mitigate this trade-off. This method will allow us, with n observations, to sample n levels of each factor and minimize the correlation between the estimates of first order terms and their interactions.

Quegel: A General-Purpose Query-Centric Framework for Querying Big Graphs

Pioneered by Google’s Pregel, many distributed systems have been developed for large-scale graph analytics. These systems expose the user-friendly ‘think like a vertex’ programming interface to users, and exhibit good horizontal scalability. However, these systems are designed for tasks where the majority of graph vertices participate in computation, but are not suitable for processing light-workload graph queries where only a small fraction of vertices need to be accessed. The programming paradigm adopted by these systems can seriously under-utilize the resources in a cluster for graph query processing. In this work, we develop a new open-source system, called Quegel, for querying big graphs, which treats queries as first-class citizens in the design of its computing model. Users only need to specify the Pregel-like algorithm for a generic query, and Quegel processes light-workload graph queries on demand using a novel superstep-sharing execution model to effectively utilize the cluster resources. Quegel further provides a convenient interface for constructing graph indexes, which significantly improve query performance but are not supported by existing graph-parallel systems. Our experiments verified that Quegel is highly efficient in answering various types of graph queries and is up to orders of magnitude faster than existing systems.

Is swarm intelligence able to create mazes?

In this paper, the idea of applying Computational Intelligence in the process of creation board games, in particular mazes, is presented. For two different algorithms the proposed idea has been examined. The results of the experiments are shown and discussed to present advantages and disadvantages.

We present a novel algorithm for anomaly detection on very large datasets and data streams. The method, named EXPected Similarity Estimation (EXPoSE), is kernel-based and able to efficiently compute the similarity between new data points and the distribution of regular data. The estimator is formulated as an inner product with a reproducing kernel Hilbert space embedding and makes no assumption about the type or shape of the underlying data distribution. We show that offline (batch) learning with EXPoSE can be done in linear time and online (incremental) learning takes constant time per instance and model update. Furthermore, EXPoSE can make predictions in constant time, while it requires only constant memory. In addition we propose different methodologies for concept drift adaptation on evolving data streams. On several real datasets we demonstrate that our approach can compete with state of the art algorithms for anomaly detection while being significant faster than techniques with the same discriminant power.

A Taxonomy of Deep Convolutional Neural Nets for Computer Vision

Traditional architectures for solving computer vision problems and the degree of success they enjoyed have been heavily reliant on hand-crafted features. However, of late, deep learning techniques have offered a compelling alternative — that of automatically learning problem-specific features. With this new paradigm, every problem in computer vision is now being re-examined from a deep learning perspective. Therefore, it has become important to understand what kind of deep networks are suitable for a given problem. Although general surveys of this fast-moving paradigm (i.e. deep-networks) exist, a survey specific to computer vision is missing. We specifically consider one form of deep networks widely used in computer vision – convolutional neural networks (CNNs). We start with ‘AlexNet’ as our base CNN and then examine the broad variations proposed over time to suit different applications. We hope that our recipe-style survey will serve as a guide, particularly for novice practitioners intending to use deep-learning techniques for computer vision.

Pixel Recurrent Neural Networks

Modeling the distribution of natural images is a landmark problem in unsupervised learning. This task requires an image model that is at once expressive, tractable and scalable. We present a deep neural network that sequentially predicts the pixels in an image along the two spatial dimensions. Our method models the discrete probability of the raw pixel values and encodes the complete set of dependencies in the image. Architectural novelties include fast two-dimensional recurrent layers and an effective use of residual connections in deep recurrent networks. We achieve log-likelihood scores on natural images that are considerably better than the previous state of the art. Our main results also provide benchmarks on the diverse ImageNet dataset. Samples generated from the model appear crisp, varied and globally coherent.

• On the Latent Variable Interpretation in Sum-Product Networks

• Superfluid–Insulator Transition in Strongly Disordered One-dimensional Systems

• Branching Rules for Symmetric Hypergeometric Polynomials

• Universal Collaboration Strategies for Signal Detection: A Sparse Learning Approach

• Rectified Gaussian Scale Mixtures and the Sparse Non-Negative Least Squares Problem

• Nonparametric Heterogeneity Testing For Massive Data

• Large Deviation Principle For Finite-State Mean Field Interacting Particle Systems

• Precise Error Analysis of Regularized M-estimators in High-dimensions

• A Mixed-effects Model for Incomplete Data With Batch-Level Abundance-Dependent Missing-Data Mechanism

• Divide and Conquer Local Average Regression

• Artificial Persuasion in Pedagogical Games

• Automatic recognition of element classes and boundaries in the birdsong with variable sequences

• A proof of the Square Paths Conjecture

• Minimax Lower Bounds for Linear Independence Testing

• Ergodicity of inhomogeneous Markov chains through asymptotic pseudotrajectories

• Large deviation principle of occupation measures for Non-linear monotone SPDEs

• Smooth densities of the laws of perturbed diffusion processes

• Fortuitous sequences of flips of the top of a stack of n burnt pancakes for all n>24

• Using an Instrumental Variable to Test for Unmeasured Confounding

• A Characterization for the Existence of Connected $f$-Factors of $\textit{ Large}$ Minimum Degree

• Mediation Analysis for Count and Zero-Inflated Count Data without Sequential Ignorability

• $LS$-Category of Moment-Angle Manifolds and a Generalization of the Golod Property

• Undecidability of the Lambek calculus with a relevant modality

• Change-Sensitive Algorithms for Maintaining Maximal Cliques in a Dynamic Graph

• Exit Laws of Isotropic Diffusions in Random Environment from Large Domains

• An upper bound on the size of diamond-free families of sets

• Coexistence and Exclusion of Stochastic Competitive Lotka-Volterra Models

• Fast Binary Embedding via Circulant Downsampled Matrix — A Data-Independent Approach

• Inference of Latent Network Features via Co-Intersection Representations of Graphs

• Interval Estimation for Conditional Failure Rates of Transmission Lines with Limited Samples

• Product of Non-Hermitian Random Matrices

• Domino Tilings of the Torus

• On the conjecture by Demyanov-Ryabova in converting finite exhausters

• Crystallizing the hypoplactic monoid: from quasi-Kashiwara operators to the Robinson–Schensted-type correspondence for quasi-ribbon tableaux

• Randomly juggling backwards

• On detecting and quantification of randomness for one-sided sequences

• Discrete quantitative nodal theorem

• Synthesis of Gaussian Trees with Correlation Sign Ambiguity: An Information Theoretic Approach

• A survey of Elekes-Rónyai-type problems

• Stochastic Quantization for the fractional Edwards Measure

• On generalized Gaussian free fields and stochastic homogenization

• A New Information Theoretical Concept: Information-Weighted Heavy-tailed Distributions

• Pairing of Zeros and Critical Points for Random Polynomials

• Explicit moments of decision times for single- and double-threshold drift-diffusion processes

• Accelerated Nonparametric Maximum Likelihood Density Deconvolution Using Bernstein Polynomial

• Sequential Hypothesis Test with Online Usage-Constrained Sensor Selection

• Analysis of centrality in sublinear preferential attachment trees via the CMJ branching process

• On universal partial words over binary alphabets

• Strata of discriminantal arrangements

• A new correlation clustering method for cancer mutation analysis

• Congruences and recursions for the cubic partitions

• Lightweight Fault Tolerance in Large-Scale Distributed Graph Processing

• The peak statistics on simsun permutations

• On multiplier processes under weak moment assumptions

• Meta-analysis of few small studies in orphan diseases

• Discrete analogues of Macdonald-Mehta integrals

• Finite sample properties of the mean occupancy counts and probabilities

• Robust Influence Maximization

• On Rényi Entropy Power Inequalities

• From survival to extinction of the contact process by the removal of a single edge

• Towards Resolving Unidentifiability in Inverse Reinforcement Learning

• A Kernel Independence Test for Geographical Language Variation

• Character-Level Incremental Speech Recognition with Recurrent Neural Networks

• Size Ramsey numbers of stars versus cliques

• On the rate of convergence in de Finetti’s representation theorem

• Generalizing Prototype Theory: A Formal Quantum Framework

• Bayesian Estimation of Bipartite Matchings for Record Linkage

• Time-Varying Gaussian Process Bandit Optimization

• Testing for Causality in Continuous Time Bayesian Network Models of High-Frequency Data

• Pricing Vehicle Sharing with Proximity Information

• Conditional distribution variability measures for causality detection

• Clustering from Sparse Pairwise Measurements

• Catalan triangle numbers and binomial coefficients

• A Novel Graph-based Approach for Determining Molecular Similarity

• On a Hypergraph Approach to Multistage Group Testing Problems

• Second order analysis of geometric functionals of Boolean models

• Optimal designs for comparing regression models with correlated observations

• Concept Generation in Language Evolution

• Long Short-Term Memory-Networks for Machine Reading

• A Label Semantics Approach to Linguistic Hedges

• On the probability that two elements of a finite semigroup have the same right matrix

• On estimating causal controlled direct and mediator effects for count outcomes without assuming sequential ignorability

• Empirical bayes formulation of the elastic net and mixed-norm models: application to the eeg inverse problem

• A Robust UCB Scheme for Active Learning in Regression from Strategic Crowds

• The Utility of Hedged Assertions in the Emergence of Shared Categorical Labels

• Emerging Dimension Weights in a Conceptual Spaces Model of Concept Combination

AnalytiXon

~ Broaden your Horizon

Whats new on arXiv

Like this:

Leave a ReplyCancel reply

Share this:

Like this:

Leave a ReplyCancel reply

Discover more from AnalytiXon