A deep matrix factorization method for learning attribute representations

Semi-Non-negative Matrix Factorization is a technique that learns a low-dimensional representation of a dataset that lends itself to a clustering interpretation. It is possible that the mapping between this new representation and our original data matrix contains rather complex hierarchical information with implicit lower-level hidden attributes that classical one-level clustering methodologies cannot interpret. In this work we propose a novel model, Deep Semi-NMF, that is able to learn such hidden representations, which lend themselves to a clustering interpretation according to different, unknown attributes of a given dataset. We also present a semi-supervised version of the algorithm, named Deep WSF, that allows the use of (partial) prior information for each of the known attributes of a dataset, so that the model can be used on datasets with mixed attribute knowledge. Finally, we show that our models are able to learn low-dimensional representations that are better suited for both clustering and classification, outperforming Semi-Non-negative Matrix Factorization as well as other state-of-the-art methodologies.

A New Generalized Cassini Determinant

In this paper we extend the notion of the Cassini determinant to the recently introduced hyperfibonacci sequences. We find the Q-matrix for the r-th generation hyperfibonacci numbers and prove an explicit expression for the Cassini determinant of these sequences.
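
For orientation, the classical case that the paper generalizes can be checked numerically. The sketch below covers only the ordinary Fibonacci numbers, not the hyperfibonacci construction: it uses the standard Q-matrix Q = [[1, 1], [1, 0]], whose n-th power is [[F(n+1), F(n)], [F(n), F(n-1)]], so that det(Q^n) = F(n+1)F(n-1) - F(n)^2 = (-1)^n. The function names are illustrative and do not come from the paper.

```python
def mat_mul(a, b):
    """Multiply two 2x2 integer matrices given as nested tuples."""
    return (
        (a[0][0] * b[0][0] + a[0][1] * b[1][0], a[0][0] * b[0][1] + a[0][1] * b[1][1]),
        (a[1][0] * b[0][0] + a[1][1] * b[1][0], a[1][0] * b[0][1] + a[1][1] * b[1][1]),
    )


def q_power(n):
    """Return Q**n for the classical Fibonacci Q-matrix, by repeated multiplication."""
    q = ((1, 1), (1, 0))
    result = ((1, 0), (0, 1))  # 2x2 identity, i.e. Q**0
    for _ in range(n):
        result = mat_mul(result, q)
    return result


def cassini_determinant(n):
    """Determinant of Q**n, which equals F(n+1) * F(n-1) - F(n)**2."""
    (a, b), (c, d) = q_power(n)
    return a * d - b * c


if __name__ == "__main__":
    for n in range(1, 8):
        # Each line shows n, the matrix Q**n, and its determinant.
        print(n, q_power(n), cassini_determinant(n))
```

The determinant alternates in sign because det(Q) = -1 and determinants are multiplicative.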

Comparison of hazard rate estimation in R

We give an overview of eight different software packages and functions available in R for semi- or non-parametric estimation of the hazard rate for right-censored survival data. Of particular interest is the accuracy of the estimation of the hazard rate in the presence of covariates, as well as the user-friendliness of the packages. In addition, we investigate the ability to incorporate covariates under both the proportional and the non-proportional hazards assumptions. We contrast the robustness, variability and precision of the functions through simulations, and then further compare differences between the functions by analyzing the ‘cancer’ and ‘TRACE’ survival data sets available in R, including covariates under the proportional and non-proportional hazards settings.
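
As a point of reference for what non-parametric estimation under right censoring involves, here is a minimal Python sketch of the Nelson-Aalen estimator of the cumulative hazard, H(t) = sum over event times t_i <= t of d_i / n_i, where d_i is the number of events at t_i and n_i the number still at risk. It is a standard textbook building block and is not claimed to be one of the eight R packages compared in the paper; the function name and toy data are hypothetical.

```python
from collections import Counter


def nelson_aalen(times, events):
    """Return [(t, H(t))] at each distinct event time.

    times  -- observed follow-up times
    events -- 1 if the event occurred at that time, 0 if right-censored
    """
    deaths = Counter(t for t, e in zip(times, events) if e == 1)
    cumulative = 0.0
    curve = []
    for t in sorted(deaths):
        # Subjects with observed time >= t are still at risk just before t.
        at_risk = sum(1 for u in times if u >= t)
        cumulative += deaths[t] / at_risk
        curve.append((t, cumulative))
    return curve


if __name__ == "__main__":
    # Toy data: five subjects, two of them censored (event flag 0).
    toy_times = [2, 3, 3, 5, 8]
    toy_events = [1, 1, 0, 1, 0]
    for t, h in nelson_aalen(toy_times, toy_events):
        print(t, round(h, 3))
```

A hazard rate estimate, as opposed to the cumulative hazard, would typically be obtained by smoothing the increments of this step function, for example with a kernel.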

Compatible Value Gradients for Reinforcement Learning of Continuous Deep Policies

This paper proposes GProp, a deep reinforcement learning algorithm for continuous policies with compatible function approximation. The algorithm is based on two innovations. Firstly, we present a temporal-difference based method for learning the gradient of the value-function. Secondly, we present the deviator-actor-critic (DAC) model, which comprises three neural networks that respectively estimate the value function, estimate its gradient, and determine the actor’s policy. We evaluate GProp on two challenging tasks: a contextual bandit problem constructed from nonparametric regression datasets that is designed to probe the ability of reinforcement learning algorithms to accurately estimate gradients; and the octopus arm, a challenging reinforcement learning benchmark. GProp is competitive with fully supervised methods on the bandit task and achieves the best performance to date on the octopus arm.

Continuous control with deep reinforcement learning

We adapt the ideas underlying the success of Deep Q-Learning to the continuous action domain. We present an actor-critic, model-free algorithm based on the deterministic policy gradient that can operate over continuous action spaces. Using the same learning algorithm, network architecture and hyper-parameters, our algorithm robustly solves more than 20 simulated physics tasks, including classic problems such as cartpole swing-up, dexterous manipulation, legged locomotion and car driving. Our algorithm is able to find policies whose performance is competitive with those found by a planning algorithm with full access to the dynamics of the domain and its derivatives. We further demonstrate that for many of the tasks the algorithm can learn policies end-to-end: directly from raw pixel inputs.

De Bruijn entropy and string similarity

We introduce the notion of de Bruijn entropy of an Eulerian quiver and show how the corresponding relative entropy can be applied to practical string similarity problems. This approach explicitly links the combinatorial and information-theoretical properties of words, and its performance is superior to edit distances in many respects and competitive in most others. The computational complexity of our current implementation is parametrically tunable between linear and cubic, and we outline how an optimized linear algebra subroutine can reduce the cubic complexity to approximately linear. Numerous examples are provided, including a realistic application to molecular phylogenetics.

Empirical Reference Distributions for Networks of Different Size

Network analysis has become an increasingly prevalent research tool across a vast range of scientific fields. Here, we focus on the particular issue of comparing network statistics, i.e. graph-level measures of network structural features, across multiple networks that differ in size. Although ‘normalized’ versions of some network statistics exist, we demonstrate via simulation why direct comparison of raw and normalized statistics is often inappropriate. We examine a recent suggestion to normalize network statistics relative to Erdős-Rényi random graphs and demonstrate via simulation how this is an improvement over direct comparison, but still sometimes problematic. We propose a new adjustment method based on a reference distribution constructed as a mixture model of random graphs which reflect the dependence structure exhibited in the observed networks. We show that using simple Bernoulli models as mixture components in this reference distribution can provide adjusted network statistics that are relatively comparable across different network sizes but still describe interesting features of networks, and that this can be accomplished at relatively low computational expense. Finally, we apply this methodology to a collection of co-location networks derived from the Los Angeles Family and Neighborhood Survey activity location data.
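
The baseline idea mentioned in the abstract, normalizing a statistic against random graphs of the same size and density, can be sketched as follows. This is an illustration of that simpler baseline only, not the mixture-model reference distribution the paper proposes. The choice of triangle count as the statistic, the z-score form of the adjustment, and all function names are assumptions made for the example.

```python
import random
from itertools import combinations
from statistics import mean, pstdev


def triangle_count(n, edges):
    """Count triangles in an undirected simple graph on nodes 0..n-1."""
    adj = [set() for _ in range(n)]
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    return sum(
        1
        for a, b, c in combinations(range(n), 3)
        if b in adj[a] and c in adj[a] and c in adj[b]
    )


def er_graph(n, p, rng):
    """Sample a G(n, p) graph: each possible edge is present independently with probability p."""
    return [(u, v) for u, v in combinations(range(n), 2) if rng.random() < p]


def er_z_score(n, edges, statistic=triangle_count, samples=200, seed=0):
    """Standardize a statistic against G(n, p) graphs with matching density.

    Assumes `edges` lists each undirected edge exactly once.
    """
    rng = random.Random(seed)
    density = len(edges) / (n * (n - 1) / 2)
    reference = [statistic(n, er_graph(n, density, rng)) for _ in range(samples)]
    spread = pstdev(reference)
    if spread == 0:
        return 0.0  # degenerate reference distribution, nothing to compare against
    return (statistic(n, edges) - mean(reference)) / spread


if __name__ == "__main__":
    # Four disjoint triangles on 12 nodes: far more clustered than a random graph.
    toy_edges = [(i, j) for base in range(0, 12, 3)
                 for i, j in combinations(range(base, base + 3), 2)]
    print(triangle_count(12, toy_edges), er_z_score(12, toy_edges))
```

A positive score indicates more triangles than density alone would explain. As the abstract notes, such a reference can still be problematic when compared across sizes.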

Fast Exact Shortest Path and Distance Queries on Road Networks with Parametrized Costs

We study a scenario for route planning in road networks, where the objective to be optimized may change between every shortest path query. Since this invalidates many of the known speedup techniques for road networks that are based on preprocessing of shortest path structures, we investigate optimizations exploiting solely the topological structure of networks. We experimentally evaluate our technique on a large set of real-world road networks from various data sources. With lightweight preprocessing our technique answers long-distance queries across continental networks significantly faster than previous approaches to the same problem formulation.
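
To make the problem setting concrete, the sketch below runs plain Dijkstra with edge costs computed at query time. It is the unaccelerated baseline, not the paper's technique. Representing each edge by a cost vector and each query by non-negative weights over that vector is an assumption of this example, as are the function name and toy graph.

```python
import heapq


def shortest_path_cost(graph, source, target, weights):
    """Dijkstra where each edge cost is a weighted sum of its cost vector.

    graph   -- {node: [(neighbor, (c1, c2, ...)), ...]}, non-negative components
    weights -- non-negative coefficients, chosen freshly for every query
    """
    def cost(vector):
        return sum(w * c for w, c in zip(weights, vector))

    best = {source: 0.0}
    queue = [(0.0, source)]
    while queue:
        dist, node = heapq.heappop(queue)
        if node == target:
            return dist
        if dist > best.get(node, float("inf")):
            continue  # stale queue entry, a shorter route was already settled
        for neighbor, vector in graph.get(node, []):
            candidate = dist + cost(vector)
            if candidate < best.get(neighbor, float("inf")):
                best[neighbor] = candidate
                heapq.heappush(queue, (candidate, neighbor))
    return float("inf")  # target unreachable


if __name__ == "__main__":
    # Cost vectors are (distance, time); the numbers are made up.
    toy_graph = {
        "A": [("B", (1, 10)), ("C", (5, 3))],
        "B": [("C", (1, 10))],
    }
    print(shortest_path_cost(toy_graph, "A", "C", (1, 0)))  # optimize distance only
    print(shortest_path_cost(toy_graph, "A", "C", (0, 1)))  # optimize time only
```

Because the weights change per query, any preprocessing that bakes in fixed edge costs becomes invalid, which is why the abstract restricts itself to optimizations that depend only on network topology.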

Gibbs Sampling Strategies for Semantic Perception of Streaming Video Data

Topic modeling of streaming sensor data can be used for high-level perception of the environment by a mobile robot. In this paper we compare various Gibbs sampling strategies for topic modeling of streaming spatiotemporal data, such as video captured by a mobile robot. Compared to previous work on online topic modeling, such as o-LDA and incremental LDA, we show that the proposed technique results in lower online and final perplexity, given the real-time constraints.

Recurrent Reinforcement Learning: A Hybrid Approach

Successful applications of reinforcement learning in real-world problems often require dealing with partially observable states. It is in general very challenging to construct and infer hidden states as they often depend on the agent’s entire interaction history and may require substantial domain knowledge. In this work, we investigate a deep-learning approach to learning the representation of states in partially observable tasks, with minimal prior knowledge of the domain. In particular, we study reinforcement learning with deep neural networks, including RNN and LSTM, which have the desirable property of being able to capture long-term dependency on history, and thus provide an effective way of learning the representation of hidden states. We further develop a hybrid approach that combines the strengths of both supervised learning (for representing hidden states) and reinforcement learning (for optimizing control) with joint training. Extensive experiments based on a KDD Cup 1998 direct mailing campaign problem demonstrate the effectiveness and advantages of the proposed approach, which performs the best across the board.

Sensor Selection by Linear Programming

We learn sensor trees from training data to minimize sensor acquisition costs during test time. Our system adaptively selects sensors at each stage if necessary to make a confident classification. We pose the problem as empirical risk minimization over the choice of trees and node decision rules. We decompose the problem, which is known to be intractable, into combinatorial (tree structures) and continuous (node decision rules) parts and propose to solve them separately. Using training data, we greedily solve for the combinatorial tree structures; for the continuous part, which is a non-convex multilinear objective function, we derive convex surrogate loss functions that are piecewise linear. The resulting problem can be cast as a linear program and has the advantage of guaranteed convergence, global optimality, repeatability and computational efficiency. We show that our proposed approach outperforms the state of the art on a number of benchmark datasets.

A Closer Look at Testing the ‘No-Treatment-Effect’ Hypothesis in a Comparative Experiment

A Remark on the Second Neighborhood Problem

Adaptive estimation for bifurcating Markov chains

An Epsilon Hierarchical Fuzzy Twin Support Vector Regression

Biplane Level Set

Cayley properties of merged Johnson graphs

Characteristic Sign Renewals of Kardar-Parisi-Zhang Fluctuations

Coarse-to-Fine Sequential Monte Carlo for Probabilistic Programs

Complex spherical codes with three inner products

Counting self-avoiding walks on free products of graphs

Density Evolution in the Degree-correlated Stochastic Block Model

Ecological fallacy and covariates: new insights based on multilevel modelling of individual data

Entropic CLT and phase transition in high-dimensional Wishart matrices

Entropic fluctuations in Gaussian dynamical systems

Estimating Drift Parameters in a Fractional Ornstein Uhlenbeck Process with Periodic Mean

Execution-Cache-Memory Performance Model: Introduction and Validation

Explicit Bounds for Nondeterministically Testable Hypergraph Parameters

Extreme value statistics of 2d Gaussian Free Field: effect of finite domains

Fast low-rank estimation by projected gradient descent: General statistical and algorithmic guarantees

Geometric Bijections Between Spanning Trees and Break Divisors

Grid-Based Belief Propagation for Cooperative Localization

Hyperplane mass partitions via relative equivariant obstruction theory

MAP Estimators for Piecewise Continuous Inversion

Multibasic Ehrhart theory

OCR extensions – local identifiers, labeled GUIDs, file IO, and data block partitioning

On computability and disintegration

On edge-decomposition of cubic graphs into copies of the double-star with four edges

On Generalized Sierpiński Graphs

On the Multidimensional Stable Marriage Problem

On uniquely 3-colorable plane graphs without prescribed adjacent faces

Online Buy-at-Bulk Network Design

Performance Bounds for Pairwise Entity Resolution

Phase diagrams of disordered Weyl semimetals

Quantum decay of the supercurrent and intrinsic capacitance of Josephson junctions beyond the tunnel limit

Regular Graphs with Forbidden Subgraphs of $K_n$ with $k$ Edges

Semismooth Newton Coordinate Descent Algorithm for Elastic-Net Penalized Huber Loss and Quantile Regression

Short-Term Wind Speed Forecasting in Germany

Single Particle, Passive Microrheology in Biological Fluids with Drift

Statistical Topology of Perturbed Two-Dimensional Lattices

The extremal process of critical points of the pure $p$-spin spherical spin glass model

The World of Combinatorial Fuzzy Problems and the Efficiency of Fuzzy Approximation Algorithms

The worm process for the Ising model is rapidly mixing

Tournaments, 4-uniform hypergraphs, and an exact extremal result

Two betweenness centrality measures based on Randomized Shortest Paths

Uniform dimension results for fractional Brownian motion

Use it or Lose it: Selective Memory and Forgetting in a Perpetual Learning Machine

Volume polynomials and duality algebras of multi-fans