Whats new on arXiv

Maximum Margin Interval Trees

Learning a regression function using censored or interval-valued output data is an important problem in fields such as genomics and medicine. The goal is to learn a real-valued prediction function, and the training output labels indicate an interval of possible values. Whereas most existing algorithms for this task are linear models, in this paper we investigate learning nonlinear tree models. We propose to learn a tree by minimizing a margin-based discriminative objective function, and we provide a dynamic programming algorithm for computing the optimal solution in log-linear time. We show empirically that this algorithm achieves state-of-the-art speed and prediction accuracy in a benchmark of several data sets.

ALAN: Adaptive Learning for Multi-Agent Navigation

In multi-agent navigation, agents need to move towards their goal locations while avoiding collisions with other agents and static obstacles, often without communication with each other. Existing methods compute motions that are optimal locally but do not account for the aggregated motions of all agents, producing inefficient global behavior especially when agents move in a crowded space. In this work, we develop methods to allow agents to dynamically adapt their behavior to their local conditions. We accomplish this by formulating the multi-agent navigation problem as an action-selection problem, and propose an approach, ALAN, that allows agents to compute time-efficient and collision-free motions. ALAN is highly scalable because each agent makes its own decisions on how to move using a set of velocities optimized for a variety of navigation tasks. Experimental results show that the agents using ALAN, in general, reach their destinations faster than using ORCA, a state-of-the-art collision avoidance framework, the Social Forces model for pedestrian navigation, and a Predictive collision avoidance model.

What Would a Graph Look Like in This Layout? A Machine Learning Approach to Large Graph Visualization

Using different methods for laying out a graph can lead to very different visual appearances, with which the viewer perceives different information. Selecting a ‘good’ layout method is thus important for visualizing a graph. The selection can be highly subjective and dependent on the given task. A common approach to selecting a good layout is to use aesthetic criteria and visual inspection. However, fully calculating various layouts and their associated aesthetic metrics is computationally expensive. In this paper, we present a machine learning approach to large graph visualization based on computing the topological similarity of graphs using graph kernels. For a given graph, our approach can show what the graph would look like in different layouts and estimate their corresponding aesthetic metrics. An important contribution of our work is the development of a new framework to design graph kernels. Our experimental study shows that our estimation calculation is considerably faster than computing the actual layouts and their aesthetic metrics. Also, our graph kernels outperform the state-of-the-art ones in both time and accuracy. In addition, we conducted a user study to demonstrate that the topological similarity computed with our graph kernel matches perceptual similarity assessed by human users.

Deep Learning in Multiple Multistep Time Series Prediction

The project aims to research on combining deep learning specifically Long-Short Memory (LSTM) and basic statistics in multiple multistep time series prediction. LSTM can dive into all the pages and learn the general trends of variation in a large scope, while the well selected medians for each page can keep the special seasonality of different pages so that the future trend will not fluctuate too much from the reality. A recent Kaggle competition on 145K Web Traffic Time Series Forecasting [1] is used to thoroughly illustrate and test this idea.

Sum-Product-Quotient Networks

We present a novel tractable generative model that extends Sum-Product Networks (SPNs) and significantly boost their power. We call it Sum-Product-Quotient Networks (SPQNs), whose core concept is to incorporate conditional distributions into the model by direct computation using quotient nodes, e.g. $P(A|B){=}\frac{P(A,B)}{P(B)}$ . We provide sufficient conditions for the tractability of SPQNs that generalize and relax the decomposable and complete tractability conditions of SPNs. These relaxed conditions give rise to an exponential boost to the expressive efficiency of our model, i.e. we prove that there are distributions which SPQNs can compute efficiently but require SPNs to be of exponential size. Thus, we narrow the gap in expressivity between tractable graphical models and other Neural Network-based generative models.

Querying Best Paths in Graph Databases

Querying graph databases has recently received much attention. We propose a new approach to this problem, which balances competing goals of expressive power, language clarity and computational complexity. A distinctive feature of our approach is the ability to express properties of minimal (e.g. shortest) and maximal (e.g. most valuable) paths satisfying given criteria. To express complex properties in a modular way, we introduce labelling-generating ontologies. The resulting formalism is computationally attractive — queries can be answered in non-deterministic logarithmic space in the size of the database.

Self-Taught Support Vector Machine

In this paper, a new approach for classification of target task using limited labeled target data as well as enormous unlabeled source data is proposed which is called self-taught learning. The target and source data can be drawn from different distributions. In the previous approaches, covariate shift assumption is considered where the marginal distributions p(x) change over domains and the conditional distributions p(y|x) remain the same. In our approach, we propose a new objective function which simultaneously learns a common space T(.) where the conditional distributions over domains p(T(x)|y) remain the same and learns robust SVM classifiers for target task using both source and target data in the new representation. Hence, in the proposed objective function, the hidden label of the source data is also incorporated. We applied the proposed approach on Caltech-256, MSRC+LMO datasets and compared the performance of our algorithm to the available competing methods. Our method has a superior performance to the successful existing algorithms.

New efficient algorithms for multiple change-point detection with kernels

Several statistical approaches based on reproducing kernels have been proposed to detect abrupt changes arising in the full distribution of the observations and not only in the mean or variance. Some of these approaches enjoy good statistical properties (oracle inequality, \ldots). Nonetheless, they have a high computational cost both in terms of time and memory. This makes their application difficult even for small and medium sample sizes ( $n< 10^4$ ). This computational issue is addressed by first describing a new efficient and exact algorithm for kernel multiple change-point detection with an improved worst-case complexity that is quadratic in time and linear in space. It allows dealing with medium size signals (up to $n \approx 10^5$ ). Second, a faster but approximation algorithm is described. It is based on a low-rank approximation to the Gram matrix. It is linear in time and space. This approximation algorithm can be applied to large-scale signals ( $n \geq 10^6$ ). These exact and approximation algorithms have been implemented in \texttt{R} and \texttt{C} for various kernels. The computational and statistical performances of these new algorithms have been assessed through empirical experiments. The runtime of the new algorithms is observed to be faster than that of other considered procedures. Finally, simulations confirmed the higher statistical accuracy of kernel-based approaches to detect changes that are not only in the mean. These simulations also illustrate the flexibility of kernel-based approaches to analyze complex biological profiles made of DNA copy number and allele B frequencies. An R package implementing the approach will be made available on github.

Is Epicurus the father of Reinforcement Learning?

The Epicurean Philosophy is commonly thought as simplistic and hedonistic. Here I discuss how this is a misconception and explore its link to Reinforcement Learning. Based on the letters of Epicurus, I construct an objective function for hedonism which turns out to be equivalent of the Reinforcement Learning objective function when omitting the discount factor. I then discuss how Plato and Aristotle ‘s views that can be also loosely linked to Reinforcement Learning, as well as their weaknesses in relationship to it. Finally, I emphasise the close affinity of the Epicurean views and the Bellman equation.

• Joint Image Filtering with Deep Convolutional Networks
• Subsampling large graphs and invariance in networks
• Machine Learning Bell Nonlocality in Quantum Many-body Systems
• Gapless insulating edges of dirty interacting topological insulators
• Regression-aware decompositions
• Universal layered permutations
• Monotonicity axioms in approval-based multi-winner voting rules
• Local Convergence of Proximal Splitting Methods for Rank Constrained Problems
• Multipoint Estimates for Radial and Whole-plane SLE
• Solutions of Quadratic First-Order ODEs applied to Computer Vision Problems
• Stochastic Gradient Descent in Continuous Time: A Central Limit Theorem
• Short and Long Ranged Impurities in Fractional Quantum Hall Systems
• Physical-Layer Network Coding with Multiple Antennas: An Enabling Technology for Smart Cities
• On the local stability of semidefinite relaxations
• What can spin glass theory and analogies tell us about ferroic glasses?
• Modeling and sensitivity analysis methodology for hybrid dynamical systems
• Recurrence sequences connected with the $m$–ary partition function and their divisibility properties
• Measurement Context Extraction from Text: Discovering Opportunities and Gaps in Earth Science
• Harvested Power Maximization in QoS-Constrained MIMO SWIPT with Generic RF Harvesting Model
• Explaining Trained Neural Networks with Semantic Web Technologies: First Steps
• Improved Coresets for Kernel Density Estimates
• Efficient Data-Driven Geologic Feature Detection from Pre-stack Seismic Measurements using Randomized Machine-Learning Algorithm
• Modular decomposition of transitive graphs and transitively orienting their complements
• DisSent: Sentence Representation Learning from Explicit Discourse Relations
• One-sided solutions for optimal stopping problems with logconcave reward functions
• Learning Koopman Invariant Subspaces for Dynamic Mode Decomposition
• Using Context Events in Neural Network Models for Event Temporal Status Identification
• A Finite Element Computational Framework for Active Contours on Graphs
• Asymptotic theory for maximum likelihood estimates in reduced-rank multivariate generalised linear models
• A Unified Neural Network Approach for Estimating Travel Time and Distance for a Taxi Trip
• Enhanced Mobile Computing Experience with Cloud Offloading
• Designing Low-Complexity Heavy-Traffic Delay-Optimal Load Balancing Schemes: Theory to Algorithms
• Fast initial guess estimation for digital image correlation
• Linear Programming Bounds for Distributed Storage Codes
• Utility maximization problem under transaction costs: optimal dual processes and stability
• The Inverse Gamma-Gamma Prior for Optimal Posterior Contraction and Multiple Hypothesis Testing
• Fast, Accurate and Fully Parallelizable Digital Image Correlation
• On the Power of Tree-Depth for Fully Polynomial FPT Algorithms
• Evaluating discriminatory accuracy of models using partial risk-scores in two-phase studies
• Sign-Constrained Regularized Loss Minimization
• An Augmented Nonlinear Complex LMS for Digital Self-Interference Cancellation in Full-Duplex Direct-Conversion Transceivers
• Marginal sequential Monte Carlo for doubly intractable models
• Securing UAV Communications Via Trajectory Optimization
• Provably Fair Representations
• The Sierpiński gasket as the Martin boundary of a non-isotropic Markov chain
• Multi-Batch Experience Replay for Fast Convergence of Continuous Action Control
• The Location-Allocation Model for Multi-Classification-Yard Location Problem in a Railway Network
• Revisiting the Design Issues of Local Models for Japanese Predicate-Argument Structure Analysis
• Non-abelian finite groups whose character sums are invariant but are not Cayley isomorphism
• Robust Parameter Estimation of Regression Model with AR(p) Error Terms
• Arguing Machines: Perception-Control System Redundancy and Edge Case Discovery in Real-World Autonomous Driving
• Geometry of large Boltzmann outerplanar maps
• An Improved Naive Bayes Classifier-based Noise Detection Technique for Classifying User Phone Call Behavior
• Effects of Images with Different Levels of Familiarity on EEG
• Markerless visual servoing on unknown objects for humanoid robot platforms
• Pure Operation-Based Replicated Data Types
• V1: A Visual Query Language for Property Graphs
• Risk assessment using suprema data
• VOIDD: automatic vessel of intervention dynamic detection in PCI procedures
• Multimodal Observation and Interpretation of Subjects Engaged in Problem Solving
• Clusters of Driving Behavior from Observational Smartphone Data
• Convolutional Attention-based Seq2Seq Neural Network for End-to-End ASR
• Supersaturated sparse graphs and hypergraphs
• Optimal Control of PDEs using Occupation Measures and SDP Relaxations
• Subjectively Interesting Subgroup Discovery on Real-valued Targets
• The co-Pieri rule for Kronecker coefficients
• Wild Bootstrapping Rank-Based Procedures: Multiple Testing in Nonparametric Split-Plot Designs
• On the Containment Problem for Linear Sets
• Hierarchical Convolutional-Deconvolutional Neural Networks for Automatic Liver and Tumor Segmentation
• Measuring the spatial balance of a sample: A new measure based on the Moran’s I index
• The Trees of Hanoi
• Lattice point visibility on generalized lines of sight
• On a Generalization of the Arcsine Law
• Bayesian Modeling of the Structural Connectome for Studying Alzheimer Disease
• Inference for partial correlation when data are missing not at random
• On free Generalized Inverse Gaussian distributions
• A probabilistic view on the deterministic mutation-selection equation: dynamics, equilibria, and ancestry via individual lines of descent
• Balancing expression dags for more efficient lazy adaptive evaluation
• Additivity of Information in Multilayer Networks via Additive Gaussian Noise Transforms
• Towards Scalable Spectral Clustering via Spectrum-Preserving Sparsification
• Particle Filtering for Stochastic Navier-Stokes Signal Observed with Linear Additive Noise
• Throughput Region of Spatially Correlated Interference Packet Networks
• Constructing Directed Cayley Graphs of Small Diameter: A Potent Solovay-Kitaev Procedure
• Auto Analysis of Customer Feedback using CNN and GRU Network
• Riordan graphs I: Structural properties
• Finite size corrections to the Parisi overlap function in the GREM
• On the resolution of l0 minimization problems via alternating Lagrangian schemes
• Deep Imitation Learning for Complex Manipulation Tasks from Virtual Reality Teleoperation
• Analysis of planar ornament patterns via motif asymmetry assumption and local connections
• A generalization of Erdős’ matching conjecture
• Hard and Easy Instances of L-Tromino Tilings
• Secret-Key Generation in Many-to-One Networks: An Integrated Game-Theoretic and Information-Theoretic Approach
• Progressive Representation Adaptation for Weakly Supervised Object Localization

AnalytiXon

~ Broaden your Horizon

Whats new on arXiv

Like this:

Leave a ReplyCancel reply

Share this:

Like this:

Leave a ReplyCancel reply

Discover more from AnalytiXon