Whats new on arXiv

Enhancements of linked data expressiveness for ontologies

The semantic web has received many contributions of researchers as ontologies which, in this context, i.e. within RDF linked data, are formalized conceptualizations that might use different protocols, such as RDFS, OWL DL and OWL FULL. In this article, we describe new expressive techniques which were found necessary after elaborating dozens of OWL ontologies for the scientific academy, the State and the civil society. They consist in: 1) stating possible uses a property might have without incurring into axioms or restrictions; 2) assigning a level of priority for an element (class, property, triple); 3) correct depiction in diagrams of relations between classes, between individuals which are imperative, and between individuals which are optional; 4) a convenient association between OWL classes and SKOS concepts. We propose specific rules to accomplish these enhancements and exemplify both its use and the difficulties that arise because these techniques are currently not established as standards to the ontology designer.

Stochastic Conjugate Gradient Algorithm with Variance Reduction

Conjugate gradient methods are a class of important methods for solving linear equations and nonlinear optimization. In our work, we propose a new stochastic conjugate gradient algorithm with variance reduction (CGVR) and prove its linear convergence with the Fletcher and Revves method for strongly convex and smooth functions. We experimentally demonstrate that the CGVR algorithm converges faster than its counterparts for six large-scale optimization problems that may be convex, non-convex or non-smooth, and its AUC (Area Under Curve) performance with $L2$ -regularized $L2$ -loss is comparable to that of LIBLINEAR but with significant improvement in computational efficiency.

Diagonal Likelihood Ratio Test for Equality of Mean Vectors in High-Dimensional Data

We propose a likelihood ratio test framework for testing normal mean vectors in high-dimensional data under two common scenarios: the one-sample test and the two-sample test with equal covariance matrices. We derive the test statistics under the assumption that the covariance matrices follow a diagonal matrix structure. In comparison with the diagonal Hotelling’s tests, our proposed test statistics display some interesting characteristics. In particular, they are a summation of the log-transformed squared t-statistics rather than a direct summation of those components. More importantly, to derive the asymptotic normality of our test statistics under the null and local alternative hypotheses, we do not require the assumption that the covariance matrix follows a diagonal matrix structure. As a consequence, our proposed test methods are very flexible and can be widely applied in practice. Finally, simulation studies and a real data analysis are also conducted to demonstrate the advantages of our likelihood ratio test method.

Regularization via Mass Transportation

The goal of regression and classification methods in supervised learning is to minimize the empirical risk, that is, the expectation of some loss function quantifying the prediction error under the empirical distribution. When facing scarce training data, overfitting is typically mitigated by adding regularization terms to the objective that penalize hypothesis complexity. In this paper we introduce new regularization techniques using ideas from distributionally robust optimization, and we give new probabilistic interpretations to existing techniques. Specifically, we propose to minimize the worst-case expected loss, where the worst case is taken over the ball of all (continuous or discrete) distributions that have a bounded transportation distance from the (discrete) empirical distribution. By choosing the radius of this ball judiciously, we can guarantee that the worst-case expected loss provides an upper confidence bound on the loss on test data, thus offering new generalization bounds. We prove that the resulting regularized learning problems are tractable and can be tractably kernelized for many popular loss functions. We validate our theoretical out-of-sample guarantees through simulated and empirical experiments.

Generalization Tower Network: A Novel Deep Neural Network Architecture for Multi-Task Learning

Deep learning (DL) advances state-of-the-art reinforcement learning (RL), by incorporating deep neural networks in learning representations from the input to RL. However, the conventional deep neural network architecture is limited in learning representations for multi-task RL (MT-RL), as multiple tasks can refer to different kinds of representations. In this paper, we thus propose a novel deep neural network architecture, namely generalization tower network (GTN), which can achieve MT-RL within a single learned model. Specifically, the architecture of GTN is composed of both horizontal and vertical streams. In our GTN architecture, horizontal streams are used to learn representation shared in similar tasks. In contrast, the vertical streams are introduced to be more suitable for handling diverse tasks, which encodes hierarchical shared knowledge of these tasks. The effectiveness of the introduced vertical stream is validated by experimental results. Experimental results further verify that our GTN architecture is able to advance the state-of-the-art MT-RL, via being tested on 51 Atari games.

Fine-Grained Pattern Matching Over Streaming Time Series

Processing of streaming time series data from sensors with lower latency and limited computing resource comes to a critical problem as the growth of Industry 4.0 and Industry Internet of Things(IIoT). To tackle the real world challenge in this area, like equipment health monitoring by comparing the incoming data stream with known fault patterns, we formulate a new problem, called ‘fine-grained pattern matching’. It allows users to define varied deviations to different segments of a given pattern, and fuzzy breakpoint of adjunct segments, which urges the dramatically increased complexity against traditional pattern matching problem over stream. In this paper, we propose a novel 2-phase approach to solve this problem. In pruning phase, we propose ELB(Equal Length Block) Representation and BSP (Block-Skipping Pruning) policy, which efficiently filter the unmatched subsequence with the guarantee of no-false dismissals. In post-processing phase, we provide an algorithm to further examine the possible matches in linear complexity. We conducted an extensive experimental evaluation on synthetic and real-world datasets, which illustrates that our algorithm outperforms the brute-force method and MSM, a multi-step filter mechanism over the multi-scaled representation, by orders of magnitude.

Inverse Reinforcement Learning Under Noisy Observations

We consider the problem of performing inverse reinforcement learning when the trajectory of the expert is not perfectly observed by the learner. Instead, a noisy continuous-time observation of the trajectory is provided to the learner. This problem exhibits wide-ranging applications and the specific application we consider here is the scenario in which the learner seeks to penetrate a perimeter patrolled by a robot. The learner’s field of view is limited due to which it cannot observe the patroller’s complete trajectory. Instead, we allow the learner to listen to the expert’s movement sound, which it can also use to estimate the expert’s state and action using an observation model. We treat the expert’s state and action as hidden data and present an algorithm based on expectation maximization and maximum entropy principle to solve the non-linear, non-convex problem. Related work considers discrete-time observations and an observation model that does not include actions. In contrast, our technique takes expectations over both state and action of the expert, enabling learning even in the presence of extreme noise and broader applications.

Progressive Growing of GANs for Improved Quality, Stability, and Variation

We describe a new training methodology for generative adversarial networks. The key idea is to grow both the generator and discriminator progressively: starting from a low resolution, we add new layers that model increasingly fine details as training progresses. This both speeds the training up and greatly stabilizes it, allowing us to produce images of unprecedented quality, e.g., CelebA images at 1024^2. We also propose a simple way to increase the variation in generated images, and achieve a record inception score of 8.80 in unsupervised CIFAR10. Additionally, we describe several implementation details that are important for discouraging unhealthy competition between the generator and discriminator. Finally, we suggest a new metric for evaluating GAN results, both in terms of image quality and variation. As an additional contribution, we construct a higher-quality version of the CelebA dataset.

Advanced LSTM: A Study about Better Time Dependency Modeling in Emotion Recognition

Long short-term memory (LSTM) is normally used in recurrent neural network (RNN) as basic recurrent unit. However,conventional LSTM assumes that the state at current time step depends on previous time step. This assumption constraints the time dependency modeling capability. In this study, we propose a new variation of LSTM, advanced LSTM (A-LSTM), for better temporal context modeling. We employ A-LSTM in weighted pooling RNN for emotion recognition. The A-LSTM outperforms the conventional LSTM by 5.5% relatively. The A-LSTM based weighted pooling RNN can also complement the state-of-the-art emotion classification framework. This shows the advantage of A-LSTM.

Tensor network language model

We propose a new statistical model suitable for machine learning tasks of systems with long distance correlations such as human languages. The model is based on directed acyclic graph decorated by multi-linear tensor maps in the vertices and vector spaces in the edges, called tensor network. Such tensor networks have been previously employed for effective numerical computation of the renormalization group flow on the space of effective quantum field theories and lattice models of statistical mechanics. We provide explicit algebro-geometric analysis of the parameter moduli space for tree graphs, discuss model properties and applications such as statistical translation.

Matrix Completion Methods for Causal Panel Data Models

In this paper we develop new methods for estimating causal effects in settings with panel data, where a subset of units are exposed to a treatment during a subset of periods, and the goal is estimating counterfactual (untreated) outcomes for the treated unit/period combinations. We develop a class of estimators that uses the observed elements of the matrix of control outcomes corresponding to untreated unit/periods to predict the ‘missing’ elements of the matrix, corresponding to treated units/periods. The approach estimates a matrix that well-approximates the original (incomplete) matrix, but has lower complexity according to a matrix norm, where we consider the family of Schatten norms based on the singular values of the matrix. The proposed methods have attractive computational properties. From a technical perspective, we generalize results from the matrix completion literature by allowing the patterns of missing data to have a time series dependency structure. We also present new insights concerning the connections between the interactive fixed effects models and the literatures on program evaluation under unconfoundedness as well as on synthetic control methods. If there are few time periods and many units, our method approximates a regression approach where counterfactual outcomes are estimated through a regression of current outcomes on lagged outcomes for the same unit. In contrast, if there are few units and many periods, our proposed method approximates a synthetic control estimator where counterfactual outcomes are estimated through a regression of the lagged outcomes for the treated unit on lagged outcomes for the control units. The advantage of our proposed method is that it moves seamlessly between these two different approaches, utilizing both cross-sectional and within-unit patterns in the data.

• Machine learning vortices at the Kosterlitz-Thouless transition
• A Note on Lower Bounds for Induced Ramsey Numbers
• Gradient Sparsification for Communication-Efficient Distributed Optimization
• Spectral invariants and playing hide-and-seek on surfaces
• Energy Clustering
• Understanding Grounded Language Learning Agents
• How far did we get in face spoofing detection?
• Bounds for traces of Hecke operators and applications to modular and elliptic curves over a finite field
• Exoplanet Atmosphere Retrieval using Multifractal Analysis of Reflectance Spectra
• Phase Transitions in Image Denoising via Sparsely Coding Convolutional Neural Networks
• Computing the Line Index of Balance Using Integer Programming Optimisation
• Exact results of the limited penetrable horizontal visibility graph associated to random time series and its application
• On Quantum Stabilizer Codes derived from Local Frobenius Rings
• Bayesian Nonparametric Models for Biomedical Data Analysis
• Global analysis of a geometric PDAV controller by means of coordinate-free linearization
• The spread of fire on a random multigraph
• Orthogonal Precoding for Ultra Reliable Wireless Communication Links
• Low-Complexity Equalization for Orthogonal Time and Frequency Signaling (OTFS)
• A De Giorgi Iteration-based Approach for the Establishment of ISS Properties of a Class of Semi-linear Parabolic PDEs with Boundary and In-domain Disturbances
• Exploiting Commutativity For Practical Fast Replication
• Evaluation of Treatment Effect Modification by Biomarkers Measured Pre- and Post-randomization in the Presence of Non-monotone Missingness
• Optimal Ensemble Control of Loads in Distribution Grids with Network Constraints
• Image Compression: Sparse Coding vs. Bottleneck Autoencoders
• Pairwise Ising model analysis of human cortical neuron recordings
• SEGMENT3D: A Web-based Application for Collaborative Segmentation of 3D images used in the Shoot Apical Meristem
• Data-driven Feature Sampling for Deep Hyperspectral Classification and Segmentation
• CANDiS: Coupled & Attention-Driven Neural Distant Supervision
• New asymptotic properties for $M$-estimators
• Probabilistic Couplings for Probabilistic Reasoning
• The Error Probability of Random Fourier Features is Dimensionality Independent
• Audiovisual Analytics Vocabulary and Ontology (AAVO): initial core and example expansion
• Strong Ramsey Games in Unbounded Time
• Edge-Based Wedge Sampling to Estimate Triangle Counts in Very Large Graphs
• Improving Deep Learning by Inverse Square Root Linear Units (ISRLUs)
• Sparse recovery using the preservability of the null space property under random measurements
• A Single-Channel Architecture for Algebraic Integer Based 8$\times$8 2-D DCT Computation
• Optimizing Caching Policy at Base Stations by Exploiting User Preference and Spatial Locality
• Variance Reduced Value Iteration and Faster Algorithms for Solving Markov Decision Processes
• Near-Optimal Straggler Mitigation for Distributed Gradient Methods
• Counting partitions of a fixed genus
• Zeroth Order Nonconvex Multi-Agent Optimization over Networks
• PoseTrack: A Benchmark for Human Pose Estimation and Tracking
• Flexible Multi-Group Single-Carrier Modulation: Optimal Subcarrier Grouping and Rate Maximization
• Online linear optimization with the log-determinant regularizer
• Deterministic Approximate Methods for Maximum Consensus Robust Fitting
• Deep Learning for Accelerated Ultrasound Imaging
• Flow Analysis in High Energy Nuclear Collision – A New Approach of Chaos-based Complex Network Perspective
• On distance r-dominating and 2r-independent sets in sparse graphs
• Declarative vs Rule-based Control for Flocking Dynamics
• Online Learning of Power Transmission Dynamics
• Estimating the coefficients of variation of Freundlich parameters with weighted least squares analysis
• Single Iteration Conditional Based DSE Considering Spatial and Temporal Correlation
• A note on faithful coupling of Markov chains
• Convolutional neural networks on irregular domains through approximate translations on inferred graphs
• Generating global network structures by triad types
• Distributional Reinforcement Learning with Quantile Regression
• Group Fairness in Multiwinner Voting
• Direction of arrival estimation for multiple sound sources using convolutional recurrent neural network
• Transform-Invariant Non-Parametric Clustering of Covariance Matrices and its Application to Unsupervised Joint Segmentation and Action Discovery
• Recovery of Structured Signals with Prior Information via Maximizing Correlation
• A lower bound on the acyclic matching number of subcubic graphs
• New type of chimera and mutual synchronization of spatiotemporal structures in two coupled ensembles of nonlocally interacting chaotic maps
• Time crystal platform: from quasi-crystal structures in time to systems with exotic interactions
• Edge-as-a-Service: Towards Distributed Cloud Architectures
• External Memory Pipelining Made Easy With TPIE
• On modeling vagueness and uncertainty in data-to-text systems through fuzzy sets
• SceneFlowFields: Dense Interpolation of Sparse Scene Flow Correspondences
• On distance and Laplacian matrices of trees with matrix weights
• An efficient SAT formulation for learning multiple criteria non-compensatory sorting rules from examples
• On the Optimal Reconstruction of Partially Observed Functional Data
• Image matting with normalized weight and semi-supervised learning
• Bayesian Pairwise Estimation Under Dependent Informative Sampling
• Lyndon Array Construction during Burrows-Wheeler Inversion
• Hyperopic Cops and Robbers
• Incorporating Reality into Social Choice
• Beyond Finite Layer Neural Networks: Bridging Deep Architectures and Numerical Differential Equations
• RRT-CoLearn: towards kinodynamic planning without numerical trajectory optimization
• Quantifying the Estimation Error of Principal Components
• Bitangents of non-smooth tropical quartics
• Gaussian fluctuations for high-dimensional random projections of $\ell_p^n$-balls
• On the density of the odd values of the partition function, II: An infinite conjectural framework
• Towards a new paradigm for assistive technology at home: research challenges, design issues and performance assessment
• SGD Learns Over-parameterized Networks that Provably Generalize on Linearly Separable Data
• Combining Aspects of Genetic Algorithms with Weighted Recommender Hybridization
• High-Quality Facial Photo-Sketch Synthesis Using Multi-Adversarial Networks
• A general representation of delta-normal sets to sublevels of convex functions
• Enhanced Biologically Inspired Model for Image Recognition Based on a Novel Patch Selection Method with Moment
• Orbitopal fixing for the full orbitope and application to the Unit Commitment Problem
• Dual Path Networks for Multi-Person Human Pose Estimation
• Detection and Analysis of Human Emotions through Voice and Speech Pattern Processing
• New Methods for Metadata Extraction from Scientific Literature
• Polar Coding for the Cognitive Interference Channel with Confidential Messages
• An integral quadratic constraint framework for real-time steady-state optimization of linear time-invariant systems
• On denoising modulo 1 samples of a function
• BridgeNets: Student-Teacher Transfer Learning Based on Recursive Neural Networks and its Application to Distant Speech Recognition
• Unified Functorial Signal Representation III: Foundations, Redundancy, $L^0$ and $L^2$ functors
• Not-So-Random Features
• Metastability of hard-core dynamics on bipartite graphs
• Minimal Reachability is Hard To Approximate
• Optimized quantum f-divergences and data processing
• Sequential Empirical Coordination Under an Output Entropy Constraint

AnalytiXon

~ Broaden your Horizon

Whats new on arXiv

Like this:

Leave a ReplyCancel reply

Share this:

Like this:

Leave a ReplyCancel reply

Discover more from AnalytiXon