Whats new on arXiv

Boundary Optimizing Network (BON)

Despite all the success that deep neural networks have seen in classifying certain datasets, the challenge of finding optimal solutions that generalize well still remains. In this paper, we propose the Boundary Optimizing Network (BON), a new approach to generalization for deep neural networks when used for supervised learning. Given a classification network, we propose to use a collaborative generative network that produces new synthetic data points in the form of perturbations of original data points. In this way, we create a data support around each original data point which prevents decision boundaries to pass too close to the original data points, i.e. prevents overfitting. To prevent catastrophic forgetting during training, we propose to use a variation of Memory Aware Synapses to optimize the generative networks. On the Iris dataset, we show that the BON algorithm creates better decision boundaries when compared to a network regularized by the popular dropout scheme.

Stable Marriage with Multi-Modal Preferences

We introduce a generalized version of the famous Stable Marriage problem, now based on multi-modal preference lists. The central twist herein is to allow each agent to rank its potentially matching counterparts based on more than one ‘evaluation mode’ (e.g., more than one criterion); thus, each agent is equipped with multiple preference lists, each ranking the counterparts in a possibly different way. We introduce and study three natural concepts of stability, investigate their mutual relations and focus on computational complexity aspects with respect to computing stable matchings in these new scenarios. Mostly encountering computational hardness (NP-hardness), we can also spot few islands of tractability and make a surprising connection to the \textsc{Graph Isomorphism} problem.

Sequential Preference-Based Optimization

Many real-world engineering problems rely on human preferences to guide their design and optimization. We present PrefOpt, an open source package to simplify sequential optimization tasks that incorporate human preference feedback. Our approach extends an existing latent variable model for binary preferences to allow for observations of equivalent preference from users.

Lifelong Learning for Sentiment Classification

This paper proposes a novel lifelong learning (LL) approach to sentiment classification. LL mimics the human continuous learning process, i.e., retaining the knowledge learned from past tasks and use it to help future learning. In this paper, we first discuss LL in general and then LL for sentiment classification in particular. The proposed LL approach adopts a Bayesian optimization framework based on stochastic gradient descent. Our experimental results show that the proposed method outperforms baseline methods significantly, which demonstrates that lifelong learning is a promising research direction.

Less is More: Culling the Training Set to Improve Robustness of Deep Neural Networks

Deep neural networks are vulnerable to adversarial examples. Prior defenses attempted to make deep networks more robust by either improving the network architecture or adding adversarial examples into the training set, with their respective limitations. We propose a new direction. Motivated by recent research that shows that outliers in the training set have a high negative influence on the trained model, our approach makes the model more robust by detecting and removing outliers in the training set without modifying the network architecture or requiring adversarial examples. We propose two methods for detecting outliers based on canonical examples and on training errors, respectively. After removing the outliers, we train the classifier with the remaining examples to obtain a sanitized model. Our evaluation shows that the sanitized model improves classification accuracy and forces the attacks to generate adversarial examples with higher distortions. Moreover, the Kullback-Leibler divergence from the output of the original model to that of the sanitized model allows us to distinguish between normal and adversarial examples reliably.

Convexification of Neural Graph

Traditionally, most complex intelligence architectures are extremely non-convex, which could not be well performed by convex optimization. However, this paper decomposes complex structures into three types of nodes: operators, algorithms and functions. Further, iteratively propagating from node to node along edge, we prove that ‘regarding the neural graph without triangles, it is nearly convex in each variable, when the other variables are fixed.’ In fact, the non-convex properties stem from triangles and functions, which could be transformed to be convex with our proposed \textit{\textbf{convexification inequality}}. In conclusion, we generally depict the landscape for the objective of neural graph and propose the methodology to convexify neural graph.

Denotation Extraction for Interactive Learning in Dialogue Systems

This paper presents a novel task using real user data obtained in human-machine conversation. The task concerns with denotation extraction from answer hints collected interactively in a dialogue. The task is motivated by the need for large amounts of training data for question answering dialogue system development, where the data is often expensive and hard to collect. Being able to collect denotation interactively and directly from users, one could improve, for example, natural understanding components on-line and ease the collection of the training data. This paper also presents introductory results of evaluation of several denotation extraction models including attention-based neural network approaches.

An efficient K -means clustering algorithm for massive data

The analysis of continously larger datasets is a task of major importance in a wide variety of scientific fields. In this sense, cluster analysis algorithms are a key element of exploratory data analysis, due to their easiness in the implementation and relatively low computational cost. Among these algorithms, the K -means algorithm stands out as the most popular approach, besides its high dependency on the initial conditions, as well as to the fact that it might not scale well on massive datasets. In this article, we propose a recursive and parallel approximation to the K -means algorithm that scales well on both the number of instances and dimensionality of the problem, without affecting the quality of the approximation. In order to achieve this, instead of analyzing the entire dataset, we work on small weighted sets of points that mostly intend to extract information from those regions where it is harder to determine the correct cluster assignment of the original instances. In addition to different theoretical properties, which deduce the reasoning behind the algorithm, experimental results indicate that our method outperforms the state-of-the-art in terms of the trade-off between number of distance computations and the quality of the solution obtained.

On variance estimation for Bayesian variable selection

Consider the problem of high dimensional variable selection for the Gaussian linear model when the unknown error variance is also of interest. In this paper, we argue that the use conjugate continuous shrinkage priors for Bayesian variable selection can have detrimental consequences for such error variance estimation. Instead, we recommend the use of priors which treat the regression coefficients and error variance as independent a priori. We revisit the canonical reference for invariant priors, Jeffreys (1961), and highlight a caveat with their use that Jeffreys himself noted. For the case study of Bayesian ridge regression, we demonstrate that these scale-invariant priors severely underestimate the variance. More generally, we discuss how these priors also interfere with the mechanics of the Bayesian global-local shrinkage framework. With these insights, we extend the Spike-and-Slab Lasso of Rockova and George (2016) to the unknown variance case, using an independent prior for the error variance. Our procedure outperforms both alternative penalized likelihood methods and the fixed variance case on simulated data.

• Classical Discrete Time Crystals
• Exploiting random lead times for significant inventory cost savings
• Towards General Distributed Resource Selection
• Quiddity sequences for $\mathrm{SL}_3$-frieze patterns
• Evorus: A Crowd-powered Conversational Assistant Built to Automate Itself Over Time
• Violable Contracts and Governance for Blockchain Applications
• A Unified Enumeration of 1-dimension Garden Algebras and Valise Adinkras
• Graph-Based Radio Resource Management for Vehicular Networks
• Generative Sensing: Transforming Unreliable Sensor Data for Reliable Recognition
• Towards Multi-Object Detection and Tracking in Urban Scenario under Uncertainties
• Term Relevance Feedback for Contextual Named Entity Retrieval
• Joint Resource Allocation and Antenna Selection In the Uplink of OFDMA Networks
• Traveling salesman problem across dense cities
• Minimum spanning trees across dense cities
• Mass-structure of weighted real trees
• Novel Impossibility Results for Group-Testing
• Modeling urbanization patterns with generative adversarial networks
• Duality of Channel Encoding and Decoding – Part II: Rate-1 Non-binary Convolutional Codes
• End-to-end detection-segmentation network with ROI convolution
• Near Maximum Likelihood Decoding with Deep Learning
• Brain MRI Super Resolution Using 3D Deep Densely Connected Neural Networks
• Data Augmentation for Brain-Computer Interfaces: Analysis on Event-Related Potentials Data
• Gradient Method in Hilbert-Besov Spaces for the Optimal Control of Parabolic Free Boundary Problems
• Modeling sepsis progression using hidden Markov models
• Enhancing Performance of Random Caching in Large-Scale Wireless Networks with Multiple Receive Antennas
• Improved Capacity Upper Bounds for the Discrete-Time Poisson Channel
• Fusion of ANN and SVM Classifiers for Network Attack Detection
• Semiconservative random walks in weak sense
• Equilibrium problems on Riemannian manifolds with applications
• SketchyGAN: Towards Diverse and Realistic Sketch to Image Synthesis
• Compressing Deep Neural Networks: A New Hashing Pipeline Using Kac’s Random Walk Matrices
• TextBoxes++: A Single-Shot Oriented Scene Text Detector
• GIFT: Guided and Interpretable Factorization for Tensors – An Application to Large-Scale Multi-platform Cancer Analysis
• Role of short-range order in manipulating light absorption in disordered media
• Practical Quantum Appointment Scheduling
• Restricted sum formula for finite and symmetric multiple zeta values
• Adversarial Spheres
• Rogue Signs: Deceiving Traffic Sign Recognition with Malicious Ads and Logos
• Minimum Throughput Maximization in UAV-Aided Wireless Powered Communication Networks
• UAV-Aided Wireless Communication Designs With Propulsion Energy Limitations
• Dynamic Pricing and Energy Management Strategy for EV Charging Stations under Uncertainties
• Spectral Radius of $\{0, 1\}$-Tensor with Prescribed Number of Ones
• Better and Simpler Error Analysis of the Sinkhorn-Knopp Algorithm for Matrix Scaling
• Balanced Truncation Model Reduction of a Nonlinear Cable-Mass PDE System with Interior Damping
• Tight Bounds on the Round Complexity of the Distributed Maximum Coverage Problem
• Dendritic-Inspired Processing Enables Bio-Plausible STDP in Compound Binary Synapses
• Aviation Time Minimization of UAV for Data Collection over Wireless Sensor Networks
• Recoverability for Holevo’s just-as-good fidelity
• DeepTraffic: Driving Fast through Dense Traffic with Deep Reinforcement Learning
• Unifying particle-based and continuum models of hillslope evolution with a probabilistic scaling technique
• Adaptive Boolean Monotonicity Testing in Total Influence Time
• Test Error Estimation after Model Selection Using Validation Error
• k-connectivity of Random Graphs and Random Geometric Graphs in Node Fault Model
• Novel Methods for Enhancing the Performance of Genetic Algorithms
• Beam domain secure transmission for massive MIMO communications
• Biomedical Question Answering via Weighted Neural Network Passage Retrieval
• Data assimilation and parameter estimation for a multiscale stochastic system with alpha-stable Levy noise
• Partial regularity for solutions to subelliptic eikonal equations
• Distributed Deep Reinforcement Learning: Learn how to play Atari games in 21 minutes
• Chemical order in Ge-Ga-Sb-Se glasses
• Scalable high-resolution forecasting of sparse spatiotemporal events with kernel methods: a winning solution to the NIJ ‘Real-Time Crime Forecasting Challenge’
• Channel Estimation with Systematic Polar Codes
• Linear Codes for Broadcasting with Noisy Side Information
• Uniform decomposition of probability measures: quantization, classification, rate of convergence
• Multi-Terminal Codes Using Constrained-Random-Number Generators
• Generalized Fano-Type Inequality for Countably Infinite Systems with List-Decoding
• Analysis of Massive MIMO and Base Station Cooperation in an Indoor Scenario
• Symbol-by-Symbol Maximum Likelihood Detection for Cooperative Molecular Communication
• CANDY: Conditional Adversarial Networks based Fully End-to-End System for Single Image Haze Removal
• What did Ryser Conjecture?
• Spatial Lattice Modulation for MIMO Systems
• 99\% Revenue via Enhanced Competition
• A Stitch in Time Saves Nine — SPARQL querying of Property Graphs using Gremlin Traversals
• The DMT classification of real and quaternionic lattice codes
• Data Augmentation by Pairing Samples for Images Classification
• An Improved Analysis of Least Squares Superposition Codes with Bernoulli Dictionary
• Deep Gaussian Processes with Decoupled Inducing Inputs
• Penultimate modeling of spatial extremes: statistical inference for max-infinitely divisible processes
• Adversarial Deep Learning for Robust Detection of Binary Encoded Malware
• A method for Bayesian regression modelling of composition data
• Bayesian Fitting of Dirichlet Type I and II Distributions
• Quasi-shuffle algebras and renormalisation of rough differential equations
• Asynchronous distributed algorithm for seeking generalized Nash equilibria
• The set of vertices with positive curvature in a planar graph with nonnegative curvature
• Global fluctuations for 1D log-gas dynamics. (2) Covariance kernel and support
• Search on Secondary Attributes in Geo-Distributed Systems
• Power spectral density of a single Brownian trajectory: What one can and cannot learn from it
• Ascents in Non-Negative Lattice Paths
• Exact asymptotics for a multi-timescale model
• Inverse images of stable Lévy processes
• Development of a spectral source inverse model by using generalized polynomial chaos
• Nonconvex Lagrangian-Based Optimization: Monitoring Schemes and Global Convergence
• Energy and Air Quality Management in a Subway Station using Stochastic Dynamic Optimization
• Pattern selection in a ring of Kuramoto oscillators
• Topical Stance Detection for Twitter: A Two-Phase LSTM Model Using Attention
• EBIC: an artificial intelligence-based parallel biclustering algorithm for pattern discovery
• Placement Delivery Array Design for Combination Networks with Edge Caching
• Meta-Tracker: Fast and Robust Online Adaptation for Visual Object Trackers
• Sales forecasting and risk management under uncertainty in the media industry
• Deconvolving RNA Base Pairing Signals
• Probabilistic Prognostic Estimates of Survival in Metastatic Cancer Patients (PPES-Met) Utilizing Free-Text Clinical Narratives
• Characterizing Granular Networks Using Topological Metrics
• Secure Communication over 1-2-1 Networks
• Verticalization of bacterial biofilms
• Multi-threaded Sparse Matrix-Matrix Multiplication for Many-Core and GPU Architectures
• Capacity Results for Finite-Field X-Channels with Delayed CSIT
• A one-phase interior point method for nonconvex optimization
• Asymmetry Hurts: Private Information Retrieval Under Asymmetric Traffic Constraints
• Quadrature Compound: An approximating family of distributions

AnalytiXon

~ Broaden your Horizon

Whats new on arXiv

Like this:

Leave a ReplyCancel reply

Share this:

Like this:

Leave a ReplyCancel reply

Discover more from AnalytiXon