AdapterNet – learning input transformation for domain adaptation

Deep neural networks have demonstrated impressive performance in various machine learning tasks. However, they are notoriously sensitive to changes in data distribution. Often, even a slight change in the distribution can lead to a drastic reduction in performance. Artificially augmenting the data may help to some extent but, in most cases, fails to achieve model invariance to the data distribution. Some examples where this sub-class of domain adaptation can be valuable are various imaging modalities such as thermal imaging, X-ray, ultrasound, and MRI, where changes in acquisition parameters or acquisition device manufacturer result in a different representation of the same input. Our work shows that standard finetuning fails to adapt the model in certain important cases. We propose a novel method of adapting to a new data source, and demonstrate near perfect adaptation on a customized ImageNet benchmark.

Classification Stability for Sparse-Modeled Signals

Despite their impressive performance, deep convolutional neural networks (CNNs) have been shown to be sensitive to small adversarial perturbations. These nuisances, which one can barely notice, are powerful enough to fool sophisticated and well performing classifiers, leading to ridiculous misclassification results. In this paper we analyze the stability of state-of-the-art classification machines to adversarial perturbations, where we assume that the signals belong to the (possibly multi-layer) sparse representation model. We start with convolutional sparsity and then proceed to its multi-layered version, which is tightly connected to CNNs. Our analysis links the stability of the classification under noise to the underlying structure of the signal, quantified by the sparsity of its representation under a fixed dictionary. Our claims can be translated to a practical regularization term that provides a new interpretation of the robustness of Parseval Networks. Also, the proposed theory justifies the increased stability of the recently emerging layered basis pursuit architectures, when compared to the classic forward-pass.

Human-in-the-Loop Interpretability Prior

We often desire our models to be interpretable as well as accurate. Prior work on optimizing models for interpretability has relied on easy-to-quantify proxies for interpretability, such as sparsity or the number of operations required. In this work, we optimize for interpretability by directly including humans in the optimization loop. We develop an algorithm that minimizes the number of user studies to find models that are both predictive and interpretable and demonstrate our approach on several data sets. Our human subjects results show trends towards different proxy notions of interpretability on different datasets, which suggests that different proxies are preferred on different tasks.

Absolutely Zero Evidence

Statistical analysis is often used to evaluate the evidence for or against scientific hypotheses, and various statistics (e.g., p-values, likelihood ratios, Bayes factors) are interpreted as measures of evidence strength. Here I consider evidence measurement from the point of view of representational measurement theory, and argue that familiar evidence statistics do not conform to any legitimate measurement scale type. I then consider the notion of an absolute scale for evidence measurement, in a sense to be defined, focusing particularly on the notion of absolute 0 evidence, which turns out to be something other than what one might have expected.

Classification with imperfect training labels

We study the effect of imperfect training data labels on the performance of classification methods. In a general setting, where the probability that an observation in the training dataset is mislabelled may depend on both the feature vector and the true label, we bound the excess risk of an arbitrary classifier trained with imperfect labels in terms of its excess risk for predicting a noisy label. This reveals conditions under which a classifier trained with imperfect labels remains consistent for classifying uncorrupted test data points. Furthermore, under stronger conditions, we derive detailed asymptotic properties for the popular k-nearest neighbour (knn), Support Vector Machine (SVM) and Linear Discriminant Analysis (LDA) classifiers. One consequence of these results is that the knn and SVM classifiers are robust to imperfect training labels, in the sense that the rate of convergence of the excess risks of these classifiers remains unchanged; in fact, it even turns out that in some cases, imperfect labels may improve the performance of these methods. On the other hand, the LDA classifier is shown to be typically inconsistent in the presence of label noise unless the prior probabilities of each class are equal. Our theoretical results are supported by a simulation study.

Elastic Functional Principal Component Regression

We study regression using functional predictors in situations where these functions contain both phase and amplitude variability. In other words, the functions are misaligned due to errors in time measurements, and these errors can significantly degrade both model estimation and prediction performance. The current techniques either ignore the phase variability, or handle it via pre-processing, i.e., use an off-the-shelf technique for functional alignment and phase removal. We develop a functional principal component regression model that takes a comprehensive approach to handling phase and amplitude variability. The model utilizes a mathematical representation of the data known as the square-root slope function. These functions preserve the $\mathbf{L}^2$ norm under warping and are ideally suited for simultaneous estimation of regression and warping parameters. Using both simulated and real-world data sets, we demonstrate our approach and evaluate its prediction performance relative to current models. In addition, we propose an extension to functional logistic and multinomial logistic regression.
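
As a reading aid, the norm-preservation property can be written out. The definitions below are the ones commonly used in the elastic functional data analysis literature and are an assumption here, since the abstract does not spell them out: $f$ is an absolutely continuous function on $[0,1]$, $q$ is its square-root slope function, and $\gamma$ is a warping function (an increasing map of $[0,1]$ onto itself).

$$ q(t) = \operatorname{sign}\big(\dot f(t)\big)\sqrt{|\dot f(t)|}, \qquad (q,\gamma)(t) = q\big(\gamma(t)\big)\sqrt{\dot\gamma(t)} $$

$$ \|(q,\gamma)\|^2 = \int_0^1 q\big(\gamma(t)\big)^2\,\dot\gamma(t)\,dt = \int_0^1 q(s)^2\,ds = \|q\|^2 $$

Here $(q,\gamma)$ is the square-root slope function of the warped function $f\circ\gamma$, and the middle equality is the change of variables $s=\gamma(t)$.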

Representational Power of ReLU Networks and Polynomial Kernels: Beyond Worst-Case Analysis

There has been a large amount of interest, both in the past and particularly recently, into the power of different families of universal approximators, e.g. ReLU networks, polynomials, rational functions. However, current research has focused almost exclusively on understanding this problem in a worst-case setting, e.g. bounding the error of the best infinity-norm approximation in a box. In this setting a high-degree polynomial is required to even approximate a single ReLU. However, in real applications with high dimensional data we expect it is only important to approximate the desired function well on certain relevant parts of its domain. With this motivation, we analyze the ability of neural networks and polynomial kernels of bounded degree to achieve good statistical performance on a simple, natural inference problem with sparse latent structure. We give almost-tight bounds on the performance of both neural networks and low degree polynomials for this problem. Our bounds for polynomials involve new techniques which may be of independent interest and show major qualitative differences with what is known in the worst-case setting.

Learning Under Distributed Features

This work studies the problem of learning under both large data and large feature space scenarios. The feature information is assumed to be spread across agents in a network, where each agent observes some of the features. Through local cooperation, the agents are supposed to interact with each other to solve the inference problem and converge towards the global minimizer of the empirical risk. We study this problem exclusively in the primal domain, and propose new and effective distributed solutions with guaranteed convergence to the minimizer. This is achieved by combining a dynamic diffusion construction, a pipeline strategy, and variance-reduced techniques. Simulation results illustrate the conclusions.

Kernel embedding of maps for sequential Bayesian inference: The variational mapping particle filter

In this work, a novel sequential Monte Carlo filter is introduced which aims at efficient sampling of high-dimensional state spaces with a limited number of particles. Particles are pushed forward from the prior to the posterior density using a sequence of mappings that minimizes the Kullback-Leibler divergence between the posterior and the sequence of intermediate densities. The sequence of mappings represents a gradient flow. A key ingredient of the mappings is that they are embedded in a reproducing kernel Hilbert space, which allows for a practical and efficient algorithm. The embedding provides a direct means to calculate the gradient of the Kullback-Leibler divergence leading to quick convergence using well-known gradient-based stochastic optimization algorithms. Evaluation of the method is conducted in the chaotic Lorenz-63 system, the Lorenz-96 system, which is a coarse prototype of atmospheric dynamics, and an epidemic model that describes cholera dynamics. No resampling is required in the mapping particle filter even for long recursive sequences. The number of effective particles remains close to the total number of particles in all the experiments.

Lovasz Convolutional Networks

Semi-supervised learning on graph structured data has received significant attention with the recent introduction of graph convolution networks (GCN). While traditional methods have focused on optimizing a loss augmented with Laplacian regularization framework, GCNs perform an implicit Laplacian type regularization to capture local graph structure. In this work, we propose Lovasz convolutional networks (LCNs), which are capable of incorporating global graph properties. LCNs achieve this by utilizing Lovasz’s orthonormal embeddings of the nodes. We analyse local and global properties of graphs and demonstrate settings where LCNs tend to work better than GCNs. We validate the proposed method on standard random graph models such as stochastic block models (SBM) and certain community structure based graphs where LCNs outperform GCNs and learn more intuitive embeddings. We also perform extensive binary and multi-class classification experiments on real world datasets to demonstrate the effectiveness of LCNs. In addition to simple graphs, we also demonstrate the use of LCNs on hypergraphs by identifying settings where they are expected to work better than GCNs.

Hamiltonian Variational Auto-Encoder

Variational Auto-Encoders (VAEs) have become very popular techniques to perform inference and learning in latent variable models as they allow us to leverage the rich representational power of neural networks to obtain flexible approximations of the posterior of latent variables as well as tight evidence lower bounds (ELBOs). Combined with stochastic variational inference, this provides a methodology scaling to large datasets. However, for this methodology to be practically efficient, it is necessary to obtain low-variance unbiased estimators of the ELBO and its gradients with respect to the parameters of interest. While the use of Markov chain Monte Carlo (MCMC) techniques such as Hamiltonian Monte Carlo (HMC) has been previously suggested to achieve this [23, 26], the proposed methods require specifying reverse kernels which have a large impact on performance. Additionally, the resulting unbiased estimator of the ELBO for most MCMC kernels is typically not amenable to the reparameterization trick. We show here how to optimally select reverse kernels in this setting and, by building upon Hamiltonian Importance Sampling (HIS) [17], we obtain a scheme that provides low-variance unbiased estimators of the ELBO and its gradients using the reparameterization trick. This allows us to develop a Hamiltonian Variational Auto-Encoder (HVAE). This method can be reinterpreted as a target-informed normalizing flow [20] which, within our context, only requires a few evaluations of the gradient of the sampled likelihood and trivial Jacobian calculations at each iteration.

Lightweight Probabilistic Deep Networks

Even though probabilistic treatments of neural networks have a long history, they have not found widespread use in practice. Sampling approaches are often too slow already for simple networks. The size of the inputs and the depth of typical CNN architectures in computer vision only compound this problem. Uncertainty in neural networks has thus been largely ignored in practice, despite the fact that it may provide important information about the reliability of predictions and the inner workings of the network. In this paper, we introduce two lightweight approaches to making supervised learning with probabilistic deep networks practical: First, we suggest probabilistic output layers for classification and regression that require only minimal changes to existing networks. Second, we employ assumed density filtering and show that activation uncertainties can be propagated in a practical fashion through the entire network, again with minor changes. Both probabilistic networks retain the predictive power of the deterministic counterpart, but yield uncertainties that correlate well with the empirical error induced by their predictions. Moreover, the robustness to adversarial examples is significantly increased.

Bayesian Inference with Anchored Ensembles of Neural Networks, and Application to Reinforcement Learning

The use of ensembles of neural networks (NNs) for the quantification of predictive uncertainty is widespread. However, the current justification is intuitive rather than analytical. This work proposes one minor modification to the normal ensembling methodology, which we prove allows the ensemble to perform Bayesian inference, hence converging to the corresponding Gaussian Process as both the total number of NNs, and the size of each, tend to infinity. This working paper provides early-stage results in a reinforcement learning setting, analysing the practicality of the technique for an ensemble with a small, finite number of NNs. Using the uncertainty estimates the ensemble produces to govern the exploration-exploitation process results in steadier, more stable learning.

Wasserstein Variational Inference

This paper introduces Wasserstein variational inference, a new form of approximate Bayesian inference based on optimal transport theory. Wasserstein variational inference uses a new family of divergences that includes both f-divergences and the Wasserstein distance as special cases. The gradients of the Wasserstein variational loss are obtained by backpropagating through the Sinkhorn iterations. This technique results in a very stable likelihood-free training method that can be used with implicit distributions and probabilistic programs. Using the Wasserstein variational inference framework, we introduce several new forms of autoencoders and test their robustness and performance against existing variational autoencoding techniques.
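
The Sinkhorn iterations mentioned above can be sketched in a few lines. This is a plain-Python illustration of the standard entropy-regularized optimal transport scaling updates, not the paper's implementation: the function name `sinkhorn`, the `epsilon` default, and the iteration count are assumptions, and no automatic differentiation is performed here (the abstract's method backpropagates through these same updates).

```python
import math


def sinkhorn(cost, a, b, epsilon=0.5, n_iters=200):
    """Approximate an entropy-regularized transport plan between histograms a and b.

    cost[i][j] is the cost of moving mass from bin i of a to bin j of b.
    epsilon is the regularization strength (an illustrative default).
    """
    n, m = len(a), len(b)
    # Gibbs kernel derived from the cost matrix.
    kernel = [[math.exp(-cost[i][j] / epsilon) for j in range(m)] for i in range(n)]
    u = [1.0] * n
    v = [1.0] * m
    for _ in range(n_iters):
        # Rescale rows so that the plan's row sums match a ...
        u = [a[i] / sum(kernel[i][j] * v[j] for j in range(m)) for i in range(n)]
        # ... then rescale columns so that its column sums match b.
        v = [b[j] / sum(kernel[i][j] * u[i] for i in range(n)) for j in range(m)]
    # The plan is diag(u) * kernel * diag(v).
    return [[u[i] * kernel[i][j] * v[j] for j in range(m)] for i in range(n)]


plan = sinkhorn([[0.0, 1.0], [1.0, 0.0]], [0.7, 0.3], [0.4, 0.6])
```

Because the last update is on the columns, the column sums of `plan` match `b` up to rounding, while the row sums match `a` only approximately until the iterations have converged.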

Retraining-Based Iterative Weight Quantization for Deep Neural Networks

Model compression has gained a lot of attention due to its ability to reduce hardware resource requirements significantly while maintaining accuracy of DNNs. Model compression is especially useful for memory-intensive recurrent neural networks because a smaller memory footprint is crucial not only for reducing storage requirements but also for fast inference operations. Quantization is known to be an effective model compression method and researchers are interested in minimizing the number of bits to represent parameters. In this work, we introduce an iterative technique to apply quantization, presenting a high compression ratio without any modifications to the training algorithm. In the proposed technique, weight quantization is followed by retraining the model with full precision weights. We show that iterative retraining generates new sets of weights which can be quantized with decreasing quantization loss at each iteration. We also show that quantization is efficiently able to leverage pruning, another effective model compression method. Implementation issues on combining the two methods are also addressed. Our experimental results demonstrate that an LSTM model using 1-bit quantized weights is sufficient for the PTB dataset without any accuracy degradation, while previous methods demand at least 2-4 bits for quantized weights.
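
The quantize-then-retrain loop can be illustrated with a toy sketch. Everything below is an assumption made for illustration: the 1-bit scheme (sign times mean absolute value, a common binary-weight rule), the squared-error quantization loss, and the hypothetical `retrain` callback standing in for full-precision training. The abstract does not specify the paper's actual quantizer or schedule.

```python
def quantize_1bit(weights):
    """Map every weight to +alpha or -alpha, with alpha the mean absolute weight."""
    alpha = sum(abs(w) for w in weights) / len(weights)
    return [alpha if w >= 0 else -alpha for w in weights]


def quantization_loss(weights):
    """Squared error between the full-precision weights and their 1-bit version."""
    quantized = quantize_1bit(weights)
    return sum((w - q) ** 2 for w, q in zip(weights, quantized))


def iterative_quantization(weights, retrain, n_rounds):
    """Alternate quantization and retraining, recording the loss before each round.

    `retrain` is a hypothetical callback: it receives the quantized weights and
    returns new full-precision weights (in practice, a few epochs of training).
    """
    losses = []
    for _ in range(n_rounds):
        losses.append(quantization_loss(weights))
        weights = retrain(quantize_1bit(weights))
    return weights, losses


# With an identity "retraining" step the weights stay at the quantized values,
# so the second round has nothing left to lose.
final_weights, losses = iterative_quantization([1.0, -3.0], lambda w: w, 2)
```

In this toy run `losses` is `[2.0, 0.0]`; with real retraining the weights drift away from the quantization grid between rounds, and the abstract's claim is that the loss recorded at each round keeps decreasing.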

MBA: Mini-Batch AUC Optimization

Area under the receiver operating characteristics curve (AUC) is an important metric for a wide range of signal processing and machine learning problems, and scalable methods for optimizing AUC have recently been proposed. However, handling very large datasets remains an open challenge for this problem. This paper proposes a novel approach to AUC maximization, based on sampling mini-batches of positive/negative instance pairs and computing U-statistics to approximate a global risk minimization problem. The resulting algorithm is simple, fast, and learning-rate free. We show that the number of samples required for good performance is independent of the number of pairs available, which is a quadratic function of the positive and negative instances. Extensive experiments show the practical utility of the proposed method.
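
To make the pairwise view concrete, the sketch below computes the empirical AUC as a U-statistic over positive/negative score pairs and averages it over sampled mini-batches. It only illustrates the estimator being approximated; the function names, the sampling with replacement, and the batch sizes are assumptions, and the paper's learning-rate-free optimization step is not reproduced.

```python
import random


def auc_u_statistic(pos_scores, neg_scores):
    """Fraction of (positive, negative) pairs ranked correctly; ties count as 1/2."""
    total = 0.0
    for p in pos_scores:
        for n in neg_scores:
            if p > n:
                total += 1.0
            elif p == n:
                total += 0.5
    return total / (len(pos_scores) * len(neg_scores))


def minibatch_auc(pos_scores, neg_scores, batch_size, n_batches, seed=0):
    """Average the pairwise AUC over mini-batches sampled with replacement."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_batches):
        pos_batch = [rng.choice(pos_scores) for _ in range(batch_size)]
        neg_batch = [rng.choice(neg_scores) for _ in range(batch_size)]
        estimates.append(auc_u_statistic(pos_batch, neg_batch))
    return sum(estimates) / len(estimates)


full = auc_u_statistic([0.9, 0.3], [0.5, 0.1])  # 3 of the 4 pairs are ordered correctly
approx = minibatch_auc([0.9, 0.3], [0.5, 0.1], batch_size=8, n_batches=50)
```

The full computation touches every pair, which is the quadratic cost the abstract refers to; each mini-batch touches only `batch_size * batch_size` pairs.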

Distributed Statistical Inference for Massive Data

This paper considers distributed statistical inference for general symmetric statistics, which encompass the U-statistics and the M-estimators, in the context of massive data where the data can be stored at multiple platforms in different locations. In order to facilitate effective computation and to avoid expensive communication among different platforms, we formulate distributed statistics which can be conducted over smaller data blocks. The statistical properties of the distributed statistics are investigated in terms of the mean square error of estimation and asymptotic distributions with respect to the number of data blocks. In addition, we propose two distributed bootstrap algorithms which are computationally effective and are able to capture the underlying distribution of the distributed statistics. Numerical simulation and real data applications of the proposed approaches are provided to demonstrate the empirical performance.

FairGAN: Fairness-aware Generative Adversarial Networks

Fairness-aware learning is increasingly important in data mining. Discrimination prevention aims to prevent discrimination in the training data before it is used to conduct predictive analysis. In this paper, we focus on fair data generation that ensures the generated data is discrimination free. Inspired by generative adversarial networks (GAN), we present fairness-aware generative adversarial networks, called FairGAN, which are able to learn a generator producing fair data and also preserving good data utility. Compared with the naive fair data generation models, FairGAN further ensures the classifiers which are trained on generated data can achieve fair classification on real data. Experiments on a real dataset show the effectiveness of FairGAN.

Value Propagation Networks

We present Value Propagation (VProp), a parameter-efficient differentiable planning module built on Value Iteration which can successfully be trained using reinforcement learning to solve unseen tasks, has the capability to generalize to larger map sizes, and can learn to navigate in dynamic environments. Furthermore, we show that the module enables learning to plan when the environment also includes stochastic elements, providing a cost-efficient learning system to build low-level size-invariant planners for a variety of interactive navigation problems. We evaluate on static and dynamic configurations of MazeBase grid-worlds, with randomly generated environments of several different sizes, and on a StarCraft navigation scenario, with more complex dynamics, and pixels as input.
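
As background for the Value Iteration procedure that VProp is built on, here is classic tabular value iteration on a tiny grid-world. This is the textbook algorithm, not the VProp module: the grid encoding (`#` for walls, `.` for free cells), the reward of 1 for entering the goal, the discount `gamma`, and the fixed number of sweeps are all illustrative assumptions.

```python
def value_iteration(grid, goal, gamma=0.9, n_iters=50):
    """Return a table of state values for a deterministic grid-world.

    grid is a list of equal-length strings; goal is a (row, col) terminal cell.
    """
    rows, cols = len(grid), len(grid[0])
    values = [[0.0] * cols for _ in range(rows)]
    moves = [(-1, 0), (1, 0), (0, -1), (0, 1)]
    for _ in range(n_iters):
        new_values = [[0.0] * cols for _ in range(rows)]
        for r in range(rows):
            for c in range(cols):
                if grid[r][c] == '#' or (r, c) == goal:
                    continue  # walls and the terminal goal keep value 0
                best = float('-inf')
                for dr, dc in moves:
                    nr, nc = r + dr, c + dc
                    blocked = not (0 <= nr < rows and 0 <= nc < cols) or grid[nr][nc] == '#'
                    if blocked:
                        nr, nc = r, c  # bumping into a wall or the border: stay put
                    reward = 1.0 if (nr, nc) == goal else 0.0
                    # Bellman backup: immediate reward plus discounted next-state value.
                    best = max(best, reward + gamma * values[nr][nc])
                new_values[r][c] = best
        values = new_values
    return values


values = value_iteration(["..."], goal=(0, 2))
```

On this three-cell corridor the cell next to the goal gets value 1 and the cell two steps away gets `gamma` times that, showing how value spreads outward from the goal one sweep at a time.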

Learning From Less Data: Diversified Subset Selection and Active Learning in Image Classification Tasks

State-of-the-art computer vision techniques based on supervised machine learning are in general data hungry, and pose the challenges of inadequate computing resources and the high cost of human labeling effort. Training data subset selection and active learning techniques have been proposed as possible solutions to these challenges respectively. A special class of subset selection functions naturally model notions of diversity, coverage and representation and they can be used to eliminate redundancy and thus lend themselves well to training data subset selection. They can also help improve the efficiency of active learning in further reducing human labeling efforts by selecting a subset of the examples obtained using the conventional uncertainty sampling based techniques. In this work we empirically demonstrate the effectiveness of two diversity models, namely the Facility-Location and Disparity-Min models for training-data subset selection and reducing labeling effort. We do this for a variety of computer vision tasks including Gender Recognition, Scene Recognition and Object Recognition. Our results show that subset selection done in the right way can add 2-3% in accuracy on existing baselines, particularly in the case of less training data. This allows the training of complex machine learning models (like Convolutional Neural Networks) with much less training data while incurring minimal performance loss.
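
The Facility-Location model named above can be illustrated with the usual greedy selection rule. The sketch assumes the common form of the objective, f(S) = sum over all items i of the largest similarity between i and any selected item, with non-negative similarities; the function names and the toy similarity matrix are hypothetical, and the abstract does not state which optimizer the authors use.

```python
def facility_location(similarity, subset):
    """Score a subset by how well its best member represents each item."""
    if not subset:
        return 0.0
    return sum(max(row[j] for j in subset) for row in similarity)


def greedy_select(similarity, k):
    """Pick k items, each time adding the one that raises the score the most.

    Ties are broken in favor of the lowest index.
    """
    selected = []
    remaining = set(range(len(similarity)))
    for _ in range(k):
        best = max(sorted(remaining),
                   key=lambda j: facility_location(similarity, selected + [j]))
        selected.append(best)
        remaining.remove(best)
    return selected


# Toy data: items 0-1 form one cluster, items 2-4 another with item 2 at its center.
similarity = [
    [10, 9, 1, 1, 1],
    [9, 10, 1, 1, 1],
    [1, 1, 10, 8, 8],
    [1, 1, 8, 10, 5],
    [1, 1, 8, 5, 10],
]
chosen = greedy_select(similarity, 2)
```

Here the greedy rule first takes the center of the larger cluster and then a representative of the other cluster, rather than two near-duplicates, which is the redundancy-removing behavior the abstract describes.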

Semi-Implicit Variational Inference

Semi-implicit variational inference (SIVI) is introduced to expand the commonly used analytic variational distribution family, by mixing the variational parameter with a flexible distribution. This mixing distribution can assume any density function, explicit or not, as long as independent random samples can be generated via reparameterization. Not only does SIVI expand the variational family to incorporate highly flexible variational distributions, including implicit ones that have no analytic density functions, but also sandwiches the evidence lower bound (ELBO) between a lower bound and an upper bound, and further derives an asymptotically exact surrogate ELBO that is amenable to optimization via stochastic gradient ascent. With a substantially expanded variational family and a novel optimization algorithm, SIVI is shown to closely match the accuracy of MCMC in inferring the posterior in a variety of Bayesian inference tasks.

Core Conflictual Relationship: Text Mining to Discover What and When

Following a detailed presentation of the Core Conflictual Relationship Theme (CCRT), the objective is to find relevant methods for what has been described as verbalization and visualization of data. This is also termed data mining, text mining, and knowledge discovery in data. The Correspondence Analysis methodology, also termed Geometric Data Analysis, is shown in a case study to be comprehensive and revealing. Computational efficiency depends on how the analysis process is structured. For both the illustrative and revealing aspects of the case study here, relatively extensive dream reports are used. This Geometric Data Analysis confirms the validity of the CCRT method.

Differentiable Particle Filters: End-to-End Learning with Algorithmic Priors

We present differentiable particle filters (DPFs): a differentiable implementation of the particle filter algorithm with learnable motion and measurement models. Since DPFs are end-to-end differentiable, we can efficiently train their models by optimizing end-to-end state estimation performance, rather than proxy objectives such as model accuracy. DPFs encode the structure of recursive state estimation with prediction and measurement update that operate on a probability distribution over states. This structure represents an algorithmic prior that improves learning performance in state estimation problems while enabling explainability of the learned model. Our experiments on simulated and real data show substantial benefits from end-to-end learning with algorithmic priors, e.g. reducing error rates by ~80%. Our experiments also show that, unlike long short-term memory networks, DPFs learn localization in a policy-agnostic way and thus greatly improve generalization. Source code is available at https://…/differentiable-particle-filters.
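
The prediction and measurement-update structure that DPFs encode can be seen in a classic bootstrap particle filter step. The sketch below is the standard non-differentiable algorithm with hand-written models, not the DPF implementation: the 1-D state, the random-walk motion model, the Gaussian measurement model, and every name and parameter are assumptions made for illustration.

```python
import math
import random


def particle_filter_step(particles, observation, motion_noise, obs_noise, rng):
    """Run one predict / update / resample cycle on a list of 1-D particles."""
    # Prediction: push each particle through a random-walk motion model.
    predicted = [p + rng.gauss(0.0, motion_noise) for p in particles]
    # Measurement update: weight each particle by a Gaussian observation likelihood.
    weights = [math.exp(-0.5 * ((observation - p) / obs_noise) ** 2) for p in predicted]
    total = sum(weights)
    if total == 0.0:
        # All likelihoods underflowed; fall back to uniform weights.
        weights = [1.0 / len(predicted)] * len(predicted)
    else:
        weights = [w / total for w in weights]
    # Resampling: draw a new particle set in proportion to the weights.
    return rng.choices(predicted, weights=weights, k=len(predicted))


rng = random.Random(0)
prior = [-5.0 + 10.0 * i / 499 for i in range(500)]  # particles spread over [-5, 5]
posterior = particle_filter_step(prior, observation=2.0, motion_noise=0.1,
                                 obs_noise=0.5, rng=rng)
estimate = sum(posterior) / len(posterior)
```

After one step the particle mean moves close to the observation. In a DPF, the two hand-written models would be replaced by learnable ones, as the abstract states.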

OpenNMT: Neural Machine Translation Toolkit

OpenNMT is an open-source toolkit for neural machine translation (NMT). The system prioritizes efficiency, modularity, and extensibility with the goal of supporting NMT research into model architectures, feature representations, and source modalities, while maintaining competitive performance and reasonable training requirements. The toolkit consists of modeling and translation support, as well as detailed pedagogical documentation about the underlying techniques. OpenNMT has been used in several production MT systems, modified for numerous research papers, and is implemented across several deep learning frameworks.

Large-Scale Learning from Data Streams with Apache SAMOA

Apache SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams. Big data is defined as datasets whose size is beyond the ability of typical software tools to capture, store, manage, and analyze, due to the time and memory complexity. Apache SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms. It features a pluggable architecture that allows it to run on several distributed stream processing engines such as Apache Flink, Apache Storm, and Apache Samza. Apache SAMOA is written in Java and is available under the Apache Software License version 2.0.

Model-based Pricing for Machine Learning in a Data Marketplace

Data analytics using machine learning (ML) has become ubiquitous in science, business intelligence, journalism and many other domains. While a lot of work focuses on reducing the training cost, inference runtime and storage cost of ML models, little work studies how to reduce the cost of data acquisition, which potentially leads to a loss of sellers’ revenue and buyers’ affordability and efficiency. In this paper, we propose a model-based pricing (MBP) framework, which instead of pricing the data, directly prices ML model instances. We first formally describe the desired properties of the MBP framework, with a focus on avoiding arbitrage. Next, we show a concrete realization of the MBP framework via a noise injection approach, which provably satisfies the desired formal properties. Based on the proposed framework, we then provide algorithmic solutions on how the seller can assign prices to models under different market scenarios (such as to maximize revenue). Finally, we conduct extensive experiments, which validate that the MBP framework can provide high revenue to the seller, high affordability to the buyer, and also operate on low runtime cost.

Deep Learning under Privileged Information Using Heteroscedastic Dropout
Semantically-informed distance and similarity measures for paraphrase plagiarism identification
Beyond admissibility: Dominance between chains of strategies
How Does Batch Normalization Help Optimization (No, It Is Not About Internal Covariate Shift)
Automatic Identification of Arabic expressions related to future events in Lebanon’s economy
Polyglot Semantic Role Labeling
Deep Neural Networks for Swept Volume Prediction Between Configurations
Observe and Look Further: Achieving Consistent Performance on Atari
Playing hard exploration games by watching YouTube
Mirror, Mirror, on the Wall, Who’s Got the Clearest Image of Them All – A Tailored Approach to Single Image Reflection Removal
A Line-Search Algorithm Inspired by the Adaptive Cubic Regularization Framework and Complexity Analysis
Non-overlapping community detection
Focal onset seizure prediction using convolutional networks
Adversarial Regularizers in Inverse Problems
On gradient regularizers for MMD GANs
Entrainment profiles: Comparison by gender, role, and feature set
Winning Models for GPA, Grit, and Layoff in the Fragile Families Challenge
An exact solution for choosing the largest measurement from a sample drawn from an uniform distribution
Optimisation and Illumination of a Real-world Workforce Scheduling and Routing Application via Map-Elites
Braidless Weights, Minimal Representatives and the Weyl Group Multiple Dirichlet Series
The Actor Search Tree Critic (ASTC) for Off-Policy POMDP Learning in Medical Decision Making
Visually Grounded, Situated Learning in Neural Models
Lightly-supervised Representation Learning with Global Interpretability
Forward Amortized Inference for Likelihood-Free Variational Marginalization
Connected but Segregated: Social Networks in Rural Villages
Decision Making of Maximizers and Satisficers Based on Collaborative Explanations
CoupleNet: Paying Attention to Couples with Coupled Attention for Relationship Recommendation
aipred: A Flexible R Package Implementing Methods for Predicting Air Pollution
Neural Network Aided Decoding for Physical-Layer Network Coding Random Access
Low Resolution Face Recognition in the Wild
Learning to Transcribe by Ear
Novel and Improved Stage Estimation in Parkinson’s Disease using Clinical Scales and Machine Learning
Probabilistic nilpotence in infinite groups
Face Recognition in Low Quality Images: A Survey
You Say ‘What’, I Hear ‘Where’ and ‘Why’ – (Mis-)Interpreting SQL to Derive Fine-Grained Provenance
Large Multiuser MIMO Detection: Algorithms and Architectures
(3a:a)-list-colorability of embedded graphs of girth at least five
Capturing Variabilities from Computed Tomography Images with Generative Adversarial Networks
A Practical Method of Estimation and Inference for Policy-Relevant Treatment Effects
Efficient Bayesian Inference for a Gaussian Process Density Model
Rice Classification Using Hyperspectral Imaging and Deep Convolutional Neural Network
Recovering short secret keys of RLCE in polynomial time
Enabling LTE RACH Collision Multiplicity Detection via Machine Learning
Human vs Automatic Metrics: on the Importance of Correlation Design
Entity Linking in 40 Languages using MAG
AMR Dependency Parsing with a Typed Semantic Algebra
An Analytic Solution to the Inverse Ising Problem in the Tree-reweighted Approximation
Virtuously Safe Reinforcement Learning
Identifying Ketamine Responses in Treatment-Resistant Depression Using a Wearable Forehead EEG
On the chromatic number of generalized Kneser hypergraphs
A Geometric Approach for Computing Tolerance Bounds for Elastic Functional Data
Phase field approximations of branched transportation problems
On different notions of calibrations for minimal partitions and minimal networks in $\mathbb{R}^2$
A novel channel pruning method for deep neural network compression
Robust Tumor Localization with Pyramid Grad-CAM
Performance Benchmarking and Optimizing Hyperledger Fabric Blockchain Platform
A variational approach to the quasistatic limit of viscous dynamic evolutions in finite dimension
An Elementary Approach To Uniform In Time Propagation Of Chaos
Uniform regret bounds over $R^d$ for the sequential linear regression problem with the square loss
Automating Personnel Rostering by Learning Constraints Using Tensors
Webpage Saliency Prediction with Two-stage Generative Adversarial Networks
‘How to rate a video game’ – A prediction system for video games based on multimodal information
How to Blend a Robot within a Group of Zebrafish: Achieving Social Acceptance through Real-time Calibration of a Multi-level Behavioural Model
Lévy’s martingale characterization and reflection principle of $G$-Brownian motion
A transformed stochastic Euler scheme for multidimensional transmission PDE
Semantic Sentence Matching with Densely-connected Recurrent and Co-attentive Information
Properties of interaction networks, structure coefficients, and benefit-to-cost ratios
Offloading of Users in NOMA-HetNet Using Repulsive Point Process
CocoNet: A deep neural network for mapping pixel coordinates to color values
Quantum-inspired Complex Word Embedding
Fully Statistical Neural Belief Tracking
Approximation of Equilibrium and Exponential Times for the Random Walk on the Hypercube
Uncertainty Gated Network for Land Cover Segmentation
Bayesian identification of sound sources with the Helmholtz equation
Pointly-Supervised Action Localization
The Cohomology and Laplacians of Weighted Hypergraphs and Applications
CNN-Based Detection of Generic Contrast Adjustment with JPEG Post-processing
Neural networks for stock price prediction
Multiple-Access Channel with Independent Sources: Error Exponent Analysis
Stationarity and ergodicity of vector STAR models
Trust-based dynamic linear threshold models for non-competitive and competitive influence propagation
Scheduling under dynamic speed-scaling for minimizing weighted completion time and energy consumption
Unsupervised detection of diachronic word sense evolution
Learning Data Augmentation for Brain Tumor Segmentation with Coarse-to-Fine Generative Adversarial Networks
Flag numbers and floating bodies
Partition problems in high dimensional boxes
Rank Based Approach on Graphs with Structured Neighborhood
DynGEM: Deep Embedding Method for Dynamic Graphs
Improved Mixed-Example Data Augmentation
On Structural Properties of Feedback Optimal Control of Traffic Flow under the Cell Transmission Model
On the Hyper Zagreb index of certain generalized thorn graphs
Linearized wave turbulence convergence results for three-wave systems
Flexible Cholesky GARCH model with time dependent coefficients
Multi-hop Inference for Sentence-level TextGraphs: How Challenging is Meaningfully Combining Information for Science Question Answering
Disentangling by Partitioning: A Representation Learning Framework for Multimodal Sensory Data
Error Bounds on a Mixed Entropy Inequality
Statistical mechanical analysis of sparse linear regression as a variable selection problem
Iterative Statistical Linear Regression for Gaussian Smoothing in Continuous-Time Non-linear Stochastic Dynamic Systems
The deficit in an entropic inequality
Succinct data structure for dynamic trees with faster queries
Hierarchical One Permutation Hashing: Efficient Multimedia Near Duplicate Detection
Weak Supermodularity Assists Submodularity-based Approaches to Non-convex Constrained Optimization
Microscopy Cell Segmentation via Convolutional LSTM Networks
Computing degree of determinant via discrete convex optimization on Euclidean building
On Robust Trimming of Bayesian Network Classifiers
Truncated Horizon Policy Search: Combining Reinforcement Learning & Imitation Learning
Explicit construction of RIP matrices is Ramsey-hard
Review of Applications of Generalized Regression Neural Networks in Identification and Control of Dynamic Systems
The Secure Two-Receiver Broadcast Channel With One-Sided Receiver Side Information
Table-to-Text: Describing Table Region with Natural Language
Currency exchange prediction using machine learning, genetic algorithms and technical analysis
Getting to Know Low-light Images with The Exclusively Dark Dataset
Distilling Knowledge for Search-based Structured Prediction
Video Anomaly Detection and Localization via Gaussian Mixture Fully Convolutional Variational Autoencoder
Unsupervised Alignment of Embeddings with Wasserstein Procrustes
Non-rigid Reconstruction with a Single Moving RGB-D Camera
Bi-Directional Neural Machine Translation with Synthetic Parallel Data
Automatic Exposure Compensation for Multi-Exposure Image Fusion
Wireless Localization for mmWave Networks in Urban Environments
Toward Ka Band Acoustics: Lithium Niobate Asymmetrical Mode Piezoelectric MEMS Resonators
Statistical Recurrent Models on Manifold valued Data
A parallel implementation of the covariance matrix adaptation evolution strategy
Controllability of Ensemble Formation System over Digraph
CapsNet comparative performance evaluation for image classification
Cybersecurity in Distributed and Fully-Decentralized Optimization: Distortions, Noise Injection, and ADMM
Graph-based Filtering of Out-of-Vocabulary Words for Encoder-Decoder Models
Optimal transportation between unequal dimensions
GESF: A Universal Discriminative Mapping Mechanism for Graph Representation Learning
Analysis of vibrational normal modes for Coulomb clusters
Reachability Analysis for Robustness Evaluation of the Sit-To-Stand Movement for Powered Lower Limb Orthoses
Towards computational fluorescence microscopy: Machine learning-based integrated prediction of morphological and molecular tumor profiles
A short proof of Brooks’ theorem
A State Space Technique for Wildlife Position Estimation Using Non-Simultaneous Signal Strength Measurements
Strongly polynomial efficient approximation scheme for segmentation
A visual approach for age and gender identification on Twitter
On the asymptotic behaviour of the Aragon Artacho-Campoy algorithm
Confidence Prediction for Lexicon-Free OCR
Unsupervised Learning of Artistic Styles with Archetypal Style Analysis
Exemplar Guided Unsupervised Image-to-Image Translation
NengoDL: Combining deep learning and neuromorphic modelling methods
Quantum generalizations of the polynomial hierarchy with applications to QMA(2)
Modeling the residential electricity consumption within a restructured power market
Statistical Methods in Computed Tomography Image Estimation
Speeding up complex multivariate data analysis in Borexino with parallel computing based on Graphics Processing Unit
Object Counting with Small Datasets of Large Images
Adding New Tasks to a Single Network with Weight Transformations using Binary Masks
BlockCNN: A Deep Network for Artifact Removal and Image Compression
Syntactic Dependency Representations in Neural Relation Classification
Inference for ergodic diffusions plus noise
Discrete Linear Canonical Transform Based on Hyperdifferential Operators
GenAttack: Practical Black-box Attacks with Gradient-Free Optimization
On the sizes of $(k,l)$-edge-maximal $r$-uniform hypergraphs
Asymptotic analysis of the expected utility maximization problem with respect to perturbations of the numéraire
The GraftalLace Cellular Automaton
A Bismut-Elworthy-Li Formula for Singular SDE’s Driven by a Fractional Brownian Motion and Applications to Rough Volatility Modeling
Erdős-Lovász Tihany Conjecture for graphs with forbidden holes
Deep Reinforcement Learning in Ice Hockey for Context-Aware Player Evaluation
Distributed Stochastic Gradient Tracking Methods
Technical Report: Optimistic Execution in Key-Value Store
ALZA: An Efficient Hybrid Decentralized Payment System
Refining Source Representations with Relation Networks for Neural Machine Translation
End-to-End Speech-Driven Facial Animation with Temporal GANs