**Few-shot Autoregressive Density Estimation: Towards Learning to Learn Distributions**

**A Self-Training Method for Semi-Supervised GANs**

**Similarity-based Multi-label Learning**

**Deep Generative Dual Memory Network for Continual Learning**

**Fully Distributed Multi-sensor Change-point Detection**

**Information-Based Optimal Subdata Selection for Big Data Linear Regression**

**Label Embedding Network: Learning Label Representation for Soft Training of Deep Networks**

**Topic Based Sentiment Analysis Using Deep Learning**

**Interpretation of Neural Networks is Fragile**

**A Hybrid Data Mining Approach for Product Complexity Analysis**

**Practical Bayesian Inference for Record Linkage**

**A Bayesian Data Augmentation Approach for Learning Deep Models**

**Stochastic Training of Graph Convolutional Networks**

**Weight Initialization of Deep Neural Networks (DNNs) using Data Statistics**

**Variational Continual Learning**

**Regularization for Deep Learning: A Taxonomy**

**Kernel Graph Convolutional Neural Networks**

**Multilinear Class-Specific Discriminant Analysis**

**Evolving Deep Convolutional Neural Networks for Image Classification**

**Tensorizing Generative Adversarial Nets**

**Transfer Learning to Learn with Multitask Neural Model Search**

**Understanding Hidden Memories of Recurrent Neural Networks**

**How deep learning works – The geometry of deep learning**

**Understanding GANs: the LQG Setting**

**Weighted entropy: basic inequalities**

**A Comprehensive Survey on Fog Computing: State-of-the-art and Research Challenges**

• When tails wag the decision: The role of distributional tails on climate impacts on decision-relevant time-scales

• Wavelet Shrinkage and Thresholding based Robust Classification for Brain Computer Interface

• One-shot and few-shot learning of word embeddings

• Probability Series Expansion Classifier that is Interpretable by Design

• Properties of the Fibonacci-sum graph

• Identifying overlapping terrorist cells from the Noordin Top actor-event network

• Spectral Graph Wavelets for Structural Role Similarity in Networks

• On Maximally Recoverable Local Reconstruction Codes

• Lower Bounds for Higher-Order Convex Optimization

• Multi-modal Aggregation for Video Classification

• Improved approximation of layout problems on random graphs

• Insights on Variance Estimation for Blocked and Matched Pairs Designs

• Hasse diagrams of non-isomorphic posets with $n$ elements, $2\leq n \leq 7,$ and the number of posets with $10$ elements, without the aid of any computer program

• A Treatise on Sucker’s Bets

• The Implicit Bias of Gradient Descent on Separable Data

• Identifying Individual Disease Dynamics in a Stochastic Multi-pathogen Model From Aggregated Reports and Laboratory Data

• Multi-level Residual Networks from Dynamical Systems View

• Bayesian Spatial Binary Regression for Label Fusion in Structural Neuroimaging

• Automated Design using Neural Networks and Gradient Descent

• Brewster anomaly in random anisotropic media

• Convolutional Neural Networks Via Node-Varying Graph Filters

• Deep Residual Learning for Small-Footprint Keyword Spotting

• Diff-DAC: Distributed Actor-Critic for Multitask Deep Reinforcement Learning

• Consistency of Lipschitz learning with infinite unlabeled data and finite labeled data

• Lower Bounds for Two-Sample Structural Change Detection in Ising and Gaussian Models

• Topology adaptive graph convolutional networks

• Combinatorial proof of an identity of Andrews–Yee

• Exploring Asymmetric Encoder-Decoder Structure for Context-based Sentence Representation Learning

• Partitioning Relational Matrices of Similarities or Dissimilarities using the Value of Information

• A Geometric Perspective on the Power of Principal Component Association Tests in Multiple Phenotype Studies

• Left-Right Skip-DenseNets for Coarse-to-Fine Object Categorization

• A Range-Doppler-Angle Estimation Method for Passive Bistatic Radar

• Minimax Rates and Efficient Algorithms for Noisy Sorting

• Blocking Probability and Spatial Throughput Characterization for Cellular-Enabled UAV Network with Directional Antenna

• Local approximation of a metapopulation’s equilibrium

• Geometric Decomposition-Based Formulation for Time Derivatives of Instantaneous Impact Point

• A Study of All-Convolutional Encoders for Connectionist Temporal Classification

• Total-Text: A Comprehensive Dataset for Scene Text Detection and Recognition

• Trainable back-propagated functional transfer matrices

• Efficient Localized Inference for Large Graphical Models

• Doppelgangers: the Ur-Operation and Posets of Bounded Height

• Cox’s proportional hazards model with a high-dimensional and sparse regression parameter

• Efficient Licence Plate Detection By Unique Edge Detection Algorithm and Smarter Interpretation Through IoT

• Channel Coherence Classification with Frame-Shifting in Massive MIMO System

• Uniform rank gradient, cost and local-global convergence

• An Ontology to support automated negotiation

• A Framework for Compressive Time-of-Flight 3D Sensing

• Sample-level CNN Architectures for Music Auto-tagging Using Raw Waveforms

• Criteria for input-to-state practical stability

• Inducing Regular Grammars Using Recurrent Neural Networks

• All partitions have small parts – Gallai-Ramsey numbers of bipartite graphs

• Toward predictive machine learning for active vision

• Omnidirectional Precoding and Combining Based Synchronization for Millimeter Wave Massive MIMO Systems

• Long-Distance Loop Closure Using General Object Landmarks

• Generalized End-to-End Loss for Speaker Verification

• Speaker Diarization with LSTM

• Attention-Based Models for Text-Dependent Speaker Verification

• SeeThrough: Finding Chairs in Heavily Occluded Indoor Scenes

• A reference-searching-based algorithm for large-scale data envelopment analysis computation

• On the $\alpha$-index of graphs with pendent paths

• ILAPF: Incremental Learning Assisted Particle Filtering

• Analytical Estimation of Scalability of Iterative Numerical Algorithms on Distributed Memory Multiprocessors

• Learning to diagnose from scratch by exploiting dependencies among labels

• Customer sojourn time in GI/G/1 feedback queue in the presence of heavy tails

• Phase Conductor on Multi-layered Attentions for Machine Comprehension

• Online Approximate Optimal Station Keeping of a Marine Craft in the Presence of a Current

• Crime incidents embedding using restricted Boltzmann machines

• Optimal Battery Participation in Frequency Regulation Markets

• Exploiting Points and Lines in Regression Forests for RGB-D Camera Relocalization

• A Dual Encoder Sequence to Sequence Model for Open-Domain Dialogue Modeling

• Object Recognition by Using Multi-level Feature Point Extraction

• Interlacement and Activities in Delta-Matroids

• Optimal designs for regression with spherical data

• Parking on transitive unimodular graphs

• Interpretable Apprenticeship Learning with Temporal Logic Specifications

• Heat kernel and ergodicity of SDEs with distributional drifts

• Partial Knowledge In Embeddings

• Interaction between cluster synchronization and epidemic spread in community networks

• Hierarchical and Distributed Monitoring of Voltage Stability in Distribution Networks

• A $o(d) \cdot \text{polylog}~n$ Monotonicity Tester for Boolean Functions over the Hypergrid $[n]^d$

• Vehicle Routing Problem with Vector Profits (VRPVP) with Max-Min Criterion

• Stochastic Zeroth-order Optimization in High Dimensions

• A Novel Approach to Artistic Textual Visualization via GAN

• Smooth Sensitivity Based Approach for Differentially Private Principal Component Analysis

• Synthetic Iris Presentation Attack using iDCGAN

• Certifiable Distributional Robustness with Principled Adversarial Training

• Personalized word representations Carrying Personalized Semantics Learned from Social Network Posts

• Examining CNN representations with respect to Dataset Bias

• Secrecy Rate Maximization with Outage Constraint in Multihop Relaying Networks

• Path-Based Attention Neural Model for Fine-Grained Entity Typing

• Evaluation of Automatic Video Captioning Using Direct Assessment

• Intelligent Interference Exploitation for Heterogeneous Cellular Networks against Eavesdropping

• Automatic Knee Osteoarthritis Diagnosis from Plain Radiographs: A Deep Learning-Based Approach

• Almost Optimal Stochastic Weighted Matching With Few Queries

• Social Welfare Maximization Auction in Edge Computing Resource Allocation for Mobile Blockchain

• Regularization approaches for support vector machines with applications to biomedical data

• SDPNAL+: A Matlab software for semidefinite programming with bound constraints (version 1.0)

• Regularity and Sensitivity for McKean-Vlasov SPDEs

• Finding Dominant User Utterances And System Responses in Conversations

• $k$-Foldability of Words

• Half of an antipodal spherical design

• Detecting Multiple Random Changepoints in Bayesian Piecewise Growth Mixture Models

• Dimensionality reduction methods for molecular simulations

• Recursive formulae in regularity structures

• JESC: Japanese-English Subtitle Corpus

• Percolation without FKG

• On the Consistency of Quick Shift

• Using the quantization error from Self-Organized Map (SOM) output for detecting critical variability in large bodies of image time series in less than a minute

• Robust adaptive efficient estimation for a semi-Markov continuous time regression from discrete data

• Delivery Time Minimization in Edge Caching: Synergistic Benefits of Subspace Alignment and Zero Forcing

• If it ain’t broke, don’t fix it: Sparse metric repair

• Multi-Armed Bandits with Non-Stationary Rewards

• Improved Bounds for Testing Forbidden Order Patterns

• A Study on Topological Descriptors for the Analysis of 3D Surface Texture

• List-decodable zero-rate codes

• Wideband Channel Estimation for Hybrid Beamforming Millimeter Wave Communication Systems with Low-Resolution ADCs

• Narrowband Channel Estimation for Hybrid Beamforming Millimeter Wave Communication Systems with One-bit Quantization

• Discovery Radiomics with CLEAR-DR: Interpretable Computer Aided Diagnosis of Diabetic Retinopathy

• Local limit theorems and mod-phi convergence

• High-Precision Localization Using Ground Texture

• Maximum Likelihood Estimations Based on Upper Record Values for Probability Density Function and Cumulative Distribution Function in Exponential Family and Investigating Some of Their Properties

• Research on ruin probability of risk model based on AR(1) series

• Robust Optimal Design of Quantum Electronic Devices

• Training Probabilistic Spiking Neural Networks with First-to-spike Decoding

• Distributional Consistency of Lasso by Perturbation Bootstrap

• On Pre-Trained Image Features and Synthetic Images for Deep Learning

• Bayesian Nonparametric Differential Analysis for Dependent Multigroup Data with Application to Colorectal Cancer DNA Methylation

• A Saak Transform Approach to Efficient, Scalable and Robust Handwritten Digits Recognition

• Optimal Coded Multicast in Cache Networks with Arbitrary Content Placement

• Globally Optimal Symbolic Regression

• Simple and Effective Multi-Paragraph Reading Comprehension

• BAS: Beetle Antennae Search Algorithm for Optimization Problems

• Breaking the Madry Defense Model with $L_1$-based Adversarial Examples

• Can you find a face in a HEVC bitstream?

• Linearly convergent stochastic heavy ball method for minimizing generalization error

• Learning neural trans-dimensional random field language models with noise-contrastive estimation

• Implicit Causal Models for Genome-wide Association Studies

• Detection and Estimation of the Invisible Units Using Utility Data Based on Random Matrix Theory

• Crack Is Controllable, a controllable crack propagation method by using artificial neural network assisted particle swarm optimization

• Cascade Region Proposal and Global Context for Deep Object Detection

• Computational Social Choice and Computational Complexity: BFFs?

• Stationarity Region of Mm-Wave Channel Based on Outdoor Microcellular Measurements at 28 GHz

• Modeling Attention in Panoramic Video: A Deep Reinforcement Learning Approach

• Fair Termination for Parameterized Probabilistic Concurrent Systems (Technical Report)

• On an extremal problem involving a pair of forbidden posets

• PixelDefend: Leveraging Generative Models to Understand and Defend against Adversarial Examples

• Distance-based classifier by data transformation for high-dimension, strongly spiked eigenvalue models

• Communication-Avoiding Optimization Methods for Massive-Scale Graphical Model Structure Learning

• Frank-Wolfe methods for geodesically convex optimization with application to the matrix geometric mean

• Sequence-to-Sequence ASR Optimization via Reinforcement Learning

• Generative Adversarial Source Separation

• Sparse Vector Coding for Ultra-Reliable and Low Latency Communications

• Stochastic variance reduced multiplicative update for nonnegative matrix factorization

• An introduction to random matrix theory

• Performance Limits of Compressive Sensing Channel Estimation in Dense Cloud RAN

• DART: Distribution Aware Retinal Transform for Event-based Cameras

• Performance Analysis of Multi-Service Oriented Multiple Access Under General Channel Correlation

• Reliable Communication under the Influence of a State-Constrained Jammer: A Novel Perspective on Receive Diversity

• Hit Song Prediction for Pop Music by Siamese CNN with Ranking Loss

• Monotonicity and robustness in Wiener disorder detection

• Rough extreme learning machine: a new classification method based on uncertainty measure

• 2D Unitary ESPRIT Based Super-Resolution Channel Estimation for Millimeter-Wave Massive MIMO with Hybrid Precoding

• Generalized gradient optimization over lossy networks for partition-based estimation

• A Framework for Over-the-air Reciprocity Calibration for TDD Massive MIMO Systems

• Gradient Estimates on Dirichlet Eigenfunctions

• Verification of BSF Parallel Computational Model

• An algorithmic approach to handle circular trading in commercial taxing system

• An introduction to Wishart matrix moments

• Asymptotic analysis of average case approximation complexity of additive random fields

• Unifying Value Iteration, Advantage Learning, and Dynamic Policy Programming

• Factorizations of $k$-Nonnegative Matrices

• Sparse covariance matrix estimation in high-dimensional deconvolution

• Shifts of the prime divisor function of Alladi and Erdős

• Fast Linear Model for Knowledge Graph Embeddings

• Weak Stability of $\ell_1$-minimization Methods in Sparse Data Reconstruction

• Divisibility of binomial coefficients by powers of two

• Models with varying structure

• Calorimeter-less gamma-ray telescopes: Optimal measurement of charged particle momentum from multiple scattering by Bayesian analysis of Kalman filtering innovations

• Rectilinear and $\mathcal{O}$-convex hull with minimum area

• Open Set Logo Detection and Retrieval

• Level algebras and $\boldsymbol{s}$-lecture hall polytopes

• Learning to solve inverse problems using Wasserstein loss

• A Massively Parallel Algorithm for the Approximate Calculation of Inverse p-th Roots of Large Sparse Matrices

• Monochromatic Paths in the Complete Symmetric Infinite Digraph

• An FPTAS of Minimizing Total Weighted Completion Time on Single Machine with Position Constraint

• Abelian Schur groups of odd order

• Device-centric Energy Optimization for Edge Cloud Offloading

• Optimal Kernel-Based Dynamic Mode Decomposition

• Solution of linear ill-posed problems by model selection and aggregation

• The loss surface and expressivity of deep convolutional neural networks

• Numerical approximation of general Lipschitz BSDEs with branching processes

• Asymptotically efficient estimators for stochastic blockmodels: the naive MLE, the rank-constrained MLE, and the spectral

• Evidence for thermal activation in the glassy dynamics of insulating granular aluminum conductance

• A Supervised STDP-based Training Algorithm for Living Neural Networks

• At the Roots of Dictionary Compression: String Attractors

• A short proof of a lower bound for Turán numbers

• Content-based Representations of audio using Siamese neural networks

• Finding Connected Secluded Subgraphs

• Statistical validation of financial time series via visibility graph

• Convex duality in nonlinear optimal transport

• Conceptual Text Summarizer: A new model in continuous vector space

• A Derivative-Free Gauss-Newton Method

• Kirszbraun-type Theorems For Graphs

• Error Analysis for the Linear Feedback Particle Filter

• An Artificial-Noise-Aided Secure Scheme for Hybrid Parallel PLC/Wireless OFDM Systems

• Derivation of the stochastic Burgers equation with Dirichlet boundary conditions from the WASEP

• Limiting empirical spectral distribution for the non-backtracking matrix of an Erdős-Rényi random graph

• Rate-Splitting for Downlink Multi-User Multi-Antenna Systems: Bridging NOMA and Conventional Linear Precoding

• A new class of bell-shaped functions

• Named Entity Recognition in Twitter using Images and Text

• Probabilistic Count Matrix Factorization for Single Cell Expression Data Analysis

• Stochastic gradient descent performs variational inference, converges to limit cycles for deep networks

• Descent polynomials

• Machine Translation of Low-Resource Spoken Dialects: Strategies for Normalizing Swiss German

• How Should a Robot Assess Risk? Towards an Axiomatic Theory of Risk in Robotics

• Unsupervised Neural Machine Translation

• Isolation and connectivity in random geometric graphs with self-similar intensity measures

• Trends in European flood risk over the past 150 years

• A Connection between Feed-Forward Neural Networks and Probabilistic Graphical Models

• Semantic Code Repair using Neuro-Symbolic Transformation Networks

• Improved quantum annealer performance from oscillating transverse fields

• Techreport: Time-sensitive probabilistic inference for the edge

• Grad-CAM++: Generalized Gradient-based Visual Explanations for Deep Convolutional Networks

• Asymptotic degree distributions in large homogeneous random networks: A little theory and a counterexample

• On Fair Reinsurance Premiums; Capital Injections in a Perturbed Risk Model

• Convergence Rates of Latent Topic Models Under Relaxed Identifiability Conditions

• Summations of Linear Recurrent Sequences

• Continuous Authentication Using One-class Classifiers and their Fusion

• A mathematical bridge between discretized gauge theories in quantum physics and approximate reasoning in pairwise comparisons

• An Integrated Approach to Crowd Video Analysis: From Tracking to Multi-level Activity Recognition

• Eigenoption Discovery through the Deep Successor Representation

• Infinite dimensional compressed sensing from anisotropic measurements

• The Capacity of Private Computation