• **Doctor AI: Predicting Clinical Events via Recurrent Neural Networks**

• **Metric Learning with Adaptive Density Discrimination**

• **Probabilistic K-Means using Method of Moments**

• **A note on probability metrics in a categorical setting**

• Prioritized Experience Replay

• Staleness-aware Async-SGD for Distributed Deep Learning

• Least squares estimation for the subcritical Heston model based on continuous time observations

• ACDC: A Structured Efficient Linear Layer

• Weighted multiple ergodic averages and correlation sequences

• Unitary-Group Invariant Kernels and Features from Transformed Unlabeled Data

• From generalized Tamari intervals to non-separable planar maps (extended abstract)

• On the Global Linear Convergence of Frank-Wolfe Optimization Variants

• Combining Neural Networks and Log-linear Models to Improve Relation Extraction

• Bayesian quantile regression analysis for continuous data with a discrete component at zero

• Automatic Region-wise Spatially Varying Coefficient Regression Model: an Application to National Cardiovascular Disease Mortality and Air Pollution Association Study

• Mean-Field interacton of Brownian occupation measures. II: A rigorous construction of the Pekar process

• Behavior Query Discovery in System-Generated Temporal Graphs

• Comparison of viscosity solutions of fully nonlinear degenerate parabolic Path-dependent PDEs

• Censoring Representations with an Adversary

• Infinite excursions of router walks on regular trees

• Randomization can be as helpful as a glimpse of the future in online computation

• Generation of scenarios from calibrated ensemble forecasts with a dynamic ensemble copula coupling approach

• On Mäkelä’s Conjectures: deciding if a morphic word avoids long abelian-powers

• Fast Saddle-Point Algorithm for Generalized Dantzig Selector and FDR Control with the Ordered $\ell_1$-Norm

• Matrix-Ball Construction of affine Robinson-Schensted correspondence

• Anomalous Contagion and Renormalization in Dynamical Networks with Nodal Mobility

• Trees with small b-chromatic index

• The Hopf Algebra of graph invariants

• On an adaptive preconditioned Crank-Nicolson algorithm for infinite dimensional Bayesian inferences

• Using Machine Learning to Predict the Outcome of English County twenty over Cricket Matches

• Alternative Markov Properties for Acyclic Directed Mixed Graphs

• The relationship between internet user type and user performance when carrying out simple vs. complex search tasks

• A Framework for Evaluating the Retrieval Effectiveness of Search Engines

• The retrieval effectiveness of search engines on navigational queries

• The Influence of Commercial Intent of Search Results on Their Perceived Relevance

• Ranking library materials

• What Users See – Structures in Search Engine Results Pages

• The Retrieval Effectiveness of Web Search Engines: Considering Results Descriptions

• Problems with the use of Web search engines to find results in foreign languages

• Metric learning for graph-based label propgation

• The historical Moran model

• Nonparametric estimation for irregularly sampled Lévy processes

• Cache-Conscious Run-time Decomposition of Data Parallel Computations

• Uniqueness of the extreme cases in theorems of Drisko and Erdős-Ginzburg-Ziv

• Toward Transparent Heterogeneous Systems

• Continued Classification of 3D Lattice Walks in the Positive Octant

• Image Question Answering using Convolutional Neural Network with Dynamic Parameter Prediction

• Solution Repair/Recovery in Uncertain Optimization Environment

• Penalized complexity priors for degrees of freedom in Bayesian P-splines

• Infinite-dimensional calculus under weak spatial regularity of the processes

• Sparse learning of maximum likelihood model for optimization of complex loss function

• The classical-quantum divergence of complexity in the Ising spin chain

• On the minimum of a conditioned Brownian bridge

• Dummy variables and their interactions in regression analysis: examples from research on body mass index

• Generation and motion of interfaces in one-dimensional stochastic Allen-Cahn equation

• Online learning in repeated auctions

• Using Abduction in Markov Logic Networks for Root Cause Analysis

• Complex-Valued Gaussian Processes for Regression: A Widely Non-Linear Approach

• Efficient Output Kernel Learning for Multiple Tasks

• Hyperspectral Unmixing in Presence of Endmember Variability, Nonlinearity or Mismodelling Effects

• One to rule them all: a general method for fast computation on semirings isomorphic to $(\times, \max)$ on $\mathbb{R}_+$

• A Distribution Adaptive Framework for Prediction Interval Estimation Using Nominal Variables

• Preimages under the Stack-Sorting Algorithm

• Wishart Mechanism for Differentially Private Principal Components Analysis

• Expressiveness of Rectifier Networks

• Discovering Underlying Plans Based on Distributed Representations of Actions

• Bayesian hypothesis testing for one bit compressed sensing with sensing matrix perturbation

• Learning Discriminative Representations for Semantic Cross Media Retrieval

• Why are deep nets reversible: A simple theory, with implications for training

• Tree-Guided MCMC Inference for Normalized Random Measure Mixture Models

• The Invisible Hand of Dynamic Market Pricing

• Adversarial Autoencoders

• A New Smooth Approximation to the Zero One Loss with a Probabilistic Interpretation

• Net2Net: Accelerating Learning via Knowledge Transfer

• Discrete one-dimensional oriented percolation of intervals

• Competitive Multi-scale Convolution

• Local entropy as a measure for sampling solutions in Constraint Satisfaction Problems

• Two laws of large numbers for sublinear expectations

• Marginalized Two Part Models for Generalized Gamma Family of Distributions

• MOEA/D-GM: Using probabilistic graphical models in MOEA/D for solving combinatorial optimization problems

• Predicting distributions with Linearizing Belief Networks

• Learning Structured Inference Neural Networks with Label Relations

• A Bayesian Semiparametric Framework for Understanding and Predicting Customer Base Dynamics

• A Block Regression Model for Short-Term Mobile Traffic Forecasting

• Co-modularity and Co-community Detection in Large Networks

• Identifying the Absorption Bump with Deep Learning

• Rescue of endemic states in interconnected networks with adaptive coupling

• blavaan: Bayesian structural equation models via parameter expansion

• Semiparametric Estimation of CES Demand System with Observed and Unobserved Product Characteristics