**On generalized inferential models**

**Top-N recommendations from expressive recommender systems**

**Embarrassingly Parallel Time Series Analysis for Large Scale Weak Memory Systems**

**Bayesian inference via rejection filtering**

**Neural Network Matrix Factorization**

**Towards Principled Unsupervised Learning**

**Joint Word Representation Learning using a Corpus and a Semantic Lexicon**

**Comparative Study of Caffe, Neon, Theano, and Torch for Deep Learning**

**Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks**

**Canonical Autocorrelation Analysis**

**Unsupervised and Semi-supervised Learning with Categorical Generative Adversarial Networks**

**sense2vec – A Fast and Accurate Method for Word Sense Disambiguation In Neural Word Embeddings**

**Dynamic Adaptive Network Intelligence**

• A deconvolution path for mixtures

• Data-Dependent Path Normalization in Neural Networks

• Images Don’t Lie: Transferring Deep Visual Semantic Features to Large-Scale Multimodal Learning to Rank

• Training CNNs with Low-Rank Filters for Efficient Image Classification

• Sequence Level Training with Recurrent Neural Networks

• A Bayesian hidden Markov mixture model to detect overexpressed chromosome regions

• Hand Pose Estimation through Weakly-Supervised Learning of a Rich Intermediate Representation

• Scalable Gradient-Based Tuning of Continuous Regularization Hyperparameters

• Improving Neural Machine Translation Models with Monolingual Data

• Generalizations of the Strong Arnold Property and the minimum number of distinct eigenvalues of a graph

• Stories in the Eye: Contextual Visual Interactions for Efficient Video to Language Translation

• The local structure of $q$-Gaussian processes

• The partial copula: Properties and associated dependence measures

• From restricted permutations to Grassmann necklaces and back again

• F-Index of Some Graph Operations

• Using Deep Learning to Predict Demographics from Mobile Phone Metadata

• Bivariate Binomial Moments and Bonferroni-type Inequalities

• Compressed and quantized correlation estimators

• Use of Eigenvector Centrality to Detect Graph Isomorphism

• Data Representation and Compression Using Linear-Programming Approximations

• Exponential Natural Particle Filter

• Polysemy in Controlled Natural Language Texts

• Crowd Behavior Analysis: A Review where Physics meets Biology

• Dueling Network Architectures for Deep Reinforcement Learning

• Hankel Matrices for the Period-Doubling Sequence

• Tree Embedding and Directed Steiner Problems

• Near-Optimal UGC-hardness of Approximating Max k-CSP_R

• Bayesian identification of bacterial strains from sequencing data

• On the weak convergence of the empirical conditional copula under a simplifying assumption

• Effects of the tempered aging and its Fokker-Planck equation

• Compression of Deep Convolutional Neural Networks for Fast and Low Power Mobile Applications

• Optimal Approximate Designs for Comparison with Control in Dose-Escalation Studies

• Integrating Deep Features for Material Recognition

• The Euler method for continuous-time nonlinear filtering and stable convergence of conditional law

• Emergence of strongly connected giant components in continuum disk-spin percolation

• Resiliency of Deep Neural Networks under Quantization

• mplrs: A scalable parallel vertex/facet enumeration code

• Trivializing The Energy Landscape Of Deep Networks

• Quasipolynomial Solutions to the Hofstadter Q-Recurrence

• Variance Reduction in SGD by Distributed Importance Sampling

• On Binary Embedding using Circulant Matrices

• Every finite set of integers is an asymptotic approximate group

• Faster Parallel Solver for Positive Linear Programs via Dynamically-Bucketed Selective Coordinate Descent

• Unitary Evolution Recurrent Neural Networks

• Joint Inverse Covariances Estimation with Mutual Linear Structure

• DOC: Deep OCclusion Recovering From A Single Image

• Task Loss Estimation for Sequence Prediction

• Variational Auto-encoded Deep Gaussian Processes

• Deep Metric Learning via Lifted Structured Feature Embedding

• Learning to decompose for object detection and instance segmentation

• Learning Representations from EEG with Deep Recurrent-Convolutional Neural Networks

• Universality in halting time and its applications in optimization

• Learning metrics by learning constrained embeddings of objects to $\mathbb{R}^n$

• A parallel algorithm for the constrained shortest path problem on lattice graphs

• A convnet for non-maximum suppression

• Delving Deeper into Convolutional Networks for Learning Video Representations

• Deconstructing the Ladder Network Architecture

• A Controller Recognizer Framework: How necessary is recognition for control?

• Reasoning in Vector Space: An Exploratory Study of Question Answering

• First Step toward Model-Free, Anonymous Object Tracking with Recurrent Neural Networks

• An Information Retrieval Approach to Finding Dependent Subspaces of Multiple Views

• All you need is a good init

• Deep Manifold Traversal: Changing Labels with Convolutional Features

• Skip-Thought Memory Networks

• Binding via Reconstruction Clustering

• Modelling Spatial Compositional Data: Reconstructions of past land cover and uncertainties

• Fast Parallel SAME Gibbs Sampling on General Discrete Bayesian Networks

• QBDC: Query by dropout committee for training deep supervised architecture

• Direct Loss Minimization for Training Deep Neural Nets

• Better Computer Go Player with Neural Network and Long-term Prediction

• Learning to generate images with perceptual similarity metrics

• Recurrent Models for Auditory Attention in Multi-Microphone Distance Speech Recognition

• Denoising Criterion for Variational Auto-Encoding Framework

• Minimum disparity estimation in controlled branching processes

• Multilingual Relation Extraction using Compositional Universal Schema

• Geodesics of learned representations

• Fixed Point Quantization of Deep Convolutional Networks

• Neural Random-Access Machines

• Order Matters: Sequence to sequence for sets

• Griffiths effects and slow dynamics in nearly many-body localized systems

• A Unified Gradient Regularization Family for Adversarial Examples

• Iterative Refinement of Approximate Posterior for Training Directed Belief Networks

• Manifold Regularized Deep Neural Networks using Adversarial Examples

• Unsupervised Learning of Visual Structure using Predictive Generative Networks

• Characterizing graphs of maximum principal ratio

• Quantum phases of interacting electrons in three-dimensional dirty Dirac semimetals