**Robust mixture regression modeling based on the Generalized M (GM)-estimation method**

**Modular Autoencoders for Ensemble Feature Extraction**

**Learning Simple Algorithms from Examples**

**Bayesian Minimal Description Lengths for Multiple Changepoint Detection**

**Improving the performance of the linear systems solvers using CUDA**

**Medusa: An Efficient Cloud Fault-Tolerant MapReduce**

**NearBucket-LSH: Efficient Similarity Search in P2P Networks**

**A PAC Approach to Application-Specific Algorithm Selection**

**Parallel Predictive Entropy Search for Batch Global Optimization of Expensive Objective Functions**

**Visual Word2Vec (vis-w2v): Learning Visually Grounded Word Embeddings Using Abstract Scenes**

**Performance Analysis of Apriori Algorithm with Different Data Structures on Hadoop Cluster**

**Analysis of a Play by Means of CHAPLIN, the Characters and Places Interaction Network Software**

**Enumerating Periodic Points of Certain Sequential Dynamical Systems**

**On the Linear Algebraic Structure of Distributed Word Representations**

**Session-based Recommendations with Recurrent Neural Networks**

**BlackOut: Speeding up Recurrent Neural Network Language Models With Very Large Vocabularies**

**Online Sequence Training of Recurrent Neural Networks with Connectionist Temporal Classification**

**GradNets: Dynamic Interpolation Between Neural Architectures**

**Kernel Additive Principal Components**

**An Empirical Comparison of the Summarization Power of Graph Clustering Methods**

• Sustainability in the Stochastic Ramsey Model

• Anomalous Hall effect in 2D Rashba ferromagnet

• Approximation Algorithms for Route Planning with Nonlinear Objectives

• Convolutional Pseudo-Prior for Structured Labeling

• Right-handed bialgebras and the Prelie forest formula

• MazeBase: A Sandbox for Learning from Games

• Maximum of the characteristic polynomial of random unitary matrices

• 2D phononic crystals: Disorder matters

• Approximation of stochastic processes by non-expansive flows and coming down from infinity

• Cache Miss Estimation for Non-Stationary Request Processes

• Surpassing Humans in Boundary Detection using Deep Learning

• Equivalence of the Brownian and energy representations

• GPU-based Acceleration of Deep Convolutional Neural Networks on Mobile Platforms

• What is the plausibility of probability?(revised 2003, 2015)

• Black box variational inference for state space models

• Interpretable Two-level Boolean Rule Learning for Classification

• Switched Dynamical Latent Force Models for Modelling Transcriptional Regulation

• Directed random polymers via nested contour integrals

• Asymptotics for some polynomial patterns in the primes

• 1-perfectly orientable graphs and graph products

• Adapting the serial Alpgen event generator to simulate LHC collisions on millions of parallel threads

• The arithmetical rank of the edge ideals of graphs with pairwise disjoint cycles

• Spatial coherence of thermal photons favors photosynthetic life

• Ramsey numbers of trees and unicyclic graphs versus fans

• Block Matrix Formulations for Evolving Networks

• Convergence Results for a Class of Time-Varying Simulated Annealing Algorithms

• On Partitioning the Edges of 1-Planar Graphs

• Stochastic Parallel Block Coordinate Descent for Large-scale Saddle Point Problems

• Sparse Recovery via Partial Regularization: Models, Theory and Algorithms

• Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs)

• Cyclic groups are CI-groups for balanced configurations

• Sparse Linear Models applied to Power Quality Disturbance Classification

• The number of trees in a graph

• Automorphism groups of Cayley graphs generated by block transpositions and regular Cayley maps

• Ridge Leverage Scores for Low-Rank Approximation

• A Python Extension for the Massively Parallel Multiphysics Simulation Framework waLBerla

• New results on maximal partial line spreads in PG(5,q)

• On the total $(k,r)$-domination number of random graphs

• Theta characteristics of hyperelliptic graphs

• Robust hedging of options on local time

• On sums of binomial coefficients modulo $p^2$

• Different kinds of chimera death states in nonlocally coupled oscillators

• Positive Discrete Spectrum of the Evolutionary Operator of Supercritical Branching Walks with Heavy Tails

• Noisy Submodular Maximization via Adaptive Sampling with Applications to Crowdsourced Image Collection Summarization

• Multi-Agent Continuous Transportation with Online Balanced Partitioning

• Hölder-type inequalities and their applications to concentration and correlation bounds

• Longest Gapped Repeats and Palindromes

• Detection of Uniform and Non-Uniform Differential Item Functioning by Item Focussed Trees

• Developing a High Performance Software Library with MPI and CUDA for Matrix Computations

• Associativity and non-associativity of some hypergraph products

• A supersymmetric approach to martingales related to the vertex-reinforced jump process

• Increasing the minimum distance of codes by twisting

• The Kuramoto model in complex networks

• Trajectories entropy in dynamical graphs with memory

• An integral inequality for the invariant measure of a stochastic reaction–diffusion equation

• What Happened to My Dog in That Network: Unraveling Top-down Generators in Convolutional Neural Networks

• Cascading Denoising Auto-Encoder as a Deep Directed Generative Model

• On the Generalization Error Bounds of Neural Networks under Diversity-Inducing Mutual Angular Regularization

• Modelling latent individual heterogeneity in mark-recapture data with Dirichlet process priors

• Efficient MCMC implementation of multi-state mark-recapture models

• Metric Entropy estimation using o-minimality Theory

• Exponential decay rate of partial autocorrelation coefficients of ARMA and short-memory processes

• Multiple–Instance Learning: Christoffel Function Approach to Distribution Regression Problem

• Constructions in Ramsey theory

• Site and bond percolation thresholds in $K_{n,n}$-based lattices: Vulnerability of quantum annealers to random qubit and coupler failures on Chimera topologies

• Max-sum diversity via convex programming

• A Plausible Memristor Implementation of Deep Learning Neural Networks

• Which Regular Expression Patterns are Hard to Match?

• ReSeg: A Recurrent Neural Network for Object Segmentation

• The False Discovery Rate (FDR) of Multiple Tests in a Class Room Lecture

• Constant Factor Approximation for ATSP with Two Edge Weights

• Another Generalization of Unimodality

• Detecting Road Surface Wetness from Audio: A Deep Learning Approach

• Partial Coherence Estimation via Spectral Matrix Shrinkage under Quadratic Loss

• Pattern Recognition on Oriented Matroids: Symmetric Cycles in the Hypercube Graphs

• On a Natural Dynamics for Linear Programming

• Convergence of Stochastic Interacting Particle Systems in Probability under a Sobolev Norm

• Understanding Music Playlists

• Non-Sentential Utterances in Dialogue: Experiments in Classification and Interpretation

• Standardness as an invariant formulation of independence

• End-to-end Learning of Action Detection from Frame Glimpses in Videos

• On the quasi-depth of squarefree monomial ideals and the sdepth of the monomial ideal of independent sets of a graph

• Generating Configurable Hardware from Parallel Patterns

• Online Semi-Supervised Learning with Deep Hybrid Boltzmann Machines and Denoising Autoencoders

• Gradual DropIn of Layers to Train Very Deep Neural Networks

• Discretisations of rough stochastic PDEs

• First passage percolation on the exponential of two-dimensional branching random walk

• Evaluating Prerequisite Qualities for Learning End-to-End Dialog Systems

• Large deviations for empirical measures generated by Gibbs measures with singular energy functionals

• Recycling intermediate steps to improve Hamiltonian Monte Carlo

• ICU Patient Deterioration prediction: a Data-Mining Approach

• A Simple Algorithm For Replacement Paths Problem

• On a conjecture for the signless Laplacian spectral radius of cacti with given matching number

• Bayesian binary quantile regression for the analysis of Bachelor-Master transition

• Near-Optimal Active Learning of Multi-Output Gaussian Processes

• Gaussian Process Planning with Lipschitz Continuous Reward Functions: Towards Unifying Bayesian Optimization, Active Learning, and Beyond

• The UIPQ seen from a point at infinity along its geodesic ray

• Zoom Better to See Clearer: Huamn Part Segmentation with Auto Zoom Net

• The a-graph coloring problem

• Data-dependent Initializations of Convolutional Neural Networks

• Discovering Internal Representations from Object-CNNs Using Population Encoding

• Reconstructing complex networks with binary-state dynamics

• Practical survival analysis tools for heterogeneous cohorts and informative censoring

• A State Calculus for Graph Coloring

• Mapping Images to Sentiment Adjective Noun Pairs with Factorized Neural Nets

• Expected Number and Height Distribution of Critical Points of Smooth Isotropic Gaussian Random Fields

• Semi-supervised Bootstrapping approach for Named Entity Recognition

• EMinRET: Heuristic for Energy-Aware VM Placement with Fixed Intervals and Non-preemption

• Smooth, identifiable supermodels of discrete DAG models with latent variables

• Learning visual groups from co-occurrences in space and time

• Optimal control of branching diffusion processes: a finite horizon problem

• Levi’s Lemma, pseudolinear drawings of $K_n$, and empty triangles

• Adding Gradient Noise Improves Learning for Very Deep Networks

• Conducting sparse feature selection on arbitrarily long phrases in text corpora with a focus on interpretability

• The 4-Regular Edge-Transitive Graphs of Girth 4

• Computerizing the Andrews-Fraenkel-Sellers Proofs on the Number of m-ary partitions mod m (and doing MUCH more!)

• Pseudoachromatic and connected-pseudoachromatic indices of the complete graph

• Burning a Graph is Hard

• Unifying and Strengthening Hardness for Dynamic Problems via the Online Matrix-Vector Multiplication Conjecture

• PLDA with Two Sources of Inter-session Variability