Efficient Learning by Directed Acyclic Graph For Resource Constrained Prediction

We study the problem of reducing test-time acquisition costs in classification systems. Our goal is to learn decision rules that adaptively select sensors for each example as necessary to make a confident prediction. We model our system as a directed acyclic graph (DAG) where internal nodes correspond to sensor subsets and decision functions at each node choose whether to acquire a new sensor or classify using the available measurements. This problem can be naturally posed as an empirical risk minimization over training data. Rather than jointly optimizing such a highly coupled and non-convex problem over all decision nodes, we propose an efficient algorithm motivated by dynamic programming. We learn node policies in the DAG by reducing the global objective to a series of cost sensitive learning problems. Our approach is computationally efficient and has proven guarantees of convergence to the optimal system for a fixed architecture. In addition, we present an extension to map other budgeted learning problems with large number of sensors to our DAG architecture and demonstrate empirical performance exceeding state-of-the-art algorithms for data composed of both few and many sensors.

Parser for Abstract Meaning Representation using Learning to Search

We develop a novel technique to parse English sentences into Abstract Meaning Representation (AMR) using SEARN, a Learning to Search approach, by modeling the concept and the relation learning in a unified framework. We evaluate our parser on multiple datasets from varied domains and show an absolute improvement of 2% to 6% over the state-of-the-art. Additionally we show that using the most frequent concept gives us a baseline that is stronger than the state-of-the-art for concept prediction. We plan to release our parser for public use.

Using Shortlists to Support Decision Making and Improve Recommender System Performance

In this paper, we study the impact of design choices for recommender systems on one-choice tasks where users want to select one item out of a variety of options. Instead of focusing on only user factors or recommendation quality, we consider how an interface design that provides the user with digital short-term memory impacts both user behavior and recommendation quality. In particular, we focus on improving recommendations and user experience through the use of shortlists. A shortlist is a temporary list of candidates that the user is currently considering, e.g., a list of a few movies the user is currently considering for viewing. Users can off-load the items they want to keep in memory to the shortlist — thereby decreasing their cognitive load. Since shortlists are designed to support the user’s task, they provide more implicit feedback naturally as part of the workflow in making decisions rather than artificially soliciting explicit feedback. This feedback provides additional data for training recommendation systems which in turn improves how well the recommendation models predict a user’s preferences. We perform a user study with a movie recommendation setup to compare interfaces that offer shortlist support with those that do not. From the user studies we conclude: (i) users make better decisions with a shortlist; (ii) users prefer an interface with shortlist support; and (iii) the additional implicit feedback from sessions with a shortlist improves the quality of recommendations by nearly a factor of two.

Empirical Study on Deep Learning Models for QA

In this paper we explore deep learning models with memory component or attention mechanism for question answering task. We combine and compare three models, Neural Machine Translation, Neural Turing Machine, and Memory Networks for a simulated QA data set. This paper is the first one that uses Neural Machine Translation and Neural Turing Machines for solving QA tasks. Our results suggest that the combination of attention and memory have potential to solve certain QA problem.

Object Oriented Analysis using Natural Language Processing concepts: A Review

The Software Development Life Cycle (SDLC) starts with eliciting requirements of the customers in the form of Software Requirement Specification (SRS). SRS document needed for software development is mostly written in Natural Language(NL) convenient for the client. From the SRS document only, the class name, its attributes and the functions incorporated in the body of the class are traced based on pre-knowledge of analyst. The paper intends to present a review on Object Oriented (OO) analysis using Natural Language Processing (NLP) techniques. This analysis can be manual where domain expert helps to generate the required diagram or automated system, where the system generates the required diagram, from the input in the form of SRS.

The Human Kernel

Bayesian nonparametric models, such as Gaussian processes, provide a compelling framework for automatic statistical modelling: these models have a high degree of flexibility, and automatically calibrated complexity. However, automating human expertise remains elusive, for example, Gaussian processes with standard kernels struggle on function extrapolation problems that are trivial for human learners. In this paper, we create function extrapolation problems and acquire human responses, and then design a kernel learning framework to reverse engineer the inductive biases of human learners across a set of behavioral experiments. We use the learned kernels to gain psychological insights and to extrapolate in human-like ways that go beyond traditional stationary and polynomial kernels. Finally, we investigate Occam’s razor in human and Gaussian process based function learning.

A Framework for Distributed Deep Learning Layer Design in Python

In this paper, a framework for testing Deep Neural Network (DNN) design in Python is presented. First, big data, machine learning (ML), and Artificial Neural Networks (ANNs) are discussed to familiarize the reader with the importance of such a system. Next, the benefits and detriments of implementing such a system in Python are presented. Lastly, the specifics of the system are explained, and some experimental results are presented to prove the effectiveness of the system.

Comparative Document Analysis for Large Text Corpora

This paper presents a novel research problem on joint discovery of commonalities and differences between two individual documents (or document sets), called Comparative Document Analysis (CDA). Given any pair of documents from a document collection, CDA aims to automatically identify sets of quality phrases to summarize the commonalities of both documents and highlight the distinctions of each with respect to the other informatively and concisely. Our solution uses a general graph-based framework to derive novel measures on phrase semantic commonality and pairwise distinction}, and guides the selection of sets of phrases by solving two joint optimization problems. We develop an iterative algorithm to integrate the maximization of phrase commonality or distinction measure with the learning of phrase-document semantic relevance in a mutually enhancing way. Experiments on text corpora from two different domains—scientific publications and news—demonstrate the effectiveness and robustness of the proposed method on comparing individual documents. Our case study on comparing news articles published at different dates shows the power of the proposed method on comparing document sets.

A queueing/inventory and an insurance risk model

Finite range decomposition for a general class of elliptic operators

New bounds on Simonyi’s conjecture

Edge conflicts do not determine geodesics in the associahedron

Even Orientations and Pfaffian graphs

Inter-layer synchronization in multiplex networks

Multiple binomial sums

Edge-Linear First-Order Dependency Parsing with Undirected Minimum Spanning Tree Inference

Polynomial Chaos-based Bayesian Inference of K-Profile Parametrization in a General Circulation Model of the Tropical Pacific

A Parallel algorithm for $\mathcal{X}$-Armed bandits

Packing large trees of consecutive orders

Recognizing Union-Find trees is NP-complete

Results on the solutions of maximum weighted Renyi entropy problems

On the law of homogeneous stable functionals

Mixed molecular motor traffic on nucleic acid tracks: models of transcriptional interference and regulation of gene expression

A geometric Achlioptas process

Anisotropic scaling of random grain model with application to network traffic

Crushing runtimes in adiabatic quantum computation with Energy Landscape Manipulation (ELM): Application to Quantum Factoring

How to merge three different methods for information filtering ?

Weighted scores estimating equations for longitudinal ordinal data

Fast hierarchical solvers for sparse matrices

Empirical Uncertain Bayes Methods in Area-level Models

Distributed Communication in Bare-Bones Wireless Networks

Local semicircle law under moment conditions. Part I: The Stieltjes transform

Safe Control under Uncertainty

Packing densities of layered permutations and the minimum number of monotone sequences in layered permutations

Scaled subordinators and generalizations of the Indian buffet process

Analysis of Cortical Morphometric Variability Using Labeled Cortical Distance Maps

On partitions with fixed number of even-indexed and odd-indexed odd parts

High dimensional regression and matrix estimation without tuning parameters

A Bootstrap Likelihood approach to Bayesian Computation

Long-range Acoustic Interactions in Insect Swarms: An Adaptive Gravity Model

Federated Scheduling Admits No Constant Speedup Factors for Constrained-Deadline DAG Task System

Aqua Computing: Coupling Computing and Communications

Distributional Results Relating to the Posterior of a Dirichlet Process Prior

Formulas for Partition $k$-Tuples with $t$-Cores

Handle slides for delta-matroids

Pricing of high-dimensional options

Intuitive Considerations Clarifying the Origin and Applicability of the Benford Law

Erdos distinct distance problem and related results over finite valuation rings

An Efficient Implementation for WalkSAT

Upper bounds for the dimension of tori acting on GKM manifolds

On End-to-End Program Generation from User Intention by Deep Neural Networks

Vehicle Speed Prediction using Deep Learning

Statistical Parsing by Machine Learning from a Classical Arabic Treebank

Descent c-Wilf Equivalence

On the Dominating Set Problem in Random Graphs

Spectral bounds for the $k$-independence number of a graph

Memory-Adjustable Navigation Piles with Applications to Sorting and Convex Hulls

Normal Power Series Class of Distributions: Model, Properties and Applications

On problems of Danzer and Gowers and dynamics on the space of closed subsets of $\mathbb{R}^d$

Fast and Scalable Lasso via Stochastic Frank-Wolfe Methods with a Convergence Guarantee

Evolutionary Landscape and Management of Population Diversity

Fast Parameter Estimation in Loss Tomography for Networks of General Topology

Finding Golden Nuggets by Reduction

Dependent Random Density Functions with Common Atoms and Pairwise Dependence

Mobility and Energy Conscious Clustering Protocol for Wireless Networks

Data-driven detrending of nonstationary fractal time series with echo state networks

An adaptive-to-model test for partially parametric single-index models

$L_p$ regular sparse hypergraphs: box norms

$L_p$ regular sparse hypergrahps

Induced minors and well-quasi-ordering

Non-separable Dynamic Nearest-Neighbor Gaussian Process Models for Large spatio-temporal Data With an Application to Particulate Matter Analysis

Bayesian Inference for High Dimensional Changing Linear Regression with Application to Minnesota House Price Index Data

CoCoLasso for High-dimensional Error-in-variables Regression

Dynamic programming approach to principal-agent problems

Theoretical Analysis of Nonparametric Filament Estimation

Combine CRF and MMSEG to Boost Chinese Word Segmentation in Social Media

On the maximum running time in graph bootstrap percolation

Comment on ‘Direct evidence for strong crossover of collective excitations and fast sound in the supercritical state’

On the existence of unparalleled even cycle systems

On the Hamilton-Waterloo Problem with odd orders

Inference with Dyadic Data: Asymptotic Behavior of the Dyadic-Robust t-Statistic

Eigenvector dynamics under perturbation of modular networks

Surprising Relations Between Sums-Of-Squares of Characters of the Symmetric Group Over Two-Rowed Shapes and Over Hook Shapes