Ontology-driven Information Extraction

Homogeneous unstructured data (HUD) are collections of unstructured documents that share common properties, such as similar layout, common file format, or common domain of values. Building on such properties, it would be desirable to automatically process HUD to access the main information through a semantic layer (typically an ontology) called a semantic view. Hence, we propose an ontology-based approach for extracting semantically rich information from HUD, by integrating and extending recent technologies and results from the fields of classical information extraction, table recognition, ontologies, text annotation, and logic programming. Moreover, we design and implement a system, named KnowRex, that has been successfully applied to curricula vitae in the Europass style to offer a semantic view of them, making it possible, for example, to select those that exhibit required skills.


Modeling Progress in AI

Participants in recent discussions of AI-related issues ranging from intelligence explosion to technological unemployment have made diverse claims about the nature, pace, and drivers of progress in AI. However, these theories are rarely specified in enough detail to enable systematic evaluation of their assumptions or to extrapolate progress quantitatively, as is often done with some success in other technological domains. After reviewing relevant literatures and justifying the need for more rigorous modeling of AI progress, this paper contributes to that research program by suggesting ways to account for the relationship between hardware speed increases and algorithmic improvements in AI, the role of human inputs in enabling AI capabilities, and the relationships between different sub-fields of AI. It then outlines ways of tailoring AI progress models to generate insights on the specific issue of technological unemployment, and outlines future directions for research on AI progress.


Domain Adaptation and Transfer Learning in StochasticNets

Transfer learning is a recent field of machine learning research that aims to resolve the challenge of dealing with insufficient training data in the domain of interest. This is a particular issue with traditional deep neural networks, where a large amount of training data is needed. Recently, StochasticNets were proposed to take advantage of sparse connectivity in order to decrease the number of parameters that need to be learned, which in turn may relax training data size requirements. In this paper, we study the efficacy of transfer learning on StochasticNet frameworks. Experimental results show a ~7% improvement in StochasticNet performance when transfer learning is applied in the training step.


Learning the Preferences of Ignorant, Inconsistent Agents

An important use of machine learning is to learn what people value. What posts or photos should a user be shown? Which jobs or activities would a person find rewarding? In each case, observations of people’s past choices can inform our inferences about their likes and preferences. If we assume that choices are approximately optimal according to some utility function, we can treat preference inference as Bayesian inverse planning. That is, given a prior on utility functions and some observed choices, we invert an optimal decision-making process to infer a posterior distribution on utility functions. However, people often deviate from approximate optimality. They have false beliefs, their planning is sub-optimal, and their choices may be temporally inconsistent due to hyperbolic discounting and other biases. We demonstrate how to incorporate these deviations into algorithms for preference inference by constructing generative models of planning for agents who are subject to false beliefs and time inconsistency. We explore the inferences these models make about preferences, beliefs, and biases. We present a behavioral experiment in which human subjects perform preference inference given the same observations of choices as our model. Results show that human subjects (like our model) explain choices in terms of systematic deviations from optimal behavior and suggest that they take such deviations into account when inferring preferences.
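
The inversion step described above (a prior on utility functions, observed choices, and a posterior on utility functions) can be illustrated with a minimal sketch. This is not the paper's model: it assumes one-shot choices with a softmax (approximately optimal) choice rule, a small hand-written hypothesis space, and no false beliefs or time inconsistency. The option names, utility values, and the noise parameter `alpha` are all hypothetical.

```python
import math

# Hypothetical options an agent can choose between.
OPTIONS = ["donut", "salad", "noodles"]

# Hypothetical hypothesis space: each entry is one candidate utility function.
UTILITIES = {
    "likes_donut": {"donut": 2.0, "salad": 0.0, "noodles": 1.0},
    "likes_salad": {"donut": 0.0, "salad": 2.0, "noodles": 1.0},
    "likes_noodles": {"donut": 1.0, "salad": 0.0, "noodles": 2.0},
}


def choice_likelihood(choice, utility, alpha=2.0):
    """Probability of `choice` under a softmax choice rule.

    Larger `alpha` means choices are closer to exactly optimal.
    """
    weights = {o: math.exp(alpha * utility[o]) for o in OPTIONS}
    return weights[choice] / sum(weights.values())


def posterior(observed_choices, prior=None):
    """Bayes' rule over candidate utility functions given observed choices."""
    names = list(UTILITIES)
    if prior is None:
        # Assumption: uniform prior over the candidate utility functions.
        prior = {name: 1.0 / len(names) for name in names}
    unnormalized = {}
    for name in names:
        likelihood = 1.0
        for choice in observed_choices:
            likelihood *= choice_likelihood(choice, UTILITIES[name])
        unnormalized[name] = prior[name] * likelihood
    total = sum(unnormalized.values())
    return {name: value / total for name, value in unnormalized.items()}


# Example: the agent is observed choosing salad twice and noodles once.
post = posterior(["salad", "salad", "noodles"])
for name, prob in sorted(post.items(), key=lambda kv: -kv[1]):
    print(f"{name}: {prob:.4f}")
```

In this toy setting the posterior concentrates on the salad-preferring utility function. Handling false beliefs or hyperbolic discounting, as the abstract describes, would require replacing `choice_likelihood` with a generative model of planning for a biased agent.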


Coloring curves intersecting a fixed line

Morphological Inflection Generation Using Character Sequence to Sequence Learning

Expectation propagation for diffusion processes by moment closure approximations

Poset splitting and minimality of finite models

Bayesian anti-sparse coding

On the shelling antimatroids of split graphs

When the extension property does not hold for vector space alphabets

Asymptotic Behavior of Mean Partitions in Consensus Clustering

Semi-discretization for stochastic scalar conservation laws with multiple rough fluxes

Products of random variables and the first digit phenomenon

The Cohen-Macaulayness of the bounded complex of an affine oriented matroid

Intrinsic Volumes of Polyhedral Cones: A combinatorial perspective

Diagonally and antidiagonally symmetric alternating sign matrices of odd order

Density estimation on the rotation group using diffusive wavelets

Nonlinear dielectric spectroscopy in a fragile plastic crystal

Distance-regular Cayley graphs with least eigenvalue $-2$

Advice Complexity of the Online Induced Subgraph Problem

Well-quasi-ordering and finite distinguishing number

Borell’s formula for a Riemannian manifold and applications

Can Pretrained Neural Networks Detect Anatomy?

Homology of spaces of directed paths in Euclidean pattern spaces

The uniqueness of a distance-regular graph with intersection array {32,27,8,1;1,4,27,32} and related results

Improved Balanced Flow Computation Using Parametric Flow

Complexity and Approximation of the Fuzzy K-Means Problem

Generation of cubic graphs and snarks with large girth

A combinatorial Hopf algebra for the boson normal ordering problem

A sharp threshold for van der Waerden’s theorem in random subsets

A Planning based Framework for Essay Generation

On Cyclic Kautz Digraphs

A novel Bayesian strategy for the identification of spatially-varying material properties and model validation: an application to static elastography

On Cube Tilings of Tori and Classification of Perfect Codes in the Maximum Metric

The scaling limit of a particle system with long-range interaction

Non-representable hyperbolic matroids

A faster fixed parameter algorithm for two-layer crossing minimization

The Quadripolar Relational Model: an Artificial Intelligence framework for the description of personality disorders

Entropy inequalities for stable densities and strengthened central limit theorems

Conditioning in spatial point processes

Optimization of Tree Modes for Parallel Hash Functions

Infinite friezes and triangulations of the strip

Deep Poisson Factorization Machines: factor analysis for mapping behaviors in journalist ecosystem

Learning Deep Convolutional Neural Networks for Places2 Scene Recognition

HALO: Report and Predicted Response Times

Successive Ray Refinement and Its Application to Coordinate Descent for LASSO