A Cognitive Architecture Based on a Learning Classifier System with Spiking Classifiers

Learning Classifier Systems (LCS) are population-based reinforcement learners that were originally designed to model various cognitive phenomena. This paper presents an explicitly cognitive LCS by using spiking neural networks as classifiers, providing each classifier with a measure of temporal dynamism. We employ a constructivist model of growth of both neurons and synaptic connections, which permits a Genetic Algorithm (GA) to automatically evolve sufficiently-complex neural structures. The spiking classifiers are coupled with a temporally-sensitive reinforcement learning algorithm, which allows the system to perform temporal state decomposition by appropriately rewarding ‘macro-actions,’ created by chaining together multiple atomic actions. The combination of temporal reinforcement learning and neural information processing is shown to outperform benchmark neural classifier systems, and successfully solve a robotic navigation task.

An Event Network for Exploring Open Information

In this paper, an event network is presented for exploring open information, where linguistic units about an event are organized for analysis. The process is divided into three steps: document event detection, event network construction, and event network analysis. First, by implementing event detection or tracking, documents are organized retrospectively (or on-line) into document events. Second, for each document event, linguistic units are extracted and combined into event networks. Third, various analytic methods are proposed for event network analysis. In our application, methodologies are presented for exploring open information.

Bayesian Networks for Variable Groups

Bayesian networks, and especially their structures, are powerful tools for representing conditional independencies and dependencies between random variables. In applications where related variables form a priori known groups, chosen to represent different ‘views’ on, or aspects of, the same entities, one may be more interested in modeling dependencies between groups of variables than between individual variables. Motivated by this, we study the prospects of representing relationships between variable groups using Bayesian network structures. We show that for dependency structures between groups to be learnable, the data have to satisfy the so-called groupwise faithfulness assumption. We also show that one cannot learn causal relations between groups using only groupwise conditional independencies; variable-wise relations are also needed. Additionally, we present algorithms for finding the groupwise dependency structures.

Calibration of One-Class SVM for MV set estimation

A general approach for anomaly detection or novelty detection consists in estimating high density regions or Minimum Volume (MV) sets. The One-Class Support Vector Machine (OCSVM) is a state-of-the-art algorithm for estimating such regions from high dimensional data. Yet it suffers from practical limitations. When applied to a limited number of samples, it can lead to poor performance even when picking the best hyperparameters. Moreover, the solution of OCSVM is very sensitive to the selection of hyperparameters, which makes it hard to optimize in an unsupervised setting. We present a new approach to estimate MV sets using the OCSVM with a different choice of the parameter controlling the proportion of outliers. The solution function of the OCSVM is learnt on a training set, and the desired probability mass is obtained by adjusting the offset on a test set to prevent overfitting. Models learnt on different train/test splits are then aggregated to reduce the variance induced by such random splits. Our approach makes it possible to tune the hyperparameters automatically and obtain nested set estimates. Experimental results show that our approach outperforms the standard OCSVM formulation while suffering less from the curse of dimensionality than kernel density estimates. Results on actual data sets are also presented.
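
The split-and-calibrate idea described above can be illustrated with a minimal sketch. It is not the authors' implementation: the scoring function below is a hypothetical one-dimensional stand-in for the OCSVM decision function (it only uses the distance to the training mean), and aggregating by averaging the offset-shifted scores is an assumption, since the abstract does not say how the models are combined. All function names are illustrative.

```python
import random


def fit_score(train):
    """Hypothetical stand-in for a learnt OCSVM decision function.

    Returns a function whose value is higher for more 'normal' points.
    """
    mean = sum(train) / len(train)
    return lambda x: -abs(x - mean)


def calibrate_offset(score, held_out, alpha):
    """Choose the offset so that about a fraction alpha of held-out points
    score at or above it."""
    scores = sorted((score(x) for x in held_out), reverse=True)
    k = max(1, round(alpha * len(scores)))
    return scores[k - 1]


def aggregated_mv_set(data, alpha, n_splits=10, seed=0):
    """Fit on one half, calibrate the offset on the other half, repeat over
    random splits, and aggregate (assumption: by averaging shifted scores)."""
    rng = random.Random(seed)
    models = []
    for _ in range(n_splits):
        shuffled = data[:]
        rng.shuffle(shuffled)
        half = len(shuffled) // 2
        train, held_out = shuffled[:half], shuffled[half:]
        score = fit_score(train)
        models.append((score, calibrate_offset(score, held_out, alpha)))

    def in_set(x):
        # A point belongs to the estimated set when its average
        # offset-shifted score is non-negative.
        return sum(s(x) - off for s, off in models) / len(models) >= 0

    return in_set
```

Calling `aggregated_mv_set(samples, 0.9)` returns a membership test for a region intended to hold roughly 90% of the probability mass; with a real OCSVM, `fit_score` would be replaced by the fitted decision function.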

Computational Sociolinguistics: A Survey

Language is a social phenomenon and inherent to its social nature is that it is constantly changing. Recently, a surge of interest can be observed within the computational linguistics (CL) community in the social dimension of language. In this article we present a survey of the emerging field of ‘Computational Sociolinguistics’ that reflects this increased interest. We aim to provide a comprehensive overview of CL research on sociolinguistic themes, featuring topics such as the relation between language and social identity, language use in social interaction and multilingual communication. Moreover, we demonstrate the potential for synergy between the research communities involved, by showing how the large-scale data-driven methods that are widely used in CL can complement existing sociolinguistic studies, and how sociolinguistics can inform and challenge the methods and assumptions employed in CL studies. We hope to convey the possible benefits of a closer collaboration between the two communities and conclude with a discussion of open challenges.

Graphulo: Linear Algebra Graph Kernels for NoSQL Databases

Big data and the Internet of Things era continue to challenge computational systems. Several technology solutions such as NoSQL databases have been developed to deal with this challenge. In order to generate meaningful results from large datasets, analysts often use a graph representation which provides an intuitive way to work with the data. Graph vertices can represent users and events, and edges can represent the relationship between vertices. Graph algorithms are used to extract meaningful information from these very large graphs. At MIT, the Graphulo initiative is an effort to perform graph algorithms directly in NoSQL databases such as Apache Accumulo or SciDB, which have an inherently sparse data storage scheme. Sparse matrix operations have a history of efficient implementations and the Graph Basic Linear Algebra Subprogram (GraphBLAS) community has developed a set of key kernels that can be used to develop efficient linear algebra operations. However, in order to use the GraphBLAS kernels, it is important that common graph algorithms be recast using the linear algebra building blocks. In this article, we look at common classes of graph algorithms and recast them into linear algebra operations using the GraphBLAS building blocks.
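
As one illustration of recasting a graph algorithm in linear algebra terms, breadth-first search can be written as repeated products of a frontier vector with the adjacency matrix over a Boolean (OR/AND) semiring. The sketch below uses plain Python lists rather than any GraphBLAS or Graphulo API, and the function name is illustrative only.

```python
def bfs_levels(adj, source):
    """Breadth-first search levels via frontier-times-adjacency products.

    adj[i][j] is truthy when there is an edge i -> j. Unreachable
    vertices keep the level -1.
    """
    n = len(adj)
    levels = [-1] * n
    levels[source] = 0
    frontier = [False] * n
    frontier[source] = True
    depth = 0
    while any(frontier):
        depth += 1
        # Boolean vector-matrix product: next[j] = OR_i (frontier[i] AND adj[i][j]),
        # masked so that already-visited vertices are not revisited.
        nxt = [
            levels[j] == -1 and any(frontier[i] and adj[i][j] for i in range(n))
            for j in range(n)
        ]
        for j in range(n):
            if nxt[j]:
                levels[j] = depth
        frontier = nxt
    return levels


# Edges: 0->1, 0->2, 1->3, 2->3, 3->4; vertex 5 is isolated.
example_adj = [
    [0, 1, 1, 0, 0, 0],
    [0, 0, 0, 1, 0, 0],
    [0, 0, 0, 1, 0, 0],
    [0, 0, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
]
example_levels = bfs_levels(example_adj, 0)  # [0, 1, 1, 2, 3, -1]
```

In a sparse setting the inner product would only touch stored entries, which is what makes this formulation a natural fit for sparse storage schemes.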

ROOT – A C++ Framework for Petabyte Data Storage, Statistical Analysis and Visualization

ROOT is an object-oriented C++ framework conceived in the high-energy physics (HEP) community, designed for storing and analyzing petabytes of data in an efficient way. Any instance of a C++ class can be stored into a ROOT file in a machine-independent compressed binary format. In ROOT the TTree object container is optimized for statistical data analysis over very large data sets by using vertical data storage techniques. These containers can span a large number of files on local disks, the web, or a number of different shared file systems. In order to analyze this data, the user can choose from a wide set of mathematical and statistical functions, including linear algebra classes, numerical algorithms such as integration and minimization, and various methods for performing regression analysis (fitting). In particular, ROOT offers packages for complex data modeling and fitting, as well as multivariate classification based on machine learning techniques. Central to these analysis tools are the histogram classes, which provide binning of one- and multi-dimensional data. Results can be saved in high-quality graphical formats like PostScript and PDF or in bitmap formats like JPG or GIF. The result can also be stored into ROOT macros that allow a full recreation and rework of the graphics. Users typically create their analysis macros step by step, making use of the interactive C++ interpreter CINT, while running over small data samples. Once the development is finished, they can run these macros at full compiled speed over large data sets, using on-the-fly compilation, or by creating a stand-alone batch program. Finally, if processing farms are available, the user can reduce the execution time of intrinsically parallel tasks (e.g. data mining in HEP) by using PROOF, which will take care of optimally distributing the work over the available resources in a transparent way.

VARX-L: Structured Regularization for Large Vector Autoregressions with Exogenous Variables

The vector autoregression (VAR) has long proven to be an effective method for modeling the joint dynamics of macroeconomic time series as well as forecasting. A major shortcoming of the VAR that has hindered its applicability is its heavy parameterization: the parameter space grows quadratically with the number of series included, quickly exhausting the available degrees of freedom. Consequently, forecasting using VARs is intractable for low-frequency, high-dimensional macroeconomic data. However, empirical evidence suggests that VARs that incorporate more component series tend to result in more accurate forecasts. Conventional methods that allow for the estimation of large VARs either tend to require ad hoc subjective specifications or are computationally infeasible. Moreover, as global economies become more intricately intertwined, there has been substantial interest in incorporating the impact of stochastic, unmodeled exogenous variables. Vector autoregression with exogenous variables (VARX) extends the VAR to allow for the inclusion of unmodeled variables, but it similarly faces dimensionality challenges. We introduce the VARX-L framework, a structured family of VARX models, and provide methodology which allows for both efficient estimation and accurate forecasting in high-dimensional analysis. VARX-L adapts several prominent scalar regression regularization techniques to a vector time series context to greatly reduce the parameter space of VAR and VARX models. We formulate convex optimization procedures that are amenable to efficient solutions for the time-ordered, high-dimensional problems we aim to solve. We also highlight a compelling extension that allows for shrinking toward reference models. We demonstrate the efficacy of VARX-L in both low- and high-dimensional macroeconomic applications and simulated data examples.
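
The quadratic growth mentioned above is easy to make concrete. A standard VAR with p lags on k series has one k-by-k coefficient matrix per lag, so k*k*p coefficients plus k intercepts. The sketch below counts them and checks whether unregularized equation-by-equation least squares has enough observations; the function names and the 160-observation example (40 years of quarterly data) are illustrative assumptions, not taken from the paper.

```python
def var_parameter_count(k, p, intercept=True):
    """Number of mean parameters in a VAR(p) on k series:
    one k-by-k coefficient matrix per lag, plus optional intercepts."""
    return k * k * p + (k if intercept else 0)


def ols_is_feasible(k, p, n_obs, intercept=True):
    """Equation-by-equation least squares needs at least as many usable
    observations (n_obs - p) as regressors per equation (k * p, plus one
    for the intercept)."""
    regressors = k * p + (1 if intercept else 0)
    return n_obs - p >= regressors


if __name__ == "__main__":
    # Hypothetical setting: 4 lags, 160 quarterly observations.
    for k in (3, 10, 20, 40):
        print(k, var_parameter_count(k, 4), ols_is_feasible(k, 4, 160))
```

With 4 lags, going from 3 to 40 series raises the count from 39 to 6440 parameters, and at 40 series each equation has more regressors (161) than usable observations (156), which is the regime where regularization becomes necessary.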

Word Representations, Tree Models and Syntactic Functions

Word representations induced from models with discrete latent variables (e.g. HMMs) have been shown to be beneficial in many NLP applications. In this work, we exploit labeled syntactic dependency trees and formalize the induction problem as unsupervised learning of tree-structured hidden Markov models. Syntactic functions are used as additional observed variables in the model, influencing both transition and emission components. Such syntactic information can potentially lead to capturing more fine-grained and functional distinctions between words, which, in turn, may be desirable in many NLP applications. We evaluate the word representations on two tasks: named entity recognition and semantic frame identification. We observe improvements from exploiting syntactic function information in both cases, with results rivaling those of state-of-the-art representation learning methods. Additionally, we revisit the relationship between sequential and unlabeled-tree models and find that the advantage of the latter is not self-evident.

A BSDE arising in an exponential utility maximization problem in a pure jump market model

A Characterization of Triangle-free Gorenstein graphs and Cohen-Macaulayness of second powers of edge ideals

A linear time algorithm for quantum 2-SAT

A Note on Ihara Zeta Function of Large Random Graphs

A Note on the Importance of Weak Convergence Rates for SPDE Approximations in Multilevel Monte Carlo Schemes

A reaction diffusion-like formalism for plastic neural networks reveals dissipative solitons at criticality

A recursive structure of sand pile model and its applications

A relaxation of the strong Bordeaux Conjecture

A Secretary Problem with a Sliding Window for Recalling Applicants

A variational approach to some transport inequalities

Approximating (Unweighted) Tree Augmentation via Lift-and-Project, Part I: Stemless TAP

Approximating the Sum of Correlated Lognormals: An Implementation

Asymptotic Validity of the Bayes-Inspired Indifference Zone Procedure: The Non-Normal Known Variance Case

Averaging for SDE-BSDE with null recurrent fast component Application to homogenization in a non periodic media

Bayesian Joint Hierarchical Model for Prediction of Latent Health States with Application to Active Surveillance of Prostate Cancer

Bi-log-concave distribution functions

Bounds and power means for the general Randic index

Chromatic bases for symmetric functions

Colored Bin Packing

Compressed Cliques Graphs, Clique Coverings and Positive Zero Forcing

Convergence Implications via Dual Flow Method

Convergence in Density in Finite Time Windows and the Skorohod Representation

Convolution of probability measures on Lie groups and homogenous spaces

Core partitions with distinct parts

Critical surface of the hexagonal polygon model

Cycle lengths and minimum degree of graphs

Decentralized Online Optimization with Global Objectives and Local Communication

Dictionary Learning for Blind One Bit Compressed Sensing

Dimensions of random covering sets in Riemann manifolds

Directional Decision Lists and Identification of Problem Symptoms

Dynamics of a stochastic ratio-dependent predator-prey model

Energy solutions of KPZ are unique

Ethnicity sensitive author disambiguation using semi-supervised learning

Explicit Expressions for the Variance and Higher Moments of the Size of a Simultaneous Core Partition and its Limiting Distribution

Feature Selection via Binary Simultaneous Perturbation Stochastic Approximation

Forecasting Exchange Rates Using Time Series Analysis: The sample of the currency of Kazakhstan

Generalized Gompertz-power series distributions

Helly’s Theorem: New Variations and Applications

Hydrostatics and dynamical large deviations for a reaction-diffusion model

Intersection-Link Representations of Graphs

Interval scheduling maximizing minimum coverage

Intrinsic Bayesian Analysis for Occupancy Models

Invariance principles for random walks in cones

Khovanov-Rozansky Homology and Directed Cycles

Kuratowski monoids of $n$-topological spaces

Large-scale heterogeneous service systems with general packing constraints

Learning to Aggregate Information for Sequential Inferences

Linked Component Analysis from Matrices to High Order Tensors: Applications to Biomedical Data

Local and global robustness in conjugate Bayesian analysis

Massively Parallel Algorithms for the Lattice Boltzmann Method on Non-uniform Grids

Maximal regularity for abstract stochastic linear evolution equations

Maximum Persistency via Iterative Relaxed Inference with Graphical Models

Metastability for the Ising Model on the hypercube

Model Guided Sampling Optimization for Low-dimensional Problems

Model selection in logistic regression

Moment estimates for chaoses generated by symmetric random variables with logarithmically convex tails

Neural Machine Translation of Rare Words with Subword Units

Non-negative submodular stochastic probing via stochastic contention resolution schemes

On graphs uniquely defined by their $k$-circular matroids

On recursive predictive distributions

On the completeness and sufficiency of uniformly minimum variance unbiased estimators

On the contraction properties of some high-dimensional quasi-posterior distributions

On the trace of random walks on random graphs

On zero-sum $\mathbb{Z}_{2j}^k$-magic graphs

Online Model Evaluation in a Large-Scale Computational Advertising Platform

Parallel Approximate Steady-state Analysis of Large Probabilistic Boolean Networks (Technical Report)

Parameter Sensitivity Analysis of the Energy/Frequency Convexity Rule for Nanometer-scale Application Processors

Percolation of even sites for enhanced random sequential adsorption

Poincaré-type Inequalities for Singular Stable-Like Dirichlet Forms

Power of Graph-Based Two-Sample Tests

Query Workload-based RDF Graph Fragmentation and Allocation

Random conical tessellations

Rate functions for symmetric markov processes via heat kernel

Relative complements and a ‘switch’-classification of simple graphs

Remarks on spectral gaps on the Riemannian path space

Revival of oscillation from mean-field-induced death: Theory and experiment

SDL based validation of a node monitoring protocol

Simultaneous Embeddings with Few Bends and Crossings

Some New Methods for Constructing 4-critical Planar Graphs

Some results on abstract stochastic semilinear evolution equations

Spatiotemporal Intermittency on the Growing Surface of Coupled Sandpiles

Statistically-estimated tree composition for the northeastern United States at the time of Euro-American settlement

Strong rate of convergence for the Euler-Maruyama approximation of SDEs with Hölder continuous drift coefficient

Synchronization versus neighborhood similarity in complex networks of non-identical oscillators

The $q$-analog of Kostant’s partition function and the highest root of the classical Lie algebras

The infinite rate symbiotic branching model: from discrete to continuous space

The inverse first-passage problem and optimal stopping

The linear strand of determinantal facet ideals

The rectilinear local crossing number of $K_n$

Tight chiral polyhedra

Two descent statistics over 321-avoiding centrosymmetric involutions

Weak Tail Conditions for Local Martingales

X-TREPAN: a multi class regression and adapted extraction of comprehensible decision tree in artificial neural networks