Angrier Birds: Bayesian reinforcement learning

We train a reinforcement learner to play a simplified version of the game Angry Birds. The learner is provided with a game state in a manner similar to the output that could be produced by computer vision algorithms. We improve on the efficiency of regular $\epsilon$-greedy Q-Learning with linear function approximation through more systematic exploration in Randomized Least Squares Value Iteration (RLSVI), an algorithm that samples its policy from a posterior distribution on optimal policies. With larger state-action spaces, efficient exploration becomes increasingly important, as evidenced by the faster learning in RLSVI.
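
The baseline named in this abstract, epsilon-greedy Q-learning with a linear value function, can be sketched as follows. This is a minimal illustration of the standard technique, not the authors' implementation and not RLSVI; the function names, the feature function, and all parameter values are assumptions made for the example.

```python
import random


def q_value(weights, features):
    """Linear Q estimate: dot product of the weights and the feature vector."""
    return sum(w * f for w, f in zip(weights, features))


def epsilon_greedy(weights, feature_fn, state, actions, epsilon, rng):
    """Pick a random action with probability epsilon, else the greedy action.

    feature_fn(state, action) is a hypothetical featurizer supplied by the caller.
    """
    if rng.random() < epsilon:
        return rng.choice(actions)
    return max(actions, key=lambda a: q_value(weights, feature_fn(state, a)))


def q_learning_update(weights, features, reward, next_max_q, alpha, gamma):
    """One semi-gradient Q-learning step for a linear approximator.

    The TD error is reward + gamma * max_a' Q(s', a') - Q(s, a); each weight
    moves along its feature by alpha times that error.
    """
    td_error = reward + gamma * next_max_q - q_value(weights, features)
    return [w + alpha * td_error * f for w, f in zip(weights, features)]
```

With `epsilon` set to zero the policy is purely greedy, so all exploration comes from the random branch. That undirected exploration is what the abstract contrasts with the posterior sampling used by RLSVI.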

Recurrent Memory Network for Language Modeling

Recurrent Neural Networks (RNNs) have obtained excellent results in many natural language processing (NLP) tasks. However, understanding and interpreting the source of this success remains a challenge. In this paper, we propose the Recurrent Memory Network (RMN), a novel RNN architecture that not only amplifies the power of RNNs but also facilitates our understanding of their internal functioning and allows us to discover underlying patterns in data. We demonstrate the power of RMN on language modeling and sentence completion tasks. On language modeling, RMN outperforms the Long Short-Term Memory (LSTM) network on three large German, Italian, and English datasets. Additionally, we perform an in-depth analysis of various linguistic dimensions that RMN captures. On the Sentence Completion Challenge, for which it is essential to capture sentence coherence, our RMN obtains 69.2% accuracy, surpassing the previous state of the art by a large margin.

Some Experimental Issues in Financial Fraud Detection: An Investigation

Financial fraud detection is an important problem with a number of design aspects to consider. Issues such as algorithm selection and performance analysis affect the perceived ability of proposed solutions, so for auditors and researchers to be able to detect financial fraud adequately, these issues must be thoroughly explored. In this paper we revisit the key performance metrics used for financial fraud detection, with a focus on credit card fraud, critiquing the prevailing ideas and offering our own understandings. Many different performance metrics have been employed in prior financial fraud detection research. We analyse several of the popular metrics and compare their effectiveness at measuring the ability of detection mechanisms. We further investigate the performance of a range of computational intelligence techniques when applied to this problem domain, and explore the efficacy of several binary classification methods.
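
To make the point about metric choice concrete, the sketch below computes a few widely used binary classification metrics from a confusion matrix. The abstract does not say which metrics the paper analyses, so the selection here (accuracy, precision, recall, F1, and the Matthews correlation coefficient) and the function name are assumptions chosen for illustration.

```python
import math


def confusion_metrics(tp, fp, tn, fn):
    """Return common binary classification metrics from confusion-matrix counts.

    Ratios with a zero denominator are reported as 0.0 by convention here.
    """
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1, "mcc": mcc}
```

On a hypothetical set of 1000 transactions with 10 frauds, a detector that flags nothing (`confusion_metrics(0, 0, 990, 10)`) has high accuracy but zero recall, which is why accuracy alone can be misleading on heavily imbalanced fraud data.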

A simple technique for improving multi-class classification with neural networks

We present a novel method to perform multi-class pattern classification with neural networks and test it on a challenging 3D hand gesture recognition problem. Our method consists of a standard one-against-all (OAA) classification, followed by another network layer classifying the resulting class scores, possibly augmented by the original raw input vector. This allows the network to disambiguate hard-to-separate classes, since the distribution of class scores itself carries considerable information and is in fact often used for assessing the confidence of a decision. We show that this approach significantly boosts our results, both overall and for particular difficult cases, on the hard 10-class gesture classification task.
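
The two-stage structure described above can be sketched as a forward pass: one-against-all scores first, then a second layer that classifies those scores, optionally concatenated with the raw input. This is only an illustrative sketch; the sigmoid scoring, the single linear second layer, and every name and weight below are assumptions, and training is omitted.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def linear(weights, bias, x):
    """Affine map producing one output per row of weights."""
    return [sum(w * v for w, v in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def oaa_scores(x, oaa_w, oaa_b):
    """Stage 1: one independent sigmoid score per class (one-against-all)."""
    return [sigmoid(z) for z in linear(oaa_w, oaa_b, x)]


def two_stage_predict(x, oaa_w, oaa_b, top_w, top_b, augment=True):
    """Stage 2: classify the OAA score vector, optionally with the raw input.

    Returns the index of the class with the largest second-stage output.
    """
    scores = oaa_scores(x, oaa_w, oaa_b)
    stage2_in = scores + list(x) if augment else scores
    logits = linear(top_w, top_b, stage2_in)
    return max(range(len(logits)), key=logits.__getitem__)
```

Because the second stage sees the whole score vector rather than only its maximum, it can in principle learn that a particular pattern of near-ties indicates a specific class, which is the disambiguation effect the abstract describes.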

Streaming Gibbs Sampling for LDA Model

Streaming variational Bayes (SVB) is successful in learning LDA models in an online manner. However, previous attempts to develop online Monte Carlo methods for LDA have had little success, often yielding much worse perplexity than their batch counterparts. We present a streaming Gibbs sampling (SGS) method, an online extension of collapsed Gibbs sampling (CGS). Our empirical study shows that SGS can reach perplexity similar to that of CGS and much better than that of SVB. Our distributed version of SGS, DSGS, is much more scalable than SVB, mainly because the communication complexity of its updates is small.
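
For orientation, the batch collapsed Gibbs sampler that SGS extends can be sketched as below. This shows only standard batch CGS for LDA with symmetric priors; the streaming and distributed mechanics are not specified in the abstract and are not attempted here. The function name, hyperparameter defaults, and the integer word-id encoding of documents are assumptions.

```python
import random


def collapsed_gibbs_lda(docs, num_topics, vocab_size,
                        alpha=0.1, beta=0.01, sweeps=50, seed=0):
    """Batch collapsed Gibbs sampling for LDA.

    docs is a list of documents, each a list of integer word ids in
    range(vocab_size). Returns (assignments, doc_topic, topic_word).
    """
    rng = random.Random(seed)
    doc_topic = [[0] * num_topics for _ in docs]
    topic_word = [[0] * vocab_size for _ in range(num_topics)]
    topic_total = [0] * num_topics
    assignments = []

    # Random initial topic for every token, with matching counts.
    for d, doc in enumerate(docs):
        zs = []
        for w in doc:
            k = rng.randrange(num_topics)
            zs.append(k)
            doc_topic[d][k] += 1
            topic_word[k][w] += 1
            topic_total[k] += 1
        assignments.append(zs)

    for _ in range(sweeps):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the token's current assignment from the counts.
                k = assignments[d][i]
                doc_topic[d][k] -= 1
                topic_word[k][w] -= 1
                topic_total[k] -= 1
                # Unnormalized conditional:
                # (n_dk + alpha) * (n_kw + beta) / (n_k + V * beta)
                weights = [
                    (doc_topic[d][t] + alpha) * (topic_word[t][w] + beta)
                    / (topic_total[t] + vocab_size * beta)
                    for t in range(num_topics)
                ]
                k = rng.choices(range(num_topics), weights=weights)[0]
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][w] += 1
                topic_total[k] += 1
    return assignments, doc_topic, topic_word
```

Each token update touches only a handful of integer counts, so the state a sampler has to maintain or exchange is a set of count tables.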

A Survey on Social Media Anomaly Detection

Social media anomaly detection is of critical importance for preventing malicious activities such as bullying, terrorist attack planning, and the dissemination of fraudulent information. With the recent popularity of social media, new types of anomalous behavior have arisen, causing concern among various parties. While a large amount of work has been dedicated to traditional anomaly detection problems, we observe a surge of research interest in the new realm of social media anomaly detection. In this paper, we present a survey of existing approaches to this problem. We focus on the new types of anomalous phenomena in social media and review the recently developed techniques for detecting those special types of anomalies. We provide a general overview of the problem domain, common formulations, existing methodologies, and potential directions. With this work, we hope to draw the research community's attention to this challenging problem and open up new directions to which we can contribute in the future.

Absolute Continuity of the Laws of the Solutions to Parabolic SPDEs with Two Reflecting Walls

Language to Logical Form with Neural Attention

Adaptive Approximation of the Minimum of Brownian Motion

The number of dominating $k$-sets of paths, cycles and wheels

Global well-posedness of the dynamic $Φ^4_3$ model on the torus

Further results on arc and bar k-visibility graphs

Adaptive and Efficient Nonlinear Channel Equalization for Underwater Acoustic Communication

Density of 4-edge paths in graphs with fixed edge density

Matroids over hyperfields

Three-coloring triangle-free graphs on surfaces VII. A linear-time algorithm

Part-of-Speech Tagging for Code-mixed Indian Social Media Text at ICON 2015

On Bayesian index policies for sequential resource allocation

On the partial order competition dimensions of chordal graphs

Approximation of backward stochastic differential equations using Malliavin weights and least-squares regression

An intuitive Bayesian spatial model for disease mapping that accounts for scaling

Non-informative reparameterisations for location-scale mixtures

Equivalence between direct and indirect effects with different sets of intermediate variables and covariates

Ruin probability in the three-seasonal discrete-time risk model

Changes in Spatio-temporal Precipitation Patterns in Changing Climate Conditions

On spectral measures of random Jacobi matrices

Distance between exact and approximate distributions of partial maxima under power normalization

On packing dimension preservation by distribution functions of random variables with independent $\tilde{Q}$-digits

Central limit theorems for long range dependent spatial linear processes

Minimal normal graph covers

Option pricing in the model with stochastic volatility driven by Ornstein–Uhlenbeck process. Simulation

Statistical methods for linguistic research: Foundational Ideas – Part I

Bayesian Analysis in Non-linear Non-Gaussian State-Space Models using Particle Gibbs

Block bootstrap for the empirical process of long-range dependent data

A pragmatic approach to multi-class classification

Linear regression by observations from mixture with varying concentrations

Tempered Hermite process

Notions of Connectivity in Overlay Networks

Joint distributions for stochastic functional differential equations

Regularity of Binomial Edge Ideals of Certain Block Graphs

Incorporating Structural Alignment Biases into an Attentional Neural Translation Model

Efficient tensor completion: Low-rank tensor train

Model comparison for dependent generalized linear model

Sparse approximation problem: how rapid simulated annealing succeeds and fails

Multi-Way, Multilingual Neural Machine Translation with a Shared Attention Mechanism

The strong total rainbow connection of graphs

Wikiometrics: A Wikipedia Based Ranking System

Growth, collapse, and self-organized criticality in complex networks

Dataflow Graphs as Matrices and Programming with Higher-order Matrix Elements

A new 3-parameter extension of generalized Lindley distribution

The Dynamics of the Forest Graph Operator

Estimating Functional Linear Mixed-Effects Regression Models

Stochastic spatial model for the division of labor in social insects

Universal Coating for Programmable Matter

Models, Methods and Network Topology: Experimental Design for the Study of Interference

Large deviations of particle systems in random interaction

Homomorphisms of Strongly Regular Graphs