Angrier Birds: Bayesian reinforcement learning
We train a reinforcement learner to play a simplified version of the game Angry Birds. The learner is provided with a game state in a manner similar to the output that could be produced by computer vision algorithms. We improve on the efficiency of regular {\epsilon}-greedy Q-Learning with linear function approximation through more systematic exploration in Randomized Least Squares Value Iteration (RLSVI), an algorithm that samples its policy from a posterior distribution on optimal policies. With larger state-action spaces, efficient exploration becomes increasingly important, as evidenced by the faster learning in RLSVI.
Recurrent Memory Network for Language Modeling
Recurrent Neural Networks (RNN) have obtained excellent result in many natural language processing (NLP) tasks. However, understanding and interpreting the source of this success remains a challenge. In this paper, we propose Recurrent Memory Network (RMN), a novel RNN architecture, that not only amplifies the power of RNN but also facilitates our understanding of its internal functioning and allows us to discover underlying patterns in data. We demonstrate the power of RMN on language modeling and sentence completion tasks. On language modeling, RMN outperforms Long Short-Term Memory (LSTM) network on three large German, Italian, and English dataset. Additionally we perform in-depth analysis of various linguistic dimensions that RMN captures. On Sentence Completion Challenge, for which it is essential to capture sentence coherence, our RMN obtains 69.2% accuracy, surpassing the previous state-of-the-art by a large margin.
Some Experimental Issues in Financial Fraud Detection: An Investigation
Financial fraud detection is an important problem with a number of design aspects to consider. Issues such as algorithm selection and performance analysis will affect the perceived ability of proposed solutions, so for auditors and re-searchers to be able to sufficiently detect financial fraud it is necessary that these issues be thoroughly explored. In this paper we will revisit the key performance metrics used for financial fraud detection with a focus on credit card fraud, critiquing the prevailing ideas and offering our own understandings. There are many different performance metrics that have been employed in prior financial fraud detection research. We will analyse several of the popular metrics and compare their effectiveness at measuring the ability of detection mechanisms. We further investigated the performance of a range of computational intelligence techniques when applied to this problem domain, and explored the efficacy of several binary classification methods.
A simple technique for improving multi-class classification with neural networks
We present a novel method to perform multi-class pattern classification with neural networks and test it on a challenging 3D hand gesture recognition problem. Our method consists of a standard one-against-all (OAA) classification, followed by another network layer classifying the resulting class scores, possibly augmented by the original raw input vector. This allows the network to disambiguate hard-to-separate classes as the distribution of class scores carries considerable information as well, and is in fact often used for assessing the confidence of a decision. We show that by this approach we are able to significantly boost our results, overall as well as for particular difficult cases, on the hard 10-class gesture classification task.
Streaming Gibbs Sampling for LDA Model
Streaming variational Bayes (SVB) is successful in learning LDA models in an online manner. However previous attempts toward developing online Monte-Carlo methods for LDA have little success, often by having much worse perplexity than their batch counterparts. We present a streaming Gibbs sampling (SGS) method, an online extension of the collapsed Gibbs sampling (CGS). Our empirical study shows that SGS can reach similar perplexity as CGS, much better than SVB. Our distributed version of SGS, DSGS, is much more scalable than SVB mainly because the updates’ communication complexity is small.
A Survey on Social Media Anomaly Detection
Social media anomaly detection is of critical importance to prevent malicious activities such as bullying, terrorist attack planning, and fraud information dissemination. With the recent popularity of social media, new types of anomalous behaviors arise, causing concerns from various parties. While a large amount of work have been dedicated to traditional anomaly detection problems, we observe a surge of research interests in the new realm of social media anomaly detection. In this paper, we present a survey on existing approaches to address this problem. We focus on the new type of anomalous phenomena in the social media and review the recent developed techniques to detect those special types of anomalies. We provide a general overview of the problem domain, common formulations, existing methodologies and potential directions. With this work, we hope to call out the attention from the research community on this challenging problem and open up new directions that we can contribute in the future.
• Absolute Continuity of the Laws of the Solutions to Parabolic SPDEs with Two Reflecting Walls
• Language to Logical Form with Neural Attention
• Adaptive Approximation of the Minimum of Brownian Motion
• The number of dominating $k$-sets of paths, cycles and wheels
• Global well-posedness of the dynamic $Φ^4_3$ model on the torus
• Further results on arc and bar k-visibility graphs
• Adaptive and Efficient Nonlinear Channel Equalization for Underwater Acoustic Communication
• Density of 4-edge paths in graphs with fixed edge density
• Matroids over hyperfields
• Three-coloring triangle-free graphs on surfaces VII. A linear-time algorithm
• Part-of-Speech Tagging for Code-mixed Indian Social Media Text at ICON 2015
• On Bayesian index policies for sequential resource allocation
• On the partial order competition dimensions of chordal graphs
• Approximation of backward stochastic differential equations using Malliavin weights and least-squares regression
• An intuitive Bayesian spatial model for disease mapping that accounts for scaling
• Non-informative reparameterisations for location-scale mixtures
• Equivalence between direct and indirect effects with different sets of intermediate variables and covariates
• Ruin probability in the three-seasonal discrete-time risk model
• Changes in Spatio-temporal Precipitation Patterns in Changing Climate Conditions
• On spectral measures of random Jacobi matrices
• Distance between exact and approximate distributions of partial maxima under power normalization
• On packing dimension preservation by distribution functions of random variables with independent $\tilde{Q}$-digits
• Central limit theorems for long range dependent spatial linear processes
• Minimal normal graph covers
• Option pricing in the model with stochastic volatility driven by Ornstein–Uhlenbeck process. Simulation
• Statistical methods for linguistic research: Foundational Ideas – Part I
• Bayesian Analysis in Non-linear Non-Gaussian State-Space Models using Particle Gibbs
• Block bootstrap for the empirical process of long-range dependent data
• A pragmatic approach to multi-class classification
• Linear regression by observations from mixture with varying concentrations
• Tempered Hermite process
• Notions of Connectivity in Overlay Networks
• Joint distributions for stochastic functional differential equations
• Regularity of Binomial Edge Ideals of Certain Block Graphs
• Incorporating Structural Alignment Biases into an Attentional Neural Translation Model
• Efficient tensor completion: Low-rank tensor train
• Model comparison for dependent generalized linear model
• Sparse approximation problem: how rapid simulated annealing succeeds and fails
• Multi-Way, Multilingual Neural Machine Translation with a Shared Attention Mechanism
• The strong total rainbow connection of graphs
• Wikiometrics: A Wikipedia Based Ranking System
• Growth, collapse, and self-organized criticality in complex networks
• Dataflow Graphs as Matrices and Programming with Higher-order Matrix Elements
• A new 3-parameter extension of generalized lindley distribution
• The Dynamics of the Forest Graph Operator
• Estimating Functional Linear Mixed-Effects Regression Models
• Stochastic spatial model for the division of labor in social insects
• Universal Coating for Programmable Matter
• Models, Methods and Network Topology: Experimental Design for the Study of Interference
• Large deviations of particle systems in random interaction
• Homomorphisms of Strongly Regular Graphs
Like this:
Like Loading...