A Chain-Detection Algorithm for Two-Dimensional Grids
We describe a general method of detecting valid chains or links of pieces on a two-dimensional grid, using the chess variant known as Switch-Side Chain-Chess (SSCC) as an example. At present, no foolproof method of detecting such chains in an arbitrary chess position is known, and existing graph theory, to our knowledge, does not fully address the problem either. We therefore propose a solution, implemented and tested in C++. We have been unable to find an incorrect result and therefore offer it as the most viable solution thus far to the chain-detection problem in this chess variant. The algorithm is also scalable, in principle, to areas beyond two-dimensional grids, such as 3D analysis and molecular chemistry.
A Paradigm for Situated and Goal-Driven Language Learning
A distinguishing property of human intelligence is the ability to flexibly use language in order to communicate complex ideas with other humans in a variety of contexts. Research in natural language dialogue should focus on designing communicative agents which can integrate themselves into these contexts and productively collaborate with humans. In this abstract, we propose a general situated language learning paradigm which is designed to bring about robust language agents able to cooperate productively with humans.
How Many Components should be Retained from a Multivariate Time Series PCA?
We report on the results of two new approaches to deciding how many principal components to retain from an analysis of a multivariate time series. The first uses a ‘heat map’ based approach. A heat map in this context refers to a series of principal component coefficients created by applying a sliding window to a multivariate time series. The heat maps can also provide detailed insights into how the structure of each principal component evolves over time. The second examines how the angle of the principal component changes over time within the high-dimensional data space. We provide evidence that both are useful in studying the structure and evolution of a multivariate time series.
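As a rough illustration of the two ideas in this abstract, the sketch below computes the leading principal component in each sliding window (the rows of a would-be heat map) and the angle between components from different windows. It is a minimal sketch, not the authors' procedure: the function names, the use of power iteration, and the window handling are all assumptions made for illustration.

```python
import math

def covariance(rows):
    """Sample covariance matrix of a list of equal-length observation rows."""
    n, p = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(p)]
    centered = [[r[j] - means[j] for j in range(p)] for r in rows]
    return [[sum(c[i] * c[j] for c in centered) / (n - 1) for j in range(p)]
            for i in range(p)]

def leading_component(cov, iters=500):
    """Leading eigenvector by power iteration (hypothetical helper).

    Assumes the start vector is not orthogonal to the leading eigenvector.
    """
    p = len(cov)
    v = [1.0 / (j + 1) for j in range(p)]
    norm = math.sqrt(sum(x * x for x in v))
    v = [x / norm for x in v]
    for _ in range(iters):
        w = [sum(cov[i][j] * v[j] for j in range(p)) for i in range(p)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:  # zero covariance: no direction to find
            break
        v = [x / norm for x in w]
    return v

def sliding_pc1(series, window):
    """One row of leading-component coefficients per window position."""
    return [leading_component(covariance(series[t:t + window]))
            for t in range(len(series) - window + 1)]

def angle_between(u, v):
    """Angle in radians between two unit vectors, ignoring sign flips."""
    dot = abs(sum(a * b for a, b in zip(u, v)))
    return math.acos(min(1.0, dot))

# Toy series: variance lies along the first variable, then along the second.
series = ([[(-1.0) ** t, 0.0] for t in range(20)]
          + [[0.0, (-1.0) ** t] for t in range(20)])
pcs = sliding_pc1(series, window=10)
rotation = angle_between(pcs[0], pcs[-1])
```

Plotting `pcs` as an image (window position against variable) would give a heat map in the sense described above; here `rotation` measures how far the leading direction has turned between the first and last windows.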
On statistical learning via the lens of compression
This work continues the study of the relationship between sample compression schemes and statistical learning, which has been mostly investigated within the framework of binary classification. The central theme of this work is establishing equivalences between learnability and compressibility, and utilizing these equivalences in the study of statistical learning theory. We begin with the setting of multiclass categorization (zero/one loss). We prove that in this case learnability is equivalent to compression of logarithmic sample size, and that uniform convergence implies compression of constant size. We then consider Vapnik’s general learning setting: we show that in order to extend the compressibility-learnability equivalence to this case, it is necessary to consider an approximate variant of compression. Finally, we provide some applications of the compressibility-learnability equivalences: (i) Agnostic-case learnability and realizable-case learnability are equivalent in multiclass categorization problems (in terms of sample complexity). (ii) This equivalence between agnostic-case learnability and realizable-case learnability does not hold for general learning problems: There exists a learning problem whose loss function takes just three values, under which agnostic-case and realizable-case learnability are not equivalent. (iii) Uniform convergence implies compression of constant size in multiclass categorization problems. Part of the argument includes an analysis of the uniform convergence rate in terms of the graph dimension, in which we improve upon previous bounds. (iv) A dichotomy for sample compression in multiclass categorization problems: If a non-trivial compression exists then a compression of logarithmic size exists. (v) A compactness theorem for multiclass categorization problems.
Towards a Theoretical Analysis of PCA for Heteroscedastic Data
Principal Component Analysis (PCA) is a method for estimating a subspace given noisy samples. It is useful in a variety of problems ranging from dimensionality reduction to anomaly detection and the visualization of high dimensional data. PCA performs well in the presence of moderate noise and even with missing data, but is also sensitive to outliers. PCA is also known to have a phase transition when noise is independent and identically distributed; recovery of the subspace sharply declines at a threshold noise variance. Effective use of PCA requires a rigorous understanding of these behaviors. This paper provides a step towards an analysis of PCA for samples with heteroscedastic noise, that is, samples that have non-uniform noise variances and so are no longer identically distributed. In particular, we provide a simple asymptotic prediction of the recovery of a one-dimensional subspace from noisy heteroscedastic samples. The prediction enables: a) easy and efficient calculation of the asymptotic performance, and b) qualitative reasoning to understand how PCA is impacted by heteroscedasticity (such as outliers).
Fast Training of Convolutional Neural Networks via Kernel Rescaling
Training deep Convolutional Neural Networks (CNN) is a time-consuming task that may take weeks to complete. In this article, we propose a novel, theoretically founded method for reducing CNN training time without incurring any loss in accuracy. The basic idea is to begin training with a pre-train network using lower-resolution kernels and input images, and then refine the results at the full resolution by exploiting the spatial scaling property of convolutions. We apply our method to the ImageNet winner OverFeat and to the more recent ResNet architecture and show a reduction in training time of nearly 20% while test set accuracy is preserved in both cases.
Image Based Camera Localization: an Overview
Recently, virtual reality, augmented reality, robotics, self-driving cars, and related fields have attracted much attention from industry, and image-based camera localization is a key task in all of them. An overview of image-based camera localization is therefore urgently needed. This paper presents such an overview, which should be useful to both researchers and engineers.
Optimistic Semi-supervised Least Squares Classification
The goal of semi-supervised learning is to improve supervised classifiers by using additional unlabeled training examples. In this work we study a simple self-learning approach to semi-supervised learning applied to the least squares classifier. We show that a soft-label and a hard-label variant of self-learning can be derived by applying block coordinate descent to two related but slightly different objective functions. The resulting soft-label approach is related to an idea about dealing with missing data that dates back to the 1930s. We show that the soft-label variant typically outperforms the hard-label variant on benchmark datasets and partially explain this behaviour by studying the relative difficulty of finding good local minima for the corresponding objective functions.
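The alternation described in this abstract can be sketched for a single feature as follows: fit a least squares line, impute labels for the unlabeled points (clipped to [0, 1] for the soft variant, thresholded at 0.5 for the hard variant), then refit on everything. This is only an illustrative reading of the abstract, not the paper's exact formulation; the 0/1 label encoding, the one-dimensional feature, and all function names are assumptions.

```python
def fit_line(xs, ys):
    """Closed-form least squares fit of y = a * x + b (xs must not all be equal)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    a = sxy / sxx
    return a, my - a * mx

def self_learn(x_lab, y_lab, x_unl, soft, iters=50):
    """Alternate between imputing unlabeled targets and refitting the line."""
    a, b = fit_line(x_lab, y_lab)
    for _ in range(iters):
        preds = [a * x + b for x in x_unl]
        if soft:
            # Soft labels: predictions clipped to the interval [0, 1].
            q = [min(1.0, max(0.0, p)) for p in preds]
        else:
            # Hard labels: predictions thresholded at 0.5.
            q = [1.0 if p > 0.5 else 0.0 for p in preds]
        a, b = fit_line(x_lab + x_unl, y_lab + q)
    return a, b

def classify(a, b, x):
    """Predict class 1 when the fitted line exceeds 0.5, else class 0."""
    return 1 if a * x + b > 0.5 else 0

# Toy data: two labeled points and four unlabeled points.
x_lab, y_lab = [-1.0, 1.0], [0.0, 1.0]
x_unl = [-2.0, -1.5, 1.5, 2.0]
hard_fit = self_learn(x_lab, y_lab, x_unl, soft=False)
soft_fit = self_learn(x_lab, y_lab, x_unl, soft=True)
```

Each loop iteration updates one block of variables while holding the other fixed, which is the sense in which the two variants resemble block coordinate descent on objectives that differ only in the set the imputed labels may come from.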
The Weak Efficient Market Hypothesis in Light of Statistical Learning
We make an unprecedented evaluation of statistical learning methods to forecast daily returns. Using a randomization test to adjust for data snooping, several models are found statistically significant on the tested equity indices: CSI 300, FTSE, and S&P 500. A best Sharpe ratio portfolio has abnormal returns on the S&P 500, breaking even with the market at 10 bps in round trip costs. The returns produce statistically significant intercept for factor regression models, qualifying as a new anomalous 3-day crisis persistency factor. These results open the path towards a standardized usage of statistical learning methods in finance.
R-Linear Convergence of Limited Memory Steepest Descent
The limited memory steepest descent method (LMSD) proposed by Fletcher is an extension of the Barzilai-Borwein ‘two-point step size’ strategy for steepest descent methods for solving unconstrained optimization problems. It is known that the Barzilai-Borwein strategy yields a method with an R-linear rate of convergence when it is employed to minimize a strongly convex quadratic. This paper extends this analysis for LMSD, also for strongly convex quadratics. In particular, it is shown that the method is R-linearly convergent for any choice of the history length parameter. The results of numerical experiments are provided to illustrate behaviors of the method that are revealed through the theoretical analysis.
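For orientation, the Barzilai-Borwein ‘two-point step size’ that LMSD extends can be sketched on a strongly convex quadratic with a diagonal Hessian. This shows only the basic Barzilai-Borwein step, not Fletcher's LMSD; the function name, the diagonal test problem, and the choice of first step are assumptions made to keep the example small.

```python
def bb_steepest_descent(diag, b, x0, iters=1000, tol=1e-10):
    """Minimize f(x) = 0.5 * x^T A x - b^T x with A = diag(diag), diag > 0.

    Uses the step alpha = (s^T s) / (s^T y), where s is the last change in x
    and y is the last change in the gradient.
    """
    x = list(x0)
    g = [d * xi - bi for d, xi, bi in zip(diag, x, b)]
    alpha = 1.0 / max(diag)  # assumed conservative first step
    history = []             # gradient norms; need not decrease monotonically
    for _ in range(iters):
        gnorm = sum(gi * gi for gi in g) ** 0.5
        history.append(gnorm)
        if gnorm < tol:
            break
        x_new = [xi - alpha * gi for xi, gi in zip(x, g)]
        g_new = [d * xi - bi for d, xi, bi in zip(diag, x_new, b)]
        s = [u - v for u, v in zip(x_new, x)]
        y = [u - v for u, v in zip(g_new, g)]
        sy = sum(si * yi for si, yi in zip(s, y))
        ss = sum(si * si for si in s)
        x, g = x_new, g_new
        if sy > 0:  # always true here unless the step was exactly zero
            alpha = ss / sy
    return x, history

# Eigenvalues 1, 10, 100; the minimizer is x_i = b_i / diag_i.
x_min, grad_norms = bb_steepest_descent([1.0, 10.0, 100.0],
                                        [1.0, 1.0, 1.0],
                                        [0.0, 0.0, 0.0])
```

The gradient norm under this step size is typically nonmonotone from one iteration to the next, which is why the convergence result is stated as R-linear rather than as a per-step contraction.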
• Dynamically enriched topological orders in driven two-dimensional systems
• NMR lineshape of $^{29}$Si in single-crystal silicon
• Sparse Channel Estimation for Massive MIMO with 1-bit Feedback per Dimension
• Enhancing Secrecy with Multi-Antenna Transmission in Millimeter Wave Vehicular Communication Systems
• Transfer from Simulation to Real World through Learning Deep Inverse Dynamics Model
• Supermarket Queueing System in the Heavy Traffic Regime. Short Queue Dynamics
• Improved Parallel Construction of Wavelet Trees and Rank/Select Structures
• Cooperative Strategies for Wireless-Powered Communications
• Pattern avoidance and fiber bundle structures on Schubert varieties
• Capacity bounds for distributed storage
• Quantum automata cannot detect biased coins, even in the limit
• Visual Place Recognition with Probabilistic Vertex Voting
• Monotone Empirical Bayes Estimators for the Reproduction Number in Borel-Tanner Distribution
• Finding Bidder-Optimal Core Points Quickly
• Robust self-testing of many-qubit states
• On Seneta-Heyde Scaling for a stable branching random walk
• Minimax Filter: Learning to Preserve Privacy from Inference Attacks
• An efficient multiple imputation algorithm for control-based and delta-adjusted pattern mixture models using SAS
• QCMA hardness of ground space connectivity for commuting Hamiltonians
• New families of Strictly optimal Frequency hopping sequence sets
• Combinatorial differential operators in: Faà di Bruno formula, enumeration of ballot paths, enriched rooted trees and increasing rooted trees
• Subspace clustering based on low rank representation and weighted nuclear norm minimization
• Maximum entropy models for generation of expressive music
• Composite likelihood inference for spatio-temporal data on multicolor cell growth
• Correlations between real and complex zeros of a random polynomial
• Law of large numbers for the SIR model with random vertex weights on Erdős-Rényi graph
• The Analysis of Local Motion and Deformation in Image Sequences Inspired by Physical Electromagnetic Interaction
• A Model of Virtual Carrier Immigration in Digital Images for Region Segmentation
• The Virtual Electromagnetic Interaction between Digital Images for Image Matching with Shifting Transformation
• Optimizing Memory Efficiency for Deep Convolutional Neural Networks on GPUs
• RetiNet: Automatic AMD identification in OCT volumetric data
• Analyzing the Affect of a Group of People Using Multi-modal Framework
• An Inter-User Interference Suppression Method in Full-Duplex Networks
• Multi-Task Curriculum Transfer Deep Learning of Clothing Attributes
• Deep Fruit Detection in Orchards
• Recovering asymmetric communities in the stochastic block model
• Light Field Compression with Disparity Guided Sparse Coding based on Structural Key Views
• Smallest not $C_{2l+1}$-colourable graphs of odd-girth $2k+1$
• Bayesian models for data missing not at random in health examination surveys
• A $q$-Robinson-Schensted-Knuth Algorithm and a $q$-polymer
• Local asymptotic normality property for fractional Gaussian noise under high-frequency observations
• Localization in High-Dimensional Monte Carlo Filtering
• Structure Properties of Koch Networks Based on Networks Dynamical Systems
• Generating captions without looking beyond objects
• Dynamic R&D Competition under Uncertainty and Strategic Disclosure
• Backward stochastic differential equations with Young drift
• Discovering Small Target Sets in Social Networks: A Fast and Effective Algorithm
• Post Selection Inference with Kernels
• Fair user traffic association in cache equipped cellular networks
• Burst Transmission Symbol Synchronization in the Presence of Cycle Slip Arising from Different Clock Frequencies
• Exploring the Entire Regularization Path for the Asymmetric Cost Linear Support Vector Machine
• Dividing goods and bads under additive utilities
• Semi-supervised Discovery of Informative Tweets During the Emerging Disasters
• Language Models with GloVe Word Embeddings
• Detecting Unseen Falls from Wearable Devices using Channel-wise Ensemble of Autoencoders
• Large subgraphs in pseudo-random graphs
• Technical Report: Improved Fourier Reconstruction using Jump Information with Applications to MRI
• SentiHood: Targeted Aspect Based Sentiment Analysis Dataset for Urban Neighbourhoods
• Parallelizing Stochastic Approximation Through Mini-Batching and Tail-Averaging
• Sharp exponential inequalities in survey sampling: conditional Poisson sampling schemes
• Deep disentangled representations for volumetric reconstruction
• Video Depth-From-Defocus
• Change-point detection in high-dimensional covariance structure
• Computing the Expected Value and Variance of Geometric Measures
• Decentralized Coded Caching with Distinct Cache Capacities
• Introduction to the ‘Industrial Benchmark’
• On Probabilistic Checking in Perfect Zero Knowledge
• Robust Scheduling for Flexible Processing Networks
• Domain-specific Question Generation from a Knowledge Base
• Lyndon word decompositions and pseudo orbits on q-nary graphs
• A Continuous Model of Cortical Connectivity
• Disparity of clustering coefficients in the Holme-Kim network model
• Recursive Diffeomorphism-Based Regression for Shape Functions
• Wilson loop expectations in $SU(N)$ lattice gauge theory
• Cooperation driven by success-driven group formation
• Variational approximation of functionals defined on 1-dimensional connected sets: the planar case