Scale-Free Online Learning

We design algorithms for online linear optimization that have optimal regret and at the same time do not need to know any upper or lower bounds on the norm of the loss vectors. We achieve adaptiveness to the norms of the loss vectors by scale invariance, i.e., our algorithms make exactly the same decisions if the sequence of loss vectors is multiplied by any positive constant. One of our algorithms works for any decision set, bounded or unbounded. For unbounded decisions sets, this is the first adaptive algorithm for online linear optimization with a non-vacuous regret bound. We also study a popular scale-free variant of online mirror descent algorithm, and we show that in two natural settings it has linear or worse regret.


Numerical Coding of Nominal Data

In this paper, a novel approach for coding nominal data is proposed. For the given nominal data, a rank in a form of complex number is assigned. The proposed method does not lose any information about the attribute and brings other properties previously unknown. The approach based on these knew properties can been used for classification. The analyzed example shows that classification with the use of coded nominal data or both numerical as well as coded nominal data is more effective than the classification, which uses only numerical data.


Song Recommendation with Non-Negative Matrix Factorization and Graph Total Variation

This work formulates a novel song recommender system as a matrix completion problem that benefits from collaborative filtering through Non-negative Matrix Factorization (NMF) and content-based filtering via total variation (TV) on graphs. The graphs encode both playlist proximity information and song similarity, using a rich combination of audio, meta-data and social features. As we demonstrate, our hybrid recommendation system is very versatile and incorporates several well-known methods while outperforming them. Particularly, we show on real-world data that our model overcomes w.r.t. two evaluation metrics the recommendation of models solely based on low-rank information, graph-based information or a combination of both.


Hawkes Graphs

This paper introduces the Hawkes skeleton and the Hawkes graph. These notions summarize the branching structure of a multivariate Hawkes point process in a compact and fertile way. In particular, we explain how the graph view is useful for the specification and estimation of Hawkes models from large, multitype event streams. Based on earlier work, we give a nonparametric statistical procedure to estimate the Hawkes skeleton and the Hawkes graph from data. We show how the graph estimation may then be used for choosing and fitting parametric Hawkes models. Our method avoids the a priori assumptions on the model from a straighforward MLE-approach and it is numerically more flexible than the latter. A simulation study confirms that the presented procedure works as desired. We give special attention to computational issues in the implementation. This makes our results applicable to high-dimensional event-stream data, such as dozens of event streams and thousands of events per component.


A Prefixed-Itemset-Based Improvement For Apriori Algorithm

Association rules is a very important part of data mining. It is used to find the interesting patterns from transaction databases. Apriori algorithm is one of the most classical algorithms of association rules, but it has the bottleneck in efficiency. In this article, we proposed a prefixed-itemset-based data structure for candidate itemset generation, with the help of the structure we managed to improve the efficiency of the classical Apriori algorithm.


Algebraic File Synchronization: Adequacy and Completeness

With distributed computing and mobile applications, synchronizing diverging replicas of data structures is a more and more common problem. We use algebraic methods to reason about filesystem operations, and introduce a simplified definition of conflicting updates to filesystems. We also define algorithms for update detection and reconciliation and present rigorous proofs that they not only work as intended, but also cannot be improved on. To achieve this, we introduce a novel, symmetric set of filesystem commands with higher information content, which removes edge cases and increases the predictive powers of our algebraic model. We also present a number of generally useful classes and properties of sequences of commands. These results are often intuitive, but providing exact proofs for them is far from trivial. They contribute to our understanding of this special type of algebraic model, and toward building more complete algebras of filesystem trees and extending algebraic approaches to other data storage protocols. They also form a theoretical basis for specifying and guaranteeing the error-free operation of applications that implement an algebraic approach to synchronization.


Learning to Compose Neural Networks for Question Answering

We describe a question answering model that applies to both images and structured knowledge bases. The model uses natural language strings to automatically assemble neural networks from a collection of composable modules. Parameters for these modules are learned jointly with network-assembly parameters via reinforcement learning, with only (world, question, answer) triples as supervision. Our approach, which we term a dynamic neural model network, achieves state-of-the-art results on benchmark datasets in both visual and structured domains.


Limit Theorems for Longest Monotone Subsequences in Random Mallows Permuations

A Note on Automatic Data Transformation

Small sample methods for cluster-robust variance estimation and hypothesis testing in fixed effects models

Cox process representation and inference for stochastic reaction-diffusion processes

Hyperelliptic graphs and metrized complexes

On computing tree and path decompositions with metric constraints on the bags

Bayesian Crossover Designs for Generalized Linear Models

On paths, stars and wyes in trees

Nonparametric semi-supervised learning of class proportions

Radial parts of Haar measures and probability distributions on the space of rational matrix-valued functions

A Predictive Model using the Markov Property

Vortices in a Stochastic Parabolic Ginzburg-Landau Equation

Towards Semantic Integration of Heterogeneous Sensor Data with Indigenous Knowledge for Drought Forecasting

Toward a Robust Diversity-Based Model to Detect Changes of Context

The Cost of Global Broadcast in Dynamic Radio Networks

About the Suitability of Clouds in High-Performance Computing

Rough flows and homogenization in stochastic turbulence

Research Project: Text Engineering Tool for Ontological Scientometry

Pathwidth and nonrepetitive list coloring

Dynamic Environments for Virtual Machine Placement considering Elasticity and Overbooking

Symmetry classes of alternating sign matrices in the nineteen-vertex model

Cohomology of Complements of Toric Arrangements Associated to Root Systems

An Extended Empirical Saddlepoint Approximation for Intractable Likelihoods

Derivative and real roots of graph polynomials

Expected number of real roots of random trigonometric polynomials

Acyclicity in Edge-Colored Graphs

Highly incidental patterns on a quadratic hypersurface in $\mathbb{R}^4$

Brownian Bridges on Random Intervals

Advanced Bag-of-Temporal-SIFT-Words for Time Series Classification

On recognising frame and lifted-graphic matroids

Transforming phylogenetic networks: Moving beyond tree space

Algebraic Combinatorics on Trace Monoids: Extending Number Theory to Walks on Graphs

Odd partitions in Young’s lattice

Choosability with limited number of colors

Orthonormal polynomial expansions and lognormal sum densities

Performance of QAOA on Typical Instances of Constraint Satisfaction Problems with Bounded Degree

Integration by Parts Formula and Applications for SPDEs with Jumps

Profiling-Assisted Decoupled Access-Execute

Following the evolution of metastable glassy states under external perturbations: compression and shear-strain