Text Matching as Image Recognition

Matching two texts is a fundamental problem in many natural language processing tasks. An effective way is to extract meaningful matching patterns from words, phrases, and sentences to produce the matching score. Inspired by the success of convolutional neural network in image recognition, where neurons can capture many complicated patterns based on the extracted elementary visual patterns such as oriented edges and corners, we propose to model text matching as the problem of image recognition. Firstly, a matching matrix whose entries represent the similarities between words is constructed and viewed as an image. Then a convolutional neural network is utilized to capture rich matching patterns in a layer-by-layer way. We show that by resembling the compositional hierarchies of patterns in image recognition, our model can successfully identify salient signals such as n-gram and n-term matchings. Experimental results demonstrate its superiority against the baselines.


graphVizdb: A Scalable Platform for Interactive Large Graph Visualization

We present a novel platform for the interactive visualization of very large graphs. The platform enables the user to interact with the visualized graph in a way that is very similar to the exploration of maps at multiple levels. Our approach involves an offline preprocessing phase that builds the layout of the graph by assigning coordinates to its nodes with respect to a Euclidean plane. The respective points are indexed with a spatial data structure, i.e., an R-tree, and stored in a database. Multiple abstraction layers of the graph based on various criteria are also created offline, and they are indexed similarly so that the user can explore the dataset at different levels of granularity, depending on her particular needs. Then, our system translates user operations into simple and very efficient spatial operations (i.e., window queries) in the backend. This technique allows for a fine-grained access to very large graphs with extremely low latency and memory requirements and without compromising the functionality of the tool. Our web-based prototype supports three main operations: (1) interactive navigation, (2) multi-level exploration, and (3) keyword search on the graph metadata.


FLASH: Fast Bayesian Optimization for Data Analytic Pipelines

Modern data science relies on data analytic pipelines to organize interdependent computational steps. Such analytic pipelines often involve different algorithms across multiple steps, each with its own hyperparameters. To get the best performance, it is often critical to select optimal algorithms and set appropriate hyperparameters, which requires large computational efforts. Bayesian optimization provides a principled way for searching optimal hyperparameters for a single algorithm. However, many challenges remain in solving pipeline optimization problems with high-dimensional and highly conditional search space. In this work, we propose Fast LineAr SearcH (FLASH), an efficient method for tuning analytic pipelines. FLASH is a two-layer Bayesian optimization framework, which firstly uses a parametric model to select promising algorithms, then computes a nonparametric model to fine-tune hyperparameters of the promising algorithms. FLASH also includes an effective caching algorithm which can further accelerate the search process. Extensive experiments on a number of benchmark datasets have demonstrated that FLASH significantly outperforms previous state-of-the-art methods in both search speed and accuracy. Using 50% of the time budget, FLASH achieves up to 20% improvement on test error rate compared to the baselines. Our method also yields state-of-the-art performance on a real-world application for healthcare predictive modeling.


Computational Narrative Intelligence: A Human-Centered Goal for Artificial Intelligence

Narrative intelligence is the ability to craft, tell, understand, and respond affectively to stories. We argue that instilling artificial intelligences with computational narrative intelligence affords a number of applications beneficial to humans. We lay out some of the machine learning challenges necessary to solve to achieve computational narrative intelligence. Finally, we argue that computational narrative is a practical step towards machine enculturation, the teaching of sociocultural values to machines.


IOTSim: a Cloud based Simulator for Analysing IoT Applications

A disruptive technology that is influencing not only computing paradigm but every other business is the rise of big data. Internet of Things (IoT) applications are considered to be a major source of big data. Such IoT applications are in general supported through clouds where data is stored and processed by big data processing systems. In order to improve the efficiency of cloud infrastructure so that they can efficiently support IoT big data applications, it is important to understand how these applications and the corresponding big data processing systems will perform in cloud computing environments. However, given the scalability and complex requirements of big data processing systems, an empirical evaluation on actual cloud infrastructure can hinder the development of timely and cost effective IoT solutions. Therefore, a simulator supporting IoT applications in cloud environment is highly demanded, but such work is still in its infancy. To fill this gap, we have designed and implemented IOTSim which supports and enables simulation of IoT big data processing using MapReduce model in cloud computing environment. A real case study validates the efficacy of the simulator.


Machine learning meets network science: dimensionality reduction for fast and efficient embedding of networks in the hyperbolic space

Complex network topologies and hyperbolic geometry seem specularly connected, and one of the most fascinating and challenging problems of recent complex network theory is to map a given network to its hyperbolic space. The Popularity Similarity Optimization (PSO) model represents – at the moment – the climax of this theory. It suggests that the trade-off between node popularity and similarity is a mechanism to explain how complex network topologies emerge – as discrete samples – from the continuous world of hyperbolic geometry. The hyperbolic space seems appropriate to represent real complex networks. In fact, it preserves many of their fundamental topological properties, and can be exploited for real applications such as, among others, link prediction and community detection. Here, we observe for the first time that a topological-based machine learning class of algorithms – for nonlinear unsupervised dimensionality reduction – can directly approximate the network’s node angular coordinates of the hyperbolic model into a two-dimensional space, according to a similar topological organization that we named angular coalescence. On the basis of this phenomenon, we propose a new class of algorithms that offers fast and accurate coalescent embedding of networks in the hyperbolic space even for graphs with thousands of nodes.


Online Bounded Analysis

Though competitive analysis is often a very good tool for the analysis of online algorithms, sometimes it does not give any insight and sometimes it gives counter-intuitive results. Much work has gone into exploring other performance measures, in particular targeted at what seems to be the core problem with competitive analysis: the comparison of the performance of an online algorithm is made to a too powerful adversary. We consider a new approach to restricting the power of the adversary, by requiring that when judging a given online algorithm, the optimal offline algorithm must perform as well as the online algorithm, not just on the entire final request sequence, but also on any prefix of that sequence. This is limiting the adversary’s usual advantage of being able to exploit that it knows the sequence is continuing beyond the current request. Through a collection of online problems, including machine scheduling, bin packing, dual bin packing, and seat reservation, we investigate the significance of this particular offline advantage.


Principal Component Projection Without Principal Component Analysis

We show how to efficiently project a vector onto the top principal components of a matrix, without explicitly computing these components. Specifically, we introduce an iterative algorithm that provably computes the projection using few calls to any black-box routine for ridge regression. By avoiding explicit principal component analysis (PCA), our algorithm is the first with no runtime dependence on the number of top principal components. We show that it can be used to give a fast iterative method for the popular principal component regression problem, giving the first major runtime improvement over the naive method of combining PCA with regression. To achieve our results, we first observe that ridge regression can be used to obtain a ‘smooth projection’ onto the top principal components. We then sharpen this approximation to true projection using a low-degree polynomial approximation to the matrix step function. Step function approximation is a topic of long-term interest in scientific computing. We extend prior theory by constructing polynomials with simple iterative structure and rigorously analyzing their behavior under limited precision.


Clustering with a Reject Option: Interactive Clustering as Bayesian Prior Elicitation

A good clustering can help a data analyst to explore and understand a data set, but what constitutes a good clustering may depend on domain-specific and application-specific criteria. These criteria can be difficult to formalize, even when it is easy for an analyst to know a good clustering when they see one. We present a new approach to interactive clustering for data exploration called TINDER, based on a particularly simple feedback mechanism, in which an analyst can reject a given clustering and request a new one, which is chosen to be different from the previous clustering while fitting the data well. We formalize this interaction in a Bayesian framework as a method for prior elicitation, in which each different clustering is produced by a prior distribution that is modified to discourage previously rejected clusterings. We show that TINDER successfully produces a diverse set of clusterings, each of equivalent quality, that are much more diverse than would be obtained by randomized restarts.


Sharp detection in PCA under correlations: all eigenvalues matter

Principal component analysis (PCA) is a widely used method for dimension reduction. In high dimensional data, the ‘signal’ eigenvalues corresponding to weak principal components (PCs) do not necessarily separate from the bulk of the ‘noise’ eigenvalues. Therefore, popular tests based on the largest eigenvalue have little power to detect weak PCs. In the special case of the spiked model, certain tests asymptotically equivalent to linear spectral statistics (LSS)—averaging effects over all eigenvalues—were recently shown to achieve some power. We consider a nonparametric, non-Gaussian generalization of the spiked model to the setting of Marchenko and Pastur (1967). This allows a general bulk of the noise eigenvalues, accomodating correlated variables even under the null hypothesis of no significant PCs. We develop new tests based on LSS to detect weak PCs in this model. We show using the CLT for LSS that the optimal LSS satisfy a Fredholm integral equation of the first kind. We develop algorithms to solve it, building on our recent method for computing the limit empirical spectrum. In contrast to the standard spiked model, we find that under ‘widely spread’ null eigenvalue distributions, the new tests have a lot of power.


Model-based clustering with data correction for removing artifacts in gene expression data

A unified framework for spline estimators

L^2-Perturbed Markov processes and applications to random walks in dynamic random environments

Policy Error Bounds for Model-Based Reinforcement Learning with Factored Linear Models

Distributed Constraint Optimization Problems and Applications: A Survey

The Segmented iHMM: A Simple, Efficient Hierarchical Infinite HMM

Isotropic Brownian motions over complex fields as a solvable model for May-Wigner stability analysis

Robust Estimation of Propensity Score Weights via Subclassification

Power-Distortion Metrics for Path Planning over Gaussian Sensor Networks

Note: a counterexample to a conjecture of Jackson about hamiltonicity of diregular digraphs

Testing hypotheses about mixture distributions using not identically distributed data

Semidefinite Programs for Exact Recovery of a Hidden Community

On the Size and the Approximability of Minimum Temporally Connected Subgraphs

Localizations of inductively factored arrangements

A loopless and branchless $O(1)$ algorithm to generate the next Dyck word

Generalized Statistical Tests for mRNA and Protein Subcellular Spatial Patterning against Complete Spatial Randomness

Burstiness Scale: a highly parsimonious model for characterizing random series of events

Energy Conservation and Power Bonds in Co-Simulations: Non-Iterative Adaptive Step Size Control and Error Estimation

Random Walks in a Sparse Random Environment

Causes for Query Answers from Databases, Datalog Abduction and View-Updates: The Presence of Integrity Constraints

Undermining and Strengthening Social Networks through Network Modification

The Singularity May Never Be Near

Stochastic areas, Winding numbers and Hopf fibrations

Inequalities for critical exponents in $d$-dimensional sandpiles

Distributed Private Online Learning for Social Big Data Computing over Data Center Networks

Directly Coupled Observers for Quantum Harmonic Oscillators with Discounted Mean Square Cost Functionals and Penalized Back-action

Waiting times and stopping probabilities for patterns in Markov chains

Uniform Hypergraph Partitioning: Provable Tensor Methods and Sampling Techniques

Active Task Selection for Multi-Task Learning

Multi-task and Lifelong Learning of Kernels

Spaces of completions of elementary theories and convergence laws for random hypergraphs

Determining the best attributes for surveillance video keywords generation

Basic enumeration of graph compositions with a restricted number of components

Semi-Markov Switching Vector Autoregressive Model-based Anomaly Detection in Aviation Systems

Deep Learning in Finance

Interactive Storytelling over Document Collections

A multivariate mixed hidden Markov model to analyze blue whale diving behaviour during controlled sound exposures

2-Bit Random Projections, NonLinear Estimators, and Approximate Near Neighbor Search

Analytic queueing model for ambulance services

Recovering Structured Probability Matrices

High-performance generation of the Hamiltonian and Overlap matrices in FLAPW methods

Principal stratification in the Twilight Zone: Weakly separated components in finite mixture models

Detection of faults and intrusions in cyber-physical systems from physical correlations

Large deviations for invariant measures of white-forced 2D Navier-Stokes equation

Estimating Structured Vector Autoregressive Model

Nonparametric and Varying Coefficient Modal Regression

Mixture of Regression Models with Single-Index

Clustering subgaussian mixtures by semidefinite programming

A Duality in Buchsbaum rings and triangulated manifolds

A randomized algorithm for enumerating zonotope vertices

Mock Threshold Graphs

Dyson’s spike for random Schroedinger operators and Novikov-Shubin invariants of groups

Nested critical points for a directed polymer on a disordered diamond lattice

Denoising and Covariance Estimation of Single Particle Cryo-EM Images

Structured Learning of Binary Codes with Column Generation

Optimal Nonpreemptive Scheduling in a Smart Grid Model

Orthogonal RNNs and Long-Memory Tasks

A Geometric Analysis of Phase Retrieval

Motion Planning Strategies for Autonomously Mapping 3D Structures

EDCC 2015 – Fast Abstracts & Student Forum Proceedings

Designing a Disaster-resilient Network with Software Defined Networking

An Effective and Efficient Approach for Clusterability Evaluation

siEDM: an efficient string index and search algorithm for edit distance with moves

Preconditioning Kernel Matrices

A note on basis dimension selection in generalized additive modelling

Inference Networks for Sequential Monte Carlo in Graphical Models

On the Hardness of Partially Dynamic Graph Problems and Connections to Diameter

Distributed Deep Learning Using Synchronous Stochastic Gradient Descent

Stability result for sets with $3A\ne{\mathbb Z}_5^n$

A Method for Detecting Life-Threatening Signals in Serum Potassium Level after Myocardial Infarction

Existence of a plateau for the flux of tasep with site disorder

Clique coloring $B_1$-EPG graphs

Variational inference for Monte Carlo objectives

Improving Trajectory Modelling for DNN-based Speech Synthesis by using Stacked Bottleneck Features and Minimum Trajectory Error Training

Reflection groups, arrangements, and invariant real varieties

On commutative $p$-schemes of order $p^4$

Convexification of Learning from Constraints

Simple t-designs: A recursive construction for arbitrary t

Temporal correlations of the running maximum of a Brownian trajectory

Shortest path embeddings of graphs on surfaces

Packing minor-closed families of graphs into complete graphs

Fast IDS Computing System Method and its Memristor Crossbar-based Hardware Implementation

Cubic symmetry breaking in $φ^4$-theory with disorder by operator product expansion

Counting distinct dimer hex tilings

Semi-supervised Clustering for Short Text via Deep Representation Learning

The maximum weight stable set problem in $(P_6,\mbox{bull})$-free graphs

Graph Regularized Low Rank Representation for Aerosol Optical Depth Retrieval

Fast Online k-nn Graph Building

Understanding Visual Concepts with Continuation Learning

Long induced paths in graphs

Scaling of Harmonic Oscillator Eigenfunctions and Their Nodal Sets Around the Caustic

Tsallis statistics in the income distribution of Brazil

Higher-Order Low-Rank Regression

The Push the button algorithm for contragredient Lie superalgebras

Joint ML calibration and DOA estimation with separated arrays

The Matsu Wheel: A Cloud-based Framework for Efficient Analysis and Reanalysis of Earth Satellite Imagery

Extension complexity of polytopes with few vertices or facets

Enablers and Inhibitors in Causal Justifications of Logic Programs

Periodicity in Rectangular Arrays

Sparse Linear Regression via Generalized Orthogonal Least-Squares

Fast Cross-Polytope Locality-Sensitive Hashing

Matching Matrix Bernstein with Little Memory: Near-Optimal Finite Sample Guarantees for Oja’s Algorithm

On the thin-shell conjecture for the Schatten classes