Block-Structure Based Time-Series Models For Graph Sequences

Although the computational and statistical trade-off for modeling single graphs, for instance using block models, is relatively well understood, extending such results to sequences of graphs has proven to be difficult. In this work, we propose two models for graph sequences that capture: (a) link persistence between nodes across time, and (b) community persistence of each node across time. In the first model, we assume that the latent community of each node does not change over time, and in the second model we relax this assumption suitably. For both of these proposed models, we provide computationally efficient inference algorithms, whose unique feature is that they leverage community detection methods that work on single graphs. We also provide experimental results validating the suitability of the models and the performance of our algorithms on synthetic instances.

Learning Manifolds from Non-stationary Streaming Data

Streaming adaptations of manifold learning based dimensionality reduction methods, such as Isomap, typically assume that the underlying data distribution is stationary. Such methods are not equipped to detect or handle sudden changes or gradual drifts in the distribution generating the stream. We prove that a Gaussian Process Regression (GPR) model that uses a manifold-specific kernel function and is trained on an initial batch of sufficient size can closely approximate the state-of-the-art streaming Isomap algorithm. The predictive variance obtained from the GPR prediction is then shown to be an effective detector of changes in the underlying data distribution. Results on several synthetic and real data sets show that the resulting algorithm can effectively learn lower-dimensional representations of high-dimensional data in a streaming setting, while identifying shifts in the generative distribution.
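
The idea of using GPR predictive variance as a change detector can be illustrated with a minimal sketch. It assumes a plain RBF kernel with unit prior variance as a stand-in for the manifold-specific kernel, which the abstract does not specify; the function names, the noise level and the 0.5 threshold are hypothetical choices made for illustration only.

```python
import numpy as np

def rbf_kernel(a, b, length_scale=1.0):
    """Squared-exponential kernel between the rows of a and b (a stand-in kernel)."""
    sq = np.sum(a**2, axis=1)[:, None] + np.sum(b**2, axis=1)[None, :] - 2.0 * a @ b.T
    return np.exp(-0.5 * np.maximum(sq, 0.0) / length_scale**2)

def gp_predictive_variance(train_x, query_x, noise=1e-3, length_scale=1.0):
    """Posterior variance of the latent function at each query point."""
    k_train = rbf_kernel(train_x, train_x, length_scale) + noise * np.eye(len(train_x))
    k_cross = rbf_kernel(train_x, query_x, length_scale)
    solved = np.linalg.solve(k_train, k_cross)
    # The prior variance k(x, x) equals 1 for this kernel.
    return 1.0 - np.sum(k_cross * solved, axis=0)

def flag_shift(train_x, query_x, threshold=0.5, **kwargs):
    """Flag query points whose predictive variance exceeds a hypothetical threshold."""
    return gp_predictive_variance(train_x, query_x, **kwargs) > threshold
```

Points that resemble the initial batch receive a small variance, while points far from it keep a variance close to the prior, which is the behaviour a variance-based detector relies on.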

Genesis of Basic and Multi-Layer Echo State Network Recurrent Autoencoders for Efficient Data Representations

It is widely accepted that data representations strongly affect the performance of machine learning tools: the better defined the representations, the better the results. Feature extraction-based methods such as autoencoders are designed to find more accurate data representations than the original ones. They perform efficiently on a specific task in terms of 1) high accuracy, 2) large short-term memory and 3) low execution time. The Echo State Network (ESN) is a recent kind of Recurrent Neural Network that exhibits very rich dynamics thanks to its reservoir-based hidden layer. It is widely used for complex non-linear problems and has outperformed classical approaches in a number of tasks, including regression and classification. In this paper, the rich dynamics and large memory provided by the ESN and the strength of autoencoders in feature extraction are combined within an ESN Recurrent Autoencoder (ESN-RAE). To provide a sturdier alternative to conventional reservoir-based networks, not only a single-layer basic ESN is used as an autoencoder, but also a Multi-Layer ESN (ML-ESN-RAE). The new features, once extracted from the ESN’s hidden layer, are applied to classification tasks. The classification rates rise considerably compared to those obtained with the original data features. An accuracy-based comparison is performed between the proposed recurrent AEs and two variants of ELM feed-forward AEs (Basic and ML) in both noise-free and noisy environments. The empirical study reveals the main contribution of recurrent connections in improving classification performance.

On-Demand Big Data Integration: A Hybrid ETL Approach for Reproducible Scientific Research

Scientific research requires access to, analysis of, and sharing of data that is distributed across various heterogeneous data sources at the scale of the Internet. An eager ETL process constructs an integrated data repository as its first step, integrating and loading data in its entirety from the data sources. The bootstrapping of this process is not efficient for scientific research that requires access to data from very large and typically numerous distributed data sources. A lazy ETL process loads only the metadata, but still eagerly. Lazy ETL is faster in bootstrapping. However, queries on the integrated data repository of eager ETL perform faster, due to the availability of the entire data beforehand. In this paper, we propose a novel ETL approach for scientific data integration, as a hybrid of eager and lazy ETL approaches, applied to both data and metadata. This way, Hybrid ETL supports incremental integration and loading of metadata and data from the data sources. We incorporate a human-in-the-loop approach, to enhance the hybrid ETL, with selective data integration driven by the user queries and sharing of integrated data between users. We implement our hybrid ETL approach in a prototype platform, Obidos, and evaluate it in the context of data sharing for medical research. Obidos outperforms both the eager ETL and lazy ETL approaches, for scientific research data integration and sharing, through its selective loading of data and metadata, while storing the integrated data in a scalable integrated data repository.
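
The query-driven, incremental loading described above can be sketched as a small cache that fetches metadata and data only on first use. This is an illustrative sketch, not the Obidos implementation: the class name, the two fetch callbacks and the per-source, per-item granularity are all assumptions.

```python
class HybridRepository:
    """Loads metadata and data on demand and keeps both for later queries."""

    def __init__(self, fetch_metadata, fetch_data):
        # Hypothetical callbacks standing in for access to remote data sources.
        self._fetch_metadata = fetch_metadata
        self._fetch_data = fetch_data
        self._metadata = {}
        self._data = {}

    def metadata(self, source):
        """Return the set of item names for a source, loading it once."""
        if source not in self._metadata:
            self._metadata[source] = self._fetch_metadata(source)
        return self._metadata[source]

    def query(self, source, item):
        """Return one item, integrating it into the repository on first access."""
        if item not in self.metadata(source):
            raise KeyError(f"{item!r} is not listed in the metadata of {source!r}")
        key = (source, item)
        if key not in self._data:
            self._data[key] = self._fetch_data(source, item)
        return self._data[key]
```

Nothing is loaded at construction time, so bootstrapping stays cheap, while repeated queries are served from the already integrated data, which any user sharing the same repository instance would also see.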

Parameter Transfer Unit for Deep Neural Networks

Parameters in deep neural networks which are trained on large-scale databases can generalize across multiple domains, which is referred to as ‘transferability’. Unfortunately, the transferability is usually defined as discrete states and it differs across domains and network architectures. Existing works usually heuristically apply parameter-sharing or fine-tuning, and there is no principled approach to learn a parameter transfer strategy. To address the gap, a parameter transfer unit (PTU) is proposed in this paper. The PTU learns a fine-grained nonlinear combination of activations from both the source and the target domain networks, and subsumes hand-crafted discrete transfer states. In the PTU, the transferability is controlled by two gates which are artificial neurons and can be learned from data. The PTU is a general and flexible module which can be used in both CNNs and RNNs. Experiments are conducted with various network architectures and multiple transfer domain pairs. Results demonstrate the effectiveness of the PTU as it outperforms heuristic parameter-sharing and fine-tuning in most settings.

A Theory of Statistical Inference for Ensuring the Robustness of Scientific Results

Inference is the process of using facts we know to learn about facts we do not know. A theory of inference gives assumptions necessary to get from the former to the latter, along with a definition for and summary of the resulting uncertainty. Any one theory of inference is neither right nor wrong, but merely an axiom that may or may not be useful. Each of the many diverse theories of inference can be valuable for certain applications. However, no existing theory of inference addresses the tendency to choose, from the range of plausible data analysis specifications consistent with prior evidence, those that inadvertently favor one’s own hypotheses. Since the biases from these choices are a growing concern across scientific fields, and in a sense the reason the scientific community was invented in the first place, we introduce a new theory of inference designed to address this critical problem. We derive ‘hacking intervals,’ which are the range of a summary statistic one may obtain given a class of possible endogenous manipulations of the data. Hacking intervals require no appeal to hypothetical data sets drawn from imaginary superpopulations. A scientific result with a small hacking interval is more robust to researcher manipulation than one with a larger interval, and is often easier to interpret than a classical confidence interval. Some versions of hacking intervals turn out to be equivalent to classical confidence intervals, which means they may also provide a more intuitive and potentially more useful interpretation of classical confidence intervals.
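
As a toy illustration of the definition above, the sketch below computes the range of a sample mean when an analyst may drop up to a fixed number of observations. Both the summary statistic and the manipulation class are assumptions chosen for simplicity, not the classes studied in the paper, and the brute-force enumeration is only practical for small data sets.

```python
from itertools import combinations
from statistics import mean

def hacking_interval_mean(data, max_removed):
    """Range of the sample mean over all ways of dropping at most max_removed points."""
    values = list(data)
    if max_removed >= len(values):
        raise ValueError("at least one observation must remain")
    stats = []
    for removed in range(max_removed + 1):
        for dropped in combinations(range(len(values)), removed):
            dropped_set = set(dropped)
            kept = [v for i, v in enumerate(values) if i not in dropped_set]
            stats.append(mean(kept))
    return min(stats), max(stats)
```

For the data 1, 2, 3, 4, 10 with at most one point removed, the mean ranges from 2.5 (dropping 10) to 4.75 (dropping 1), so a conclusion that depends on the mean exceeding 3 would not be robust to this manipulation class.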

Composite Inference for Gaussian Processes

Large-scale Gaussian process models are becoming increasingly important and widely used in many areas, such as computer experiments, stochastic optimization via simulation, and machine learning using Gaussian processes. The standard methods, such as maximum likelihood estimation (MLE) for parameter estimation and the best linear unbiased predictor (BLUP) for prediction, are generally the primary choices in many applications. In spite of their merits, those methods are not feasible due to intractable computation when the sample size is huge. A novel method for parameter estimation and prediction is proposed to solve the computational problems of large-scale Gaussian process based models, by separating the original dataset into tractable subsets. This method consistently combines parameter estimation and prediction by making full use of the dependence among conditional densities: a statistically efficient composite likelihood based on joint distributions of some well-selected conditional densities is developed to estimate parameters, and then ‘composite inference’ is coined to make predictions for an unknown input point, based on its distributions conditional on each block subset. The proposed method transforms the intractable BLUP into a tractable convex optimization problem. It is also shown that the prediction given by the proposed method has a minimum variance for a given separation of the dataset.

Data-driven Summarization of Scientific Articles

Data-driven approaches to sequence-to-sequence modelling have been successfully applied to short text summarization of news articles. Such models are typically trained on input-summary pairs consisting of only a single or a few sentences, partially due to limited availability of multi-sentence training data. Here, we propose to use scientific articles as a new milestone for text summarization: large-scale training data come almost for free with two types of high-quality summaries at different levels – the title and the abstract. We generate two novel multi-sentence summarization datasets from scientific articles and test the suitability of a wide range of existing extractive and abstractive neural network-based summarization approaches. Our analysis demonstrates that scientific papers are suitable for data-driven text summarization. Our results could serve as valuable benchmarks for scaling sequence-to-sequence models to very long sequences.

An Information-Theoretic View for Deep Learning

Deep learning has transformed computer vision, natural language processing and speech recognition. However, the following two critical questions remain obscure: (1) Why do deep neural networks generalize better than shallow networks? (2) Does it always hold that a deeper network leads to better performance? Specifically, letting $L$ be the number of convolutional and pooling layers in a deep neural network, and $n$ be the size of the training sample, we derive the upper bound on the expected generalization error for this network, i.e.,
\begin{eqnarray*} \mathbb{E}[R(W)-R_S(W)] \leq \exp{\left(-\frac{L}{2}\log{\frac{1}{\eta}}\right)}\sqrt{\frac{2\sigma^2}{n}I(S,W) } \end{eqnarray*}
where $\sigma > 0$ is a constant depending on the loss function, $0 < \eta < 1$ is a constant depending on the information loss for each convolutional or pooling layer, and $I(S,W)$ is the mutual information between the training sample $S$ and the output hypothesis $W$. This upper bound shows that: (1) As the network increases its number of convolutional and pooling layers $L$, the expected generalization error will decrease exponentially to zero. Layers with strict information loss, such as the convolutional layers, reduce the generalization error of deep learning algorithms. This answers the first question. However, (2) zero expected generalization error does not imply a small test error $\mathbb{E}[R(W)]$. This is because $\mathbb{E}[R_S(W)]$ will be large when the information for fitting the data is lost as the number of layers increases. This suggests that the claim ‘the deeper the better’ is conditioned on a small training error $\mathbb{E}[R_S(W)]$.
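
To make the exponential dependence on the number of layers concrete, the right-hand side of the bound can be evaluated directly. The function below is a plain transcription of the formula, with hypothetical argument names; the numerical values in the usage note are arbitrary and not taken from the paper.

```python
import math

def generalization_bound(num_layers, eta, sigma, n, mutual_info):
    """Evaluate exp(-(L/2) log(1/eta)) * sqrt(2 sigma^2 I(S,W) / n)."""
    if not 0.0 < eta < 1.0:
        raise ValueError("eta must lie strictly between 0 and 1")
    if sigma <= 0.0 or n <= 0 or mutual_info < 0.0:
        raise ValueError("sigma and n must be positive, mutual_info non-negative")
    # The exponential factor equals eta ** (L / 2): each extra pair of layers
    # multiplies the bound by eta.
    contraction = math.exp(-(num_layers / 2.0) * math.log(1.0 / eta))
    return contraction * math.sqrt(2.0 * sigma**2 * mutual_info / n)
```

With eta = 0.5, sigma = 1, n = 200 and I(S,W) = 1, two layers give 0.5 × 0.1 = 0.05, and four layers halve that to 0.025. The bound concerns only the gap between test and training error, so a small value says nothing about the training error itself.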

No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling

Though impressive results have been achieved in visual captioning, the task of generating abstract stories from photo streams is still a little-tapped problem. Different from captions, stories have more expressive language styles and contain many imaginary concepts that do not appear in the images. This poses challenges to behavioral cloning algorithms. Furthermore, due to the limitations of automatic metrics on evaluating story quality, reinforcement learning methods with hand-crafted rewards also face difficulties in gaining an overall performance boost. Therefore, we propose an Adversarial REward Learning (AREL) framework to learn an implicit reward function from human demonstrations, and then optimize policy search with the learned reward function. Though automatic evaluation indicates a slight performance boost over state-of-the-art (SOTA) methods in cloning expert behaviors, human evaluation shows that our approach achieves significant improvement in generating more human-like stories than SOTA systems.

Realistic Evaluation of Deep Semi-Supervised Learning Algorithms

Semi-supervised learning (SSL) provides a powerful framework for leveraging unlabeled data when labels are limited or expensive to obtain. SSL algorithms based on deep neural networks have recently proven successful on standard benchmark tasks. However, we argue that these benchmarks fail to address many issues that these algorithms would face in real-world applications. After creating a unified reimplementation of various widely-used SSL techniques, we test them in a suite of experiments designed to address these issues. We find that the performance of simple baselines which do not use unlabeled data is often underreported, that SSL methods differ in sensitivity to the amount of labeled and unlabeled data, and that performance can degrade substantially when the unlabeled dataset contains out-of-class examples. To help guide SSL research towards real-world applicability, we make our unified reimplementation and evaluation platform publicly available.

A quadratic penalty algorithm for linear programming and its application to linearizations of quadratic assignment problems

This paper provides the first meaningful documentation and analysis of an established technique which aims to obtain an approximate solution to linear programming problems prior to applying the primal simplex method. The underlying algorithm is a penalty method with naive approximate minimization in each iteration. During initial iterations an approach similar to an augmented Lagrangian method is used. Later the technique corresponds closely to a classical quadratic penalty method. There is also a discussion of the extent to which it can be used to obtain fast approximate solutions of LP problems, in particular when applied to linearizations of quadratic assignment problems.
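
For orientation, a classical quadratic penalty method for an LP in standard form can be sketched as follows. This is a textbook-style illustration, not the technique analysed in the paper: it assumes the problem min c·x subject to Ax = b, x >= 0, uses projected coordinate descent as a hypothetical choice of naive inner minimizer, and omits the augmented Lagrangian phase entirely. The penalty schedule and sweep count are arbitrary.

```python
def quadratic_penalty_lp(c, a_rows, b, penalties=(1.0, 10.0, 100.0, 1e3, 1e4), sweeps=20):
    """Approximately minimize c.x + (mu/2) * ||A x - b||^2 over x >= 0 for growing mu."""
    n = len(c)
    x = [0.0] * n
    for mu in penalties:
        for _ in range(sweeps):
            for j in range(n):
                # Current constraint residual A x - b.
                residual = [
                    sum(row[k] * x[k] for k in range(n)) - b_i
                    for row, b_i in zip(a_rows, b)
                ]
                col_sq = sum(row[j] ** 2 for row in a_rows)
                if col_sq == 0.0:
                    continue  # the variable does not enter any constraint
                # Exact minimization along coordinate j, then projection onto x_j >= 0.
                grad = c[j] + mu * sum(row[j] * r for row, r in zip(a_rows, residual))
                x[j] = max(0.0, x[j] - grad / (mu * col_sq))
    return x
```

On the toy problem min x1 + 2 x2 subject to x1 + x2 = 1, x >= 0, the iterates approach the vertex (1, 0) while remaining slightly infeasible for any finite penalty, which is why such a method yields an approximate starting point rather than an exact solution.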

Every planar graph without 4-cycles adjacent to two triangles is DP-4-colorable
Faster Response in Bounded-Update-Rate, Discrete-time Networks using Delayed Self-Reinforcement
The Obstacle Problem for Quasilinear Stochastic Integral-Partial Differential Equations
Robust Decentralized Navigation of Multi-Agent Systems with Collision Avoidance and Connectivity Maintenance Using Model Predictive Controllers
A symmetric formula for hypergeometric series
Modular Arithmetic Erasure Channels and Their Multilevel Channel Polarization
Descriptor Selection via Self-Paced Learning for Bioactivity of Molecular Structure in QSAR Classification
A universal model for neuromorphic computing and learning
Distributed Distributional Deterministic Policy Gradients
State Distribution-aware Sampling for Deep Q-learning
The Obstacle Problem for Quasilinear Stochastic PDEs with Degenerate Operator
The Obstacle Problem for Quasilinear Stochastic PDEs with Neumann boundary condition
Analysis of Hannan Consistent Selection for Monte Carlo Tree Search in Simultaneous Move Games
Reachability and Distances under Multiple Changes
Nonequilibrium quantum order at infinite temperature: spatiotemporal correlations and their generating functions
Adaptive Beam Tracking with the Unscented Kalman Filter for Millimeter Wave Communication
Individual Sensitivity Preprocessing for Data Privacy
Bayesian Test and Selection for Bandwidth of High-dimensional Banded Precision Matrices
Rendition: Reclaiming what a black box takes away
Improved Initialization for Nonlinear State-Space Modeling
Fingerprint Match in Box
Small-Set Expansion in Shortcode Graph and the 2-to-2 Conjecture
A pseudo-likelihood approach for multivariate meta-analysis of test accuracy studies with multiple thresholds
Using Aspect Extraction Approaches to Generate Review Summaries and User Profiles
Influencing Flock Formation in Low-Density Settings
Ocean Plume Tracking with Unmanned Surface Vessels: Algorithms and Experiments
Data-Driven Investigative Journalism For Connectas Dataset
Gesture based Human-Swarm Interactions for Formation Control using interpreters
*-Balanced Fuzzy graphs
How to Realize a Graph on Random Points
Alternating sign trapezoids and a constant term approach
Boltzmann Encoded Adversarial Machines
Maximum Integer Flows in Directed Planar Graphs with Multiple Sources and Sinks and Vertex Capacities
Crawling in Rogue’s dungeons with (partitioned) A3C
A Boundary Local Time For One-Dimensional Super-Brownian Motion And Applications
Simultaneous shot inversion for nonuniform geometries using fast data interpolation
A novel distribution-free hybrid regression model for manufacturing process efficiency improvement
Discovering Style Trends through Deep Visually Aware Latent Item Embeddings
A constrained regression model for an ordinal response with ordinal predictors
On the 4-girth-thickness of the line graph of the complete graph
Vertically constrained Motzkin-like paths inspired by bobbin lace
Identification of Potential Hazardous Events for an Unmanned Protective Vehicle
Longest Common Factor Made Fully Dynamic
goSLP: Globally Optimized Superword Level Parallelism Framework
Inertial, corrected, primal–dual proximal splitting
Bayesian Updating and Uncertainty Quantification using Sequential Tempered MCMC with the Rank-One Modified Metropolis Algorithm
An Envelope for Davis-Yin Splitting and Strict Saddle Point Avoidance
Splitting tessellations in spherical spaces
Statistical Estimation of Conditional Shannon Entropy
Proof of the Gorenstein Interval Conjecture in low socle degree
Discovery of Driving Patterns by Trajectory Segmentation
Can Eye Movement Data Be Used As Ground Truth For Word Embeddings Evaluation?
A machine learning model for identifying cyclic alternating patterns in the sleeping brain
Detecting Syntactic Features of Translated Chinese
Siamese Generative Adversarial Privatizer for Biometric Data
Switchable Temporal Propagation Network
Analyzing and Characterizing User Intent in Information-seeking Conversations
Is My Matched Dataset As-If Randomized, More, Or Less? Unifying the Design and Analysis of Observational Studies
Low-frequency vibrational modes of stable glasses
Real-Time Stochastic Predictive Control for Hybrid Vehicle Energy Management
A Call for Clarity in Reporting BLEU Scores
Neural-Brane: Neural Bayesian Personalized Ranking for Attributed Network Embedding
Low Resource Black-Box End-to-End Attack Against State of the Art API Call Based Malware Classifiers
Towards an Unsupervised Entrainment Distance in Conversational Speech using Deep Neural Networks
Face Recognition: Primates in the Wild
A Tight 4/3 Approximation for Capacitated Vehicle Routing in Trees
Towards Dependable Deep Convolutional Neural Networks (CNNs) with Out-distribution Learning
SimpleQuestions Nearly Solved: A New Upperbound and Baseline Approach
Large fluctuations of the KPZ equation in a half-space
Structured SUMCOR Multiview Canonical Correlation Analysis for Large-Scale Data
A comparison of methods for modeling marginal non-zero daily rainfall across the Australian continent
The $(p,q)$-spectral radii of $(r,s)$-directed hypergraphs
$β$-saturations of Misere Nim
Lower Bounds for Special Cases of Syntactic Multilinear ABPs
Two-Channel Critically-Sampled Graph Wavelets With Spectral Domain Sampling
End-Task Oriented Textual Entailment via Deep Exploring Inter-Sentence Interactions
On efficiency, savings, wealth transfers and risk-aversion in electricity markets with uncertain supply
Fast and Efficient Distributed Computation of Hamiltonian Cycles in Random Graphs
In-Browser Split-Execution Support for Interactive Analytics in the Cloud
A Continuous Time GARCH(p,q) Process with Delay
Characterizing Allegheny County Opioid Overdoses with an Interactive Data Explorer and Synthetic Prediction Tool
Explaining hyperspectral imaging based plant disease identification: 3D CNN and saliency maps
Measuring and Computing Database Inconsistency via Repairs
Matlab Implementation of Machine Vision Algorithm on Ballast Degradation Evaluation
A lower bound for the $k$-multicolored sum-free problem in $\mathbb{Z}^n_m$
Measuring the Intrinsic Dimension of Objective Landscapes
Efficient Nonlinear Precoding for Massive MU-MIMO Downlink Systems with 1-Bit DACs
Between hard and soft thresholding: optimal iterative thresholding algorithms
Integrating Multiplicative Features into Supervised Distributional Methods for Lexical Entailment
DeepEmo: Learning and Enriching Pattern-Based Emotion Representations
Increasing Achievable Information Rates via Geometric Shaping
Phonon transport and vibrational excitations in amorphous solids
Parallel computing as a congestion game
Spatiotemporal Learning of Dynamic Gestures from 3D Point Cloud Data
Learning to See the Invisible: End-to-End Trainable Amodal Instance Segmentation
Spatial structure of quasi-localized vibrations in nearly jammed amorphous solids
Homocentric Hypersphere Feature Embedding for Person Re-identification
On Local Antimagic Chromatic Number of Graphs
Representing the Unknown – Impact of Uncertainty on the Interaction between Decision Making and Trajectory Generation
Assessment of Deep Convolutional Neural Networks for Road Surface Classification
The Douglas–Rachford algorithm for a hyperplane and a doubleton
Assessing Language Models with Scaling Properties
Mask-aware Photorealistic Face Attribute Manipulation
Polynomial Kernels for Hitting Forbidden Minors under Structural Parameterizations
SIRIUS-LTG-UiO at SemEval-2018 Task 7: Convolutional Neural Networks with Shortest Dependency Paths for Semantic Relation Extraction and Classification in Scientific Papers
Stochastically perturbed bred vectors in multi-scale systems
Segmentation of Scanning Tunneling Microscopy Images Using Variational Methods and Empirical Wavelets
A multi-level collaborative filtering method that improves recommendations
Classifying variable-structures: a general framework
Learning Software Constraints via Installation Attempts
Decoupled mild solutions of path-dependent PDEs and IPDEs represented by BSDEs driven by cadlag martingales
Improved Algorithms for Fully Dynamic Maximal Independent Set
Communication channels in safety analysis: An industrial exploratory case study
Accurate 3-D Reconstruction with RGB-D Cameras using Depth Map Fusion and Pose Refinement
Scheduled Multi-Task Learning: From Syntax to Translation
How to generalize (and not to generalize) the Chu–Vandermonde identity
Throughput and Energy-Efficient Network Slicing
Learning-Based Mean-Payoff Optimization in an Unknown MDP under Omega-Regular Constraints
Optimization of Weighted Individual Energy Efficiencies in Interference Network
Graphs with sparsity order at most two: The complex case
On Access Control in Cabin-Based Transport Systems
Nilpotent Graph
How swarm size during evolution impacts the behavior, generalizability, and brain complexity of animats performing a spatial navigation task
Internal relation between Personality trait Statistical outcomes among Junior College Divers and their performance
Mining Automatically Estimated Poses from Video Recordings of Top Athletes
Stochastic integration in quasi-Banach spaces
Improved Local Search Based Approximation Algorithm for Hard Uniform Capacitated k-Median Problem
Deep Neural Network Based Subspace Learning of Robotic Manipulator Workspace Mapping
Efficient Equalization Method for Cyclic Prefix-Free Coarsely Quantized Massive MIMO Systems
A new class of convolutional codes and its use in the McEliece Cryptosystem
Data-driven regularization of Wasserstein barycenters with an application to multivariate density registration
Correlation Tracking via Joint Discrimination and Reliability Learning
One simple remark concerning the uniform value
FaceShop: Deep Sketch-based Face Image Editing
Tighter Connections Between Formula-SAT and Shaving Logs
Rate-Distortion Theory for General Sets and Measures
Computational Approaches for Stochastic Shortest Path on Succinct MDPs
Feedback Control Goes Wireless: Guaranteed Stability over Low-power Multi-hop Networks
Domination game and minimal edge cuts
Infrared and visible image fusion using Latent Low-Rank Representation
Resource Allocation and Interference Management in OFDMA-based VLC Networks
Style Transfer Through Back-Translation
An Anchor-Free Region Proposal Network for Faster R-CNN based Text Detection Approaches
Cubes3D: Neural Network based Optical Flow in Omnidirectional Image Scenes
Towards a Neural Network Approach to Abstractive Multi-Document Summarization
On robust stopping times for detecting changes in distribution
Label-aware Double Transfer Learning for Cross-Specialty Medical Named Entity Recognition
Estimate and Replace: A Novel Approach to Integrating Deep Neural Networks with Existing Applications
The $Q_2$-free process in the hypercube
Optimal User Scheduling in Energy Harvesting Wireless Networks
Is it possible to retrieve soil-moisture content from measured VNIR hyperspectral data?
Unsupervised Neural Machine Translation with Weight Sharing
A reaction network scheme which implements the EM algorithm
ECO: Efficient Convolutional Network for Online Video Understanding
Novelty and Conventionality in International Research Collaboration
Structural evolution of amorphous polymeric nitrogen from ab initio molecular dynamics simulations and evolutionary search
Propagation of content similarity through a collaborative network for live show recommendation
Multi-objective Architecture Search for CNNs
An Introduction to Quantum Filtering
Semi-supervised Content-based Detection of Misinformation via Tensor Embeddings
On the Polynomiality of moments of sizes for random $(n, dn\pm 1)$-core partitions with distinct parts
Sparse Power Factorization: Balancing peakiness and sample complexity
Complete positivity and distance-avoiding sets
Human-level Performance On Automatic Head Biometrics In Fetal Ultrasound Using Fully Convolutional Neural Networks
A Duality-Based Approach for Distributed Optimization with Coupling Constraints
SITAN: Services for Fault-Tolerant Ad Hoc Networks with Unknown Participants
Towards Semantic SLAM: Points, Planes and Objects
Keep it Unreal: Bridging the Realism Gap for 2.5D Recognition with Geometry Priors Only
dockChain: A Solution for Electric Vehicles Charge Point Anxiety
On Optimal Index Codes for Interlinked Cycle Structures with Outer Cycles
PULP-HD: Accelerating Brain-Inspired High-Dimensional Computing on a Parallel Ultra-Low Power Platform
On Multilinear Forms: Bias, Correlation, and Tensor Rank
A Report on the Complex Word Identification Shared Task 2018
Improving Native Ads CTR Prediction by Large Scale Event Embedding and Recurrent Networks
Small-angle X-ray scattering in amorphous silicon: A computational study
Differences of Type I error rates using SAS and SPSS for repeated measures designs
Seer: Leveraging Big Data to Navigate the Increasing Complexity of Cloud Debugging
Automated Detection of Adverse Drug Reactions in the Biomedical Literature Using Convolutional Neural Networks and Biomedical Word Embeddings
Layered Fields for Natural Tessellations on Surfaces
Optimal Investment and Derivative Demand Under Price Impact
An Integrated Framework for AI Assisted Level Design in 2D Platformers
DOOM Level Generation using Generative Adversarial Networks
A More Fine-Grained Complexity Analysis of Finding the Most Vital Edges for Undirected Shortest Paths
Robust and Approximately Stable Marriages under Partial Information
Quasi-Static Large Deviations
A Non-Invasive Method for the Safe Interaction of Cities and Electric Vehicle Fleets