Convergence rates for smooth k-means change-point detection

In this paper, we consider the estimation of a change-point for possibly high-dimensional data in a Gaussian model, using a k-means method. We prove that, up to a logarithmic term, this change-point estimator has a minimax rate of convergence. Then, considering the case of sparse data, with a Sobolev regularity, we propose a smoothing procedure based on Lepski’s method and show that the resulting estimator attains the optimal rate of convergence. Our results are illustrated by some simulations. As the theoretical statement relying on Lepski’s method depends on some unknown constant, practical strategies are suggested to perform an optimal smoothing.
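As a rough illustration of the k-means idea for a single change-point, the sketch below picks the split that minimizes the within-segment sum of squares for two contiguous segments. The function name `estimate_change_point` and the use of NumPy are assumptions made for this example. It does not include the Lepski-based smoothing step and is not the authors' implementation.

```python
import numpy as np

def estimate_change_point(x):
    """Return the split index t (1 <= t < n) that minimizes the total
    within-segment sum of squares of the segments x[:t] and x[t:]."""
    x = np.asarray(x, dtype=float)
    if x.ndim == 1:
        x = x[:, None]                      # treat 1-D input as n points in R^1
    n = x.shape[0]
    if n < 2:
        raise ValueError("need at least two observations")
    prefix = np.cumsum(x, axis=0)           # prefix[t-1] = sum of the first t rows
    left_sum = prefix[:-1]                  # segment sums for t = 1, ..., n-1
    right_sum = prefix[-1] - left_sum
    t = np.arange(1, n)
    # Minimizing the within-segment sum of squares is equivalent to
    # maximizing ||S_left||^2 / t + ||S_right||^2 / (n - t).
    score = (left_sum ** 2).sum(axis=1) / t + (right_sum ** 2).sum(axis=1) / (n - t)
    return int(np.argmax(score)) + 1

# Noise-free toy data: 30 rows of zeros followed by 20 rows of ones.
data = np.vstack([np.zeros((30, 5)), np.ones((20, 5))])
print(estimate_change_point(data))
```

Prefix sums keep the scan over all candidate splits linear in the number of observations.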

A likelihood ratio approach to sequential change point detection

In this paper we propose a new approach for sequential monitoring of a parameter of a d-dimensional time series. We consider a closed-end method, which is motivated by the likelihood ratio test principle, and compare the new method with two alternative procedures. We also incorporate self-normalization such that estimation of the long-run variance is not necessary. We prove that for a large class of testing problems the new detection scheme has asymptotic level \alpha and is consistent. The asymptotic theory is illustrated for the important cases of monitoring a change in the mean, variance and correlation. By means of a simulation study it is demonstrated that the new test performs better than the currently available procedures for these problems.

Attack Strength vs. Detectability Dilemma in Adversarial Machine Learning

As the prevalence and everyday use of machine learning algorithms, along with our reliance on these algorithms, grow dramatically, so do the efforts to attack and undermine these algorithms with malicious intent, resulting in a growing interest in adversarial machine learning. A number of approaches have been developed that can render a machine learning algorithm ineffective through poisoning or other types of attacks. Most attack algorithms typically use sophisticated optimization approaches, whose objective function is designed to cause maximum damage to the accuracy and performance of the algorithm on some task. In this effort, we show that while such an objective function is indeed brutally effective in causing maximum damage on an embedded feature selection task, it often results in an attack mechanism that can be easily detected with an embarrassingly simple novelty or outlier detection algorithm. We then propose an equally simple yet elegant solution by adding a regularization term to the attacker’s objective function that penalizes outlying attack points.

Bayesian Incremental Learning for Deep Neural Networks

In industrial machine learning pipelines, data often arrive in parts. Particularly in the case of deep neural networks, it may be too expensive to train the model from scratch each time, so one would rather use a previously learned model and the new data to improve performance. However, deep neural networks are prone to getting stuck in a suboptimal solution when trained on only new data as compared to the full dataset. Our work focuses on a continuous learning setup where the task is always the same and new parts of data arrive sequentially. We apply a Bayesian approach to update the posterior approximation with each new piece of data and find this method to outperform the traditional approach in our experiments.
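To make the idea of updating a posterior with each new piece of data concrete, here is a toy sketch with a conjugate Gaussian model. It estimates a mean with known noise variance, which is an assumption chosen for simplicity. The abstract concerns posterior approximations for deep networks, which this example does not attempt to reproduce; the function name is hypothetical.

```python
def update_gaussian_mean(prior_mean, prior_var, data, noise_var):
    """Posterior over an unknown mean after observing `data`,
    given a Gaussian prior and Gaussian noise with known variance."""
    post_precision = 1.0 / prior_var + len(data) / noise_var
    post_mean = (prior_mean / prior_var + sum(data) / noise_var) / post_precision
    return post_mean, 1.0 / post_precision

# Data arrive in two parts; the posterior after part 1 becomes the prior for part 2.
part1, part2 = [1.0, 2.0], [3.0]
m1, v1 = update_gaussian_mean(0.0, 1.0, part1, noise_var=1.0)
m_seq, v_seq = update_gaussian_mean(m1, v1, part2, noise_var=1.0)

# Training "from scratch" on all data gives the same posterior in this exact setting.
m_all, v_all = update_gaussian_mean(0.0, 1.0, part1 + part2, noise_var=1.0)
print(m_seq, v_seq, m_all, v_all)
```

In this conjugate case the sequential and batch posteriors coincide exactly; with approximate posteriors that equality generally holds only approximately.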

Continual Lifelong Learning with Neural Networks: A Review

Humans and animals have the ability to continually acquire and fine-tune knowledge throughout their lifespan. This ability is mediated by a rich set of neurocognitive functions that together contribute to the early development and experience-driven specialization of our sensorimotor skills. Consequently, the ability to learn from continuous streams of information is crucial for computational learning systems and autonomous agents (inter)acting in the real world. However, continual lifelong learning remains a long-standing challenge for machine learning and neural network models since the incremental acquisition of new skills from non-stationary data distributions generally leads to catastrophic forgetting or interference. This limitation represents a major drawback also for state-of-the-art deep neural network models that typically learn representations from stationary batches of training data, thus without accounting for situations in which the number of tasks is not known a priori and the information becomes incrementally available over time. In this review, we critically summarize the main challenges linked to continual lifelong learning for artificial learning systems and compare existing neural network approaches that alleviate, to different extents, catastrophic interference. Although significant advances have been made in domain-specific continual lifelong learning with neural networks, extensive research efforts are required for the development of general-purpose artificial intelligence and autonomous agents. We discuss well-established research and recent methodological trends motivated by experimentally observed lifelong learning factors in biological systems. Such factors include principles of neurosynaptic stability-plasticity, critical developmental stages, intrinsically motivated exploration, transfer learning, and crossmodal integration.

Approximation Algorithms for Cascading Prediction Models

We present an approximation algorithm that takes a pool of pre-trained models as input and produces from it a cascaded model with similar accuracy but lower average-case cost. Applied to state-of-the-art ImageNet classification models, this yields up to a 2x reduction in floating point multiplications, and up to a 6x reduction in average-case memory I/O. The auto-generated cascades exhibit intuitive properties, such as using lower-resolution input for easier images and requiring higher prediction confidence when using a computationally cheaper model.
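The abstract does not describe how the cascade is constructed, so the sketch below only illustrates how a cascaded model might be evaluated at prediction time: cheaper models are tried first, and an input is passed on when the model's confidence falls below a threshold. The `(label, confidence)` model interface, the thresholds, and the toy models are all hypothetical.

```python
from typing import Callable, Sequence, Tuple

# Hypothetical interface: a model maps an input to (label, confidence).
Model = Callable[[object], Tuple[int, float]]

def cascade_predict(x, stages: Sequence[Tuple[Model, float]]) -> Tuple[int, int]:
    """Run the stages in order (cheapest first) and return (label, stage_index)
    from the first stage whose confidence reaches its threshold.
    The last stage always answers."""
    if not stages:
        raise ValueError("cascade needs at least one stage")
    last = len(stages) - 1
    for i, (model, threshold) in enumerate(stages):
        label, confidence = model(x)
        if confidence >= threshold or i == last:
            return label, i

# Toy stand-ins for a cheap and an expensive classifier.
def cheap_model(x):
    return (1, 0.95) if x == "easy" else (0, 0.60)

def expensive_model(x):
    return (2, 0.70)

stages = [(cheap_model, 0.9), (expensive_model, 0.0)]
print(cascade_predict("easy", stages))   # answered by the cheap model
print(cascade_predict("hard", stages))   # falls through to the expensive model
```

Setting a higher threshold on the cheaper stage mirrors the property noted in the abstract that cheaper models are required to be more confident before their prediction is accepted.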

A folded model for compositional data analysis

A folded type model is developed for analyzing compositional data. The proposed model, which is based upon the \alpha-transformation for compositional data, provides a new and flexible class of distributions for modeling data defined on the simplex sample space. Despite its seemingly complex structure, employment of the EM algorithm guarantees efficient parameter estimation. The model is validated through simulation studies and examples which illustrate that the proposed model performs better in terms of capturing the data structure, when compared to the popular logistic normal distribution.

Direct Learning to Rank and Rerank

Learning-to-rank techniques have proven to be extremely useful for prioritization problems, where we rank items in order of their estimated probabilities, and dedicate our limited resources to the top-ranked items. This work exposes a serious problem with the state of learning-to-rank algorithms, which is that they are based on convex proxies that lead to poor approximations. We then discuss the possibility of ‘exact’ reranking algorithms based on mathematical programming. We prove that a relaxed version of the ‘exact’ problem has the same optimal solution, and provide an empirical analysis.

A Study into the similarity in generator and discriminator in GAN architecture

One popular generative model that produces high-quality results is the Generative Adversarial Network (GAN). This type of architecture consists of two separate networks that play against each other. The generator creates an output from the input noise that is given to it. The discriminator has the task of determining if the input to it is real or fake. This takes place repeatedly and eventually leads to the generator modeling the target distribution. This paper includes a study into the actual weights learned by the network and a study into the similarity of the discriminator and generator networks. The paper also tries to leverage the similarity between these networks and provides experimental evidence, using a novel shared architecture, that both networks may indeed have a similar structure.

Density-aware Single Image De-raining using a Multi-stream Dense Network

Single image rain streak removal is an extremely challenging problem due to the presence of non-uniform rain densities in images. We present a novel density-aware multi-stream densely connected convolutional neural network-based algorithm, called DID-MDN, for joint rain density estimation and de-raining. The proposed method enables the network itself to automatically determine the rain-density information and then efficiently remove the corresponding rain-streaks guided by the estimated rain-density label. To better characterize rain-streaks with different scales and shapes, a multi-stream densely connected de-raining network is proposed which efficiently leverages features from different scales. Furthermore, a new dataset containing images with rain-density labels is created and used to train the proposed density-aware network. Extensive experiments on synthetic and real datasets demonstrate that the proposed method achieves significant improvements over the recent state-of-the-art methods. In addition, an ablation study is performed to demonstrate the improvements obtained by different modules in the proposed method. Code can be found at: https://…/hezhangsprinter

Globally Consistent Algorithms for Mixture of Experts

Mixture-of-Experts (MoE) is a widely popular neural network architecture and is a basic building block of highly successful modern neural networks, for example, Gated Recurrent Units (GRU) and Attention networks. However, despite the empirical success, finding an efficient and provably consistent algorithm to learn the parameters has remained a long-standing open problem for more than two decades. In this paper, we introduce the first algorithm that learns the true parameters of a MoE model for a wide class of non-linearities with global consistency guarantees. Our algorithm relies on a novel combination of the EM algorithm and the tensor method of moments techniques. We empirically validate our algorithm on both synthetic and real data sets in a variety of settings, and show superior performance to standard baselines.

Generalization in Machine Learning via Analytical Learning Theory

This paper introduces a novel measure-theoretic learning theory to analyze generalization behaviors of practical interest. The proposed learning theory has the following abilities: 1) to utilize the qualities of each learned representation on the path from raw inputs to outputs in representation learning, 2) to guarantee good generalization errors possibly with arbitrarily rich hypothesis spaces (e.g., arbitrarily large capacity and Rademacher complexity) and non-stable/non-robust learning algorithms, and 3) to clearly distinguish each individual problem instance from each other. Our generalization bounds are relative to a representation of the data, and hold true even if the representation is learned. We discuss several consequences of our results on deep learning, one-shot learning and curriculum learning. Unlike statistical learning theory, the proposed learning theory analyzes each problem instance individually via measure theory, rather than a set of problem instances via statistics. Because of the differences in the assumptions and the objectives, the proposed learning theory is meant to be complementary to previous learning theory and is not designed to compete with it.

Active Learning with Partial Feedback

In the large-scale multiclass setting, assigning labels often consists of answering multiple questions to drill down through a hierarchy of classes. Here, the labor required per annotation scales with the number of questions asked. We propose active learning with partial feedback. In this setup, the learner asks the annotator if a chosen example belongs to a (possibly composite) chosen class. The answer eliminates some classes, leaving the agent with a partial label. Success requires (i) a sampling strategy to choose (example, class) pairs, and (ii) learning from partial labels. Experiments on the TinyImageNet dataset demonstrate that our most effective method achieves a 21% relative improvement in accuracy for a 200k binary question budget. Experiments on the TinyImageNet dataset demonstrate that our most effective method achieves a 26% relative improvement (8.1% absolute) in top1 classification accuracy for a 250k (or 30%) binary question budget, compared to a naive baseline. Our work may also impact traditional data annotation. For example, our best method fully annotates TinyImageNet with only 482k (with EDC though, ERC is 491) binary questions (vs 827k for naive method).

Matching Long Text Documents via Graph Convolutional Networks

Identifying the relationship between two text objects is a core research problem underlying many natural language processing tasks. A wide range of deep learning schemes have been proposed for text matching, mainly focusing on sentence matching, question answering or query document matching. We point out that existing approaches do not perform well at matching long documents, which is critical, for example, to AI-based news article understanding and event or story formation. The reason is that these methods either omit or fail to fully utilize complicated semantic structures in long documents. In this paper, we propose a graph approach to text matching, especially targeting long document matching, such as identifying whether two news articles report the same event in the real world, possibly with different narratives. We propose the Concept Interaction Graph to yield a graph representation for a document, with vertices representing different concepts, each being one or a group of coherent keywords in the document, and with edges representing the interactions between different concepts, connected by sentences in the document. Based on the graph representation of document pairs, we further propose a Siamese Encoded Graph Convolutional Network that learns vertex representations through a Siamese neural network and aggregates the vertex features through Graph Convolutional Networks to generate the matching result. Extensive evaluation of the proposed approach based on two labeled news article datasets created at Tencent for its intelligent news products shows that the proposed graph approach to long document matching significantly outperforms a wide range of state-of-the-art methods.

Learning Image Conditioned Label Space for Multilabel Classification

This work addresses the task of multilabel image classification. Inspired by the great success of deep convolutional neural networks (CNNs) for single-label visual-semantic embedding, we explore extending these models to multilabel images. Specifically, we propose an image-dependent ranking model, which returns a ranked list of labels according to their relevance to the input image. In contrast to conventional CNN models that learn an image representation (i.e. the image embedding vector), the developed model learns a mapping (i.e. a transformation matrix) from an image in an attempt to differentiate between its relevant and irrelevant labels. Despite the conceptual simplicity of our approach, experimental results on a public benchmark dataset demonstrate that the proposed model achieves state-of-the-art performance while using fewer training images than other multilabel classification methods.

Algorithms and Convergence Results of Projection Methods for Inconsistent Feasibility Problems: A Review

The convex feasibility problem (CFP) is to find a feasible point in the intersection of finitely many convex and closed sets. If the intersection is empty then the CFP is inconsistent and a feasible point does not exist. However, algorithmic research of inconsistent CFPs exists and is mainly focused on two directions. One is oriented toward defining other solution concepts that will apply, such as proximity function minimization wherein a proximity function measures in some way the total violation of all constraints. The second direction investigates the behavior of algorithms that are designed to solve a consistent CFP when applied to inconsistent problems. This direction is fueled by situations wherein one lacks a priori information about the consistency or inconsistency of the CFP or does not wish to invest computational resources to get hold of such knowledge prior to running one's algorithm. In this paper we bring under one roof and telegraphically review some recent works on inconsistent CFPs.
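A small sketch of the second direction: alternating projections, a method designed for consistent problems, applied to two disjoint convex sets. The particular sets (a half-plane and a disk), the function names, and the starting point are assumptions chosen for illustration. In this toy case the iterates settle on a pair of nearest points, one in each set, rather than on a common point.

```python
import numpy as np

def project_halfspace(x):
    """Project onto A = {x : x[0] <= 0}."""
    y = x.copy()
    y[0] = min(y[0], 0.0)
    return y

def project_ball(x, center, radius):
    """Project onto the closed ball with the given center and radius."""
    d = x - center
    norm = np.linalg.norm(d)
    if norm <= radius:
        return x.copy()
    return center + radius * d / norm

def alternating_projections(x0, center, radius, iters=100):
    """Alternate projections onto A and the ball; return the last point in each set."""
    b = np.asarray(x0, dtype=float)
    for _ in range(iters):
        a = project_halfspace(b)
        b = project_ball(a, center, radius)
    return a, b

# The ball around (2, 0) with radius 1 does not meet the half-plane x[0] <= 0.
center = np.array([2.0, 0.0])
a, b = alternating_projections([3.0, 3.0], center, 1.0)
print(a, b, np.linalg.norm(a - b))
```

The limiting gap between the two points is the distance between the sets, which is one simple measure of how inconsistent the problem is.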

The Many Faces of Exponential Weights in Online Learning

A standard introduction to online learning might place Online Gradient Descent at its center and then proceed to develop generalizations and extensions like Online Mirror Descent and second-order methods. Here we explore the alternative approach of putting exponential weights (EW) first. We show that many standard methods and their regret bounds then follow as a special case by plugging in suitable surrogate losses and playing the EW posterior mean. For instance, we easily recover Online Gradient Descent by using EW with a Gaussian prior on linearized losses, and, more generally, all instances of Online Mirror Descent based on regular Bregman divergences also correspond to EW with a prior that depends on the mirror map. Furthermore, appropriate quadratic surrogate losses naturally give rise to Online Gradient Descent for strongly convex losses and to Online Newton Step. We further interpret several recent adaptive methods (iProd, Squint, and a variation of Coin Betting for experts) as a series of closely related reductions to exp-concave surrogate losses that are then handled by Exponential Weights. Finally, a benefit of our EW interpretation is that it opens up the possibility of sampling from the EW posterior distribution instead of playing the mean. As already observed by Bubeck and Eldan, this recovers the best-known rate in Online Bandit Linear Optimization.
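The Gaussian-prior claim can be checked with a short calculation. The sketch below assumes an unconstrained domain $\mathbb{R}^d$ (so no projection step), a learning rate $\eta > 0$, a prior variance $\sigma^2$, and linearized losses $\langle w, g_s \rangle$; this notation is introduced here for illustration and may differ from the paper's.

```latex
\begin{align*}
p_1(w) &= \mathcal{N}(w \mid w_1, \sigma^2 I), \\
p_{t+1}(w) &\propto p_1(w)\, \exp\Big(-\eta \sum_{s=1}^{t} \langle w, g_s \rangle\Big) \\
           &\propto \exp\Big(-\frac{1}{2\sigma^2}
              \Big\| w - \Big(w_1 - \eta\sigma^2 \sum_{s=1}^{t} g_s\Big) \Big\|^2\Big), \\
\mathbb{E}_{p_{t+1}}[w] &= w_1 - \eta\sigma^2 \sum_{s=1}^{t} g_s .
\end{align*}
```

Completing the square shows that the posterior stays Gaussian with covariance $\sigma^2 I$, and that its mean moves by $-\eta\sigma^2 g_t$ in each round, which is the unprojected Online Gradient Descent update with step size $\eta\sigma^2$.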

Information Theoretic Co-Training

This paper introduces an information theoretic co-training objective for unsupervised learning. We consider the problem of predicting the future. Rather than predict future sensations (image pixels or sound waves) we predict ‘hypotheses’ to be confirmed by future sensations. More formally, we assume a population distribution on pairs (x,y) where we can think of x as a past sensation and y as a future sensation. We train both a predictor model P_\Phi(z|x) and a confirmation model P_\Psi(z|y) where we view z as hypotheses (when predicted) or facts (when confirmed). For a population distribution on pairs (x,y) we focus on the problem of measuring the mutual information between x and y. By the data processing inequality this mutual information is at least as large as the mutual information between x and z under the distribution on triples (x,z,y) defined by the confirmation model P_\Psi(z|y). The information theoretic training objective for P_\Phi(z|x) and P_\Psi(z|y) can be viewed as a form of co-training where we want the prediction from x to match the confirmation from y.
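The data processing step can be written out as follows. The only assumption is the one stated in the abstract: the triple $(x, z, y)$ is generated by drawing $(x, y)$ from the population and then drawing $z$ from the confirmation model given $y$ alone.

```latex
\begin{align*}
p(x, z, y) &= p(x, y)\, P_\Psi(z \mid y)
  && \text{($z$ depends on $x$ only through $y$, i.e. $x \to y \to z$)} \\
I(x; z) &\le I(x; y)
  && \text{(data processing inequality)}
\end{align*}
```

Any value of $I(x; z)$ obtained under such a model is therefore a lower bound on the mutual information between $x$ and $y$.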

Improving Recommender Systems Beyond the Algorithm

Recommender systems rely heavily on the predictive accuracy of the learning algorithm. Most work on improving accuracy has focused on the learning algorithm itself. We argue that this algorithmic focus is myopic. In particular, since learning algorithms generally improve with more and better data, we propose shaping the feedback generation process as an alternate and complementary route to improving accuracy. To this effect, we explore how changes to the user interface can impact the quality and quantity of feedback data — and therefore the learning accuracy. Motivated by information foraging theory, we study how feedback quality and quantity are influenced by interface design choices along two axes: information scent and information access cost. We present a user study of these interface factors for the common task of picking a movie to watch, showing that these factors can effectively shape and improve the implicit feedback data that is generated while maintaining the user experience.

Deep Collaborative Weight-based Classification

One of the biggest problems in deep learning is the difficulty of retaining consistent robustness when transferring a model trained on one dataset to another dataset. To address this problem, deep transfer learning has been used to execute various vision tasks by using a pre-trained deep model in a diverse dataset. However, the robustness was often far from state-of-the-art. We propose a collaborative weight-based classification method for deep transfer learning (DeepCWC). The method performs the L2-norm based collaborative representation on the original images, as well as the deep features extracted by pre-trained deep models. Two distance vectors will be obtained based on the two representation coefficients, and then fused together via the collaborative weight. The two feature sets show a complementary character, and the original images provide information compensating the missed part in the transferred deep model. A series of experiments conducted on both small and large vision datasets demonstrated the robustness of the proposed DeepCWC in both face recognition and object recognition tasks.

Smooth Loss Functions for Deep Top-k Classification

The top-k error is a common measure of performance in machine learning and computer vision. In practice, top-k classification is typically performed with deep neural networks trained with the cross-entropy loss. Theoretical results indeed suggest that cross-entropy is an optimal learning objective for such a task in the limit of infinite data. In the context of limited and noisy data, however, the use of a loss function that is specifically designed for top-k classification can bring significant improvements. Our empirical evidence suggests that the loss function must be smooth and have non-sparse gradients in order to work well with deep neural networks. Consequently, we introduce a family of smoothed loss functions that are suited to top-k optimization via deep learning. The widely used cross-entropy is a special case of our family. Evaluating our smooth loss functions is computationally challenging: a naïve algorithm would require \mathcal{O}(\binom{n}{k}) operations, where n is the number of classes. Thanks to a connection to polynomial algebra and a divide-and-conquer approach, we provide an algorithm with a time complexity of \mathcal{O}(k n). Furthermore, we present a novel approximation to obtain fast and stable algorithms on GPUs with single floating point precision. We compare the performance of the cross-entropy loss and our margin-based losses in various regimes of noise and data size, for the predominant use case of k=5. Our investigation reveals that our loss is more robust to noise and overfitting than cross-entropy.
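The abstract does not spell out the polynomial connection. One standard example of how a sum over all k-subsets can collapse to O(kn) work is the elementary symmetric polynomial, sketched below as an assumed illustration. It uses a simple sequential recurrence rather than the divide-and-conquer scheme the abstract mentions, and the function names are hypothetical.

```python
from itertools import combinations
from math import prod

def esp_bruteforce(values, k):
    """Sum of products over all k-subsets: one term per subset, C(n, k) terms."""
    return sum(prod(subset) for subset in combinations(values, k))

def esp_recurrence(values, k):
    """Same quantity via the recurrence e_j <- e_j + v * e_{j-1},
    applied once per value: about k * n multiplications."""
    e = [1.0] + [0.0] * k          # e[j] holds e_j of the values seen so far
    for v in values:
        for j in range(k, 0, -1):  # go downwards so e[j-1] still excludes v
            e[j] += v * e[j - 1]
    return e[k]

values = [1.0, 2.0, 3.0, 4.0]
print(esp_bruteforce(values, 2), esp_recurrence(values, 2))
```

The recurrence is the coefficient update for multiplying the polynomial by (1 + v t), truncated at degree k, which is where the polynomial view comes from.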

A model for system uncertainty in reinforcement learning

This work provides a rigorous framework for studying continuous time control problems in uncertain environments. The framework considered models uncertainty in state dynamics as a measure on the space of functions. This measure is considered to change over time as agents learn their environment. This model can be seen as a variant of either Bayesian reinforcement learning or adaptive control. We study necessary conditions for locally optimal trajectories within this model, in particular deriving an appropriate dynamic programming principle and Hamilton-Jacobi equations. This model provides one possible framework for studying the tradeoff between exploration and exploitation in reinforcement learning.

Learning Multiple Categories on Deep Convolution Networks

Deep convolution networks have proved very successful with big datasets such as the 1000-classes ImageNet. Results show that the error rate increases slowly as the size of the dataset increases. Experiments presented here may explain why these networks are very effective in solving big recognition problems. If the big task is made up of multiple smaller tasks, then the results show the ability of deep convolution networks to decompose the complex task into a number of smaller tasks and to learn them simultaneously. The results show that the performance of solving the big task on a single network is very close to the average performance of solving each of the smaller tasks on a separate network. Experiments also show the advantage of using task specific or category labels in combination with class labels.

Detecting Learning vs Memorization in Deep Neural Networks using Shared Structure Validation Sets

The roles played by learning and memorization represent an important topic in deep learning research. Recent work on this subject has shown that the optimization behavior of DNNs trained on shuffled labels is qualitatively different from DNNs trained with real labels. Here, we propose a novel permutation approach that can differentiate memorization from learning in deep neural networks (DNNs) trained as usual (i.e., using the real labels to guide the learning, rather than shuffled labels). The evaluation of whether the DNN has learned and/or memorized happens in a separate step where we compare the predictive performance of a shallow classifier trained with the features learned by the DNN, against multiple instances of the same classifier, trained on the same input, but using shuffled labels as outputs. By evaluating these shallow classifiers in validation sets that share structure with the training set, we are able to tell apart learning from memorization. Application of our permutation approach to multi-layer perceptrons and convolutional neural networks trained on image data corroborated many findings from other groups. Most importantly, our illustrations also uncovered interesting dynamic patterns about how DNNs memorize over increasing numbers of training epochs, and support the surprising result that DNNs are still able to learn, rather than only memorize, when trained with pure Gaussian noise as input.

Fairness of Exposure in Rankings
Nonuniversality and strongly interacting two-level systems in glasses at low temperatures
On Spin Distributions for Generic $p$-spin models
Bots sustain and inflate striking opposition in online social systems
On the Connection Between Learning Two-Layers Neural Networks and Tensor Decomposition
MoNet: Moments Embedding Network
Unsupervised Phase Mapping of X-ray Diffraction Data by Nonnegative Matrix Factorization Integrated with Custom Clustering
Detection limits in the high-dimensional spiked rectangular model
Simple proofs and expressions for the restricted partition function and its polynomial part
Achievability Bounds for T-Fold Irregular Repetition Slotted ALOHA Scheme in the Gaussian Multiple Access Channel
A joint decomposition method for global optimization of multiscenario nonconvex mixed-integer nonlinear programs
Cooperative Robot Localization Using Event-triggered Estimation
Reimagine Procrastination: Music Preference and Health Habits as Factors on Self-Perceived Procrastination of Young People
Devon: Deformable Volume Network for Learning Optical Flow
Crystal graphs for shifted tableaux
Positive temperature dynamics on Gelfand-Tsetlin patterns restricted by wall
Mean Field Approximations to a Queueing System with Threshold-Based Workload Control Scheme
On the Statistical Challenges of Echo State Networks and Some Potential Remedies
SufiSent – Universal Sentence Representations Using Suffix Encodings
Sample Complexity of Stochastic Variance-Reduced Cubic Regularization for Nonconvex Optimization
On the scaling of polynomial features for representation matching
Periodicity in Data Streams with Wildcards
Scalable Label Propagation for Multi-relational Learning on Tensor Product Graph
Fast Nonconvex Deconvolution of Calcium Imaging Data
Coresets For Monotonic Functions with Applications to Deep Learning
Interpreting Neural Network Judgments via Minimal, Stable, and Symbolic Corrections
Universal Growth in Production Economies
3LC: Lightweight and Effective Traffic Compression for Distributed Machine Learning
The ‘quantum’ Turan problem for operator systems
Investigating Rumor News Using Agreement-Aware Search
Third-Party Data Providers Ruin Simple Mechanisms
Physics and Human-Based Information Fusion for Improved Resident Space Object Tracking
Metastable state en route to traveling-wave synchronization state
Improved rate for a multi-server coded caching
Angle constrained path to cluster multiple manifolds
Limits for Partial Maxima of Gaussian Random Vectors
Sequence-based Multi-lingual Low Resource Speech Recognition
Conditional Adversarial Synthesis of 3D Facial Action Units
Inapproximability of Matrix $p\rightarrow q$ Norms
Static-Memory-Hard Functions and Nonlinear Space-Time Tradeoffs via Pebbling
Nonparametric Bayesian Sparse Graph Linear Dynamical Systems
Binary Constrained Deep Hashing Network for Image Retrieval without Human Intervention
Constant Factor Approximation Algorithm for Weighted Flow Time on a Single Machine in Pseudo-polynomial time
Max-size popular matchings and extensions
Learning to Play with Intrinsically-Motivated Self-Aware Agents
Scaling-up Split-Merge MCMC with Locality Sensitive Sampling (LSS)
Distributed Compression of Graphical Data
Load Balanced GANs for Multi-view Face Image Synthesis
The asymptotic expansion of the regular discretization error of Itô integrals
A two-class queueing system with constant retrial policy and general class dependent service times
Spatial Morphing Kernel Regression For Feature Interpolation
Asymptotic efficiency of restart and checkpointing
Communication Using Eigenvalues of Higher Multiplicity of the Nonlinear Fourier Transform
Non-Asymptotic Bounds and a General Formula for the Rate-Distortion Region of the Successive Refinement Problem
Emergence of Structured Behaviors from Curiosity-Based Intrinsic Motivation
An Information-Theoretical Analysis of the Minimum Cost to Erase Information
Multiclass Weighted Loss for Instance Segmentation of Cluttered Cells
The Moduli Space of Polynomial Maps and Their Fixed-Point Multipliers: II. Improvement to the Algorithm and Monic Centered Polynomials
Optimal Base Station Antenna Downtilt in Downlink Cellular Networks
Dual Extrapolation for Faster Lasso Solvers
Data-Driven Forecasting of High-Dimensional Chaotic Systems with Long-Short Term Memory Networks
Epistemic Graphs for Representing and Reasoning with Positive and Negative Influences of Arguments
ViTac: Feature Sharing between Vision and Tactile Sensing for Cloth Texture Recognition
The real polynomial eigenvalue problem is well conditioned on the average
Geometric Law for Multiple Returns until a Hazard
Skueue: A Scalable and Sequentially Consistent Distributed Queue
Spectrally approximating large graphs with smaller graphs
Density Weighted Connectivity of Grass Pixels in Image Frames for Biomass Estimation
Adversarial classification: An adversarial risk analysis approach
A framework for cost-constrained genome rearrangement under Double Cut and Join
Packing unequal rectangles and squares in a fixed size circular container using formulation space search
Subspace-Induced Gaussian Processes
Variational solutions to nonlinear stochastic differential equations in Hilbert spaces
Uniqueness of DRS as the 2 Operator Resolvent-Splitting and Impossibility of 3 Operator Resolvent-Splitting
A Generative Deep Recurrent Model for Exchangeable Data
Personal research information system. About developing the methods for searching patent analogs of invention
On detectability of labeled Petri nets with inhibitor arcs
Critical yield numbers and limiting yield surfaces of particle arrays settling in a Bingham fluid
Clipped Action Policy Gradient
Emulating dynamic non-linear simulators using Gaussian processes
Universal Hypothesis Testing with Kernels: Asymptotically Optimal Tests for Goodness of Fit
DeepASL: Enabling Ubiquitous and Non-Intrusive Word and Sentence-Level Sign Language Translation
Batch Normalization and the impact of batch structure on the behavior of deep convolution networks
On a problem of Sárközy and Sós for multivariate linear forms
Design of Irregular SC-LDPC Codes With Non-Uniform Degree Distributions by Linear Programing
Randomized sliding window algorithms for regular languages
Ordered Preference Elicitation Strategies for Supporting Multi-Objective Decision Making
About Kendall’s regression
An exact extremal result for tournaments and 4-uniform hypergraphs
Explanations based on the Missing: Towards Contrastive Explanations with Pertinent Negatives
Neumann Boundary Problem for Parabolic Partial Differential Equations with Divergence Terms
Spanning Tree Congestion and Computation of Generalized Győri-Lovász Partition
A gauge-invariant reversible cellular automaton
Minimal cut-sets in the power graph of certain finite non-cyclic groups
MIS in the Congested Clique Model in $O(\log \log \Delta)$ Rounds
Scalable and Robust Sparse Subspace Clustering Using Randomized Clustering and Multilayer Graphs
Landmark Placement for Localization in a GPS-denied Environment
Building Efficient ConvNets using Redundant Feature Pruning
Classical large deviations theorems on complete Riemannian manifolds
Non-Malleable Codes for Small-Depth Circuits
Multiscale finite elements through advection-induced coordinates for transient advection-diffusion equations
Stochastic Video Generation with a Learned Prior
Predicting Natural Hazards with Neuronal Networks
Global synchronization of partially forced Kuramoto oscillators on Networks
Managing and Querying Multi-versioned Documents using a Distributed Key-Value Store
Identifying $H_\infty$-Models: An LMI Approach
A rainbow blow-up lemma
Statistics on some classes of knot shadows
ARRIVAL: Next Stop in CLS
Condition numbers of stochastic mean payoff games and what they say about nonarchimedean semidefinite programming
Cutting Lemma and Union Lemma for the Domination Game