Privacy of Dependent Users Against Statistical Matching

Modern applications significantly enhance user experience by adapting to each user’s individual condition and/or preferences. While this adaptation can greatly improve a user’s experience or be essential for the application to work, the exposure of user data to the application presents a significant privacy threat to the users\textemdash even when the traces are anonymized (since the statistical matching of an anonymized trace to prior user behavior can identify a user and their habits). Because of the current and growing algorithmic and computational capabilities of adversaries, provable privacy guarantees as a function of the degree of anonymization and obfuscation of the traces are necessary. Our previous work has established the requirements on anonymization and obfuscation in the case that data traces are independent between users. However, the data traces of different users will be dependent in many applications, and an adversary can potentially exploit such. In this paper, we consider the impact of correlation between user traces on their privacy. First, we demonstrate that the adversary can readily identify the association graph, revealing which user data traces are correlated. Next, we demonstrate that the adversary can use this association graph to break user privacy with significantly shorter traces than in the case of independent users, and that obfuscating data traces independently across users is often insufficient to remedy such leakage. Finally, we discuss how users can employ data trace dependency to improve privacy by performing better obfuscation.


High-Performance Parallel Implementation of Genetic Algorithm on FPGA

Genetic Algorithms (GAs) are used to solve search and optimization problems in which an optimal solution can be found using an iterative process with probabilistic and non-deterministic transitions. However, depending on the problem’s nature, the time required to find a solution can be high in sequential machines due to the computational complexity of genetic algorithms. This work proposes a parallel implementation of a genetic algorithm on field-programmable gate array (FPGA). Optimization of the system’s processing time is the main goal of this project. Results associated with the processing time and area occupancy (on FPGA) for various population sizes are analyzed. Studies concerning the accuracy of the GA response for the optimization of two variables functions were also evaluated for the hardware implementation. However, the high-performance implementation proposes in this paper is able to work with more variable from some adjustments on hardware architecture.


CR-GAN: Learning Complete Representations for Multi-view Generation

Generating multi-view images from a single-view input is an essential yet challenging problem. It has broad applications in vision, graphics, and robotics. Our study indicates that the widely-used generative adversarial network (GAN) may learn ‘incomplete’ representations due to the single-pathway framework: an encoder-decoder network followed by a discriminator network. We propose CR-GAN to address this problem. In addition to the single reconstruction path, we introduce a generation sideway to maintain the completeness of the learned embedding space. The two learning pathways collaborate and compete in a parameter-sharing manner, yielding considerably improved generalization ability to ‘unseen’ dataset. More importantly, the two-pathway framework makes it possible to combine both labeled and unlabeled data for self-supervised learning, which further enriches the embedding space for realistic generations. The experimental results prove that CR-GAN significantly outperforms state-of-the-art methods, especially when generating from ‘unseen’ inputs in wild conditions.


Compressed Sensing Beyond the IID and Static Domains: Theory, Algorithms and Applications

Sparsity is a ubiquitous feature of many real world signals such as natural images and neural spiking activities. Conventional compressed sensing utilizes sparsity to recover low dimensional signal structures in high ambient dimensions using few measurements, where i.i.d measurements are at disposal. However real world scenarios typically exhibit non i.i.d and dynamic structures and are confined by physical constraints, preventing applicability of the theoretical guarantees of compressed sensing and limiting its applications. In this thesis we develop new theory, algorithms and applications for non i.i.d and dynamic compressed sensing by considering such constraints. In the first part of this thesis we derive new optimal sampling-complexity tradeoffs for two commonly used processes used to model dependent temporal structures: the autoregressive processes and self-exciting generalized linear models. Our theoretical results successfully recovered the temporal dependencies in neural activities, financial data and traffic data. Next, we develop a new framework for studying temporal dynamics by introducing compressible state-space models, which simultaneously utilize spatial and temporal sparsity. We develop a fast algorithm for optimal inference on such models and prove its optimal recovery guarantees. Our algorithm shows significant improvement in detecting sparse events in biological applications such as spindle detection and calcium deconvolution. Finally, we develop a sparse Poisson image reconstruction technique and the first compressive two-photon microscope which uses lines of excitation across the sample at multiple angles. We recovered diffraction-limited images from relatively few incoherently multiplexed measurements, at a rate of 1.5 billion voxels per second.


XGBoost: Scalable GPU Accelerated Learning

We describe the multi-GPU gradient boosting algorithm implemented in the XGBoost library (https://…/xgboost ). Our algorithm allows fast, scalable training on multi-GPU systems with all of the features of the XGBoost library. We employ data compression techniques to minimise the usage of scarce GPU memory while still allowing highly efficient implementation. Using our algorithm we show that it is possible to process 115 million training instances in under three minutes on a publicly available cloud computing instance. The algorithm is implemented using end-to-end GPU parallelism, with prediction, gradient calculation, feature quantisation, decision tree construction and evaluation phases all computed on device.


A scalable H-matrix approach for the solution of boundary integral equations on multi-GPU clusters
Team assembly mechanisms and the knowledge produced in the Mexico’s National Institute of Geriatrics: a network analysis and agent-based modelling approach
Neural-net-induced Gaussian process regression for function approximation and PDE solution
Building a path-integral calculus
A hybrid deep learning approach for medical relation extraction
Temporal disorder in discontinuous non-equilibrium phase transitions: general results
Comment on: Decomposition of structural learning about directed acyclic graphs [1]
Discussion on Using Stacking to Average Bayesian Predictive Distributions by Yao et al
A novel distributed secondary frequency regulation scheme for power networks with high order turbine governor dynamics
Footwear Size Recommendation System
Neural Network Cognitive Engine for Autonomous and Distributed Underlay Dynamic Spectrum Access
Understanding Fashionability: What drives sales of a style
A NUMA-Aware Provably-Efficient Task-Parallel Platform Based on the Work-First Principle
Deep Learning Based Instance Segmentation in 3D Biomedical Images Using Weak Annotation
Adversarial Reprogramming of Neural Networks
Augmented Hilbert series of numerical semigroups
Statistical Description of Transport in Multimode Fibers with Mode-Dependent Loss
Evaluation of adaptive treatment strategies in an observational study where time-varying covariates are not monitored systematically
Secrecy Beamforming for SWIPT MISO Heterogeneous Cellular Networks
3D Normal Coordinate Systems for Cortical Areas
Real-Time Optimal Power Flow under Wind Energy Penetration-Part I: Approach
Real-Time Optimal Power Flow under Wind Energy Penetration-Part II: Implementation
Scoring Alternative Forecast Distributions: Completing the Kullback Distance Complex
Cross-Discourse and Multilingual Exploration of Textual Corpora with the DualNeighbors Algorithm
A New Angle on L2 Regularization
Fully Distributed Cooperative Charging for Plug-in Electric Vehicles in Constrained Power Networks
On the Decoding of Polar Codes on Permuted Factor Graphs
List-three-coloring graphs with no induced $P_6+rP_3$
Quit When You Can: Efficient Evaluation of Ensembles with Ordering Optimization
Polynomial-time probabilistic reasoning with partial observations via implicit learning in probability logics
Optimal LQG Control under Delay-dependent Costly Information
Stanley symmetric functions for signed involutions
Proxy Fairness
Towards An Intelligent Deployment of Wireless Sensor Networks
Adversarial and Perceptual Refinement for Compressed Sensing MRI Reconstruction
Subject2Vec: Generative-Discriminative Approach from a Set of Image Patches to a Vector
Using Exposure Mappings as Side Information in Experiments with Interference
A Bootstrap Method for Goodness of Fit and Model Selection with a Single Observed Network
Tight Prediction Intervals Using Expanded Interval Minimization
Active query-driven visual search using probabilistic bisection and convolutional neural networks
In Vivo Communication in Wireless Body Area Networks
A Multimodal Recommender System for Large-scale Assortment Generation in E-commerce
Additivity Assessment in Nonparametric Models Using Ratio of Pseudo Marginal Likelihoods
Human Action Recognition and Prediction: A Survey
Back stable Schubert calculus
An Influence Network Model to Study Discrepancies in Expressed and Private Opinions
Nonparametric competing risks analysis using Bayesian Additive Regression Trees (BART)
On Shige Peng’s central limit theorem
Rock, Paper, Scissors, Etc – The Theory of Regular Tournaments
Learning Multi-Step Robotic Tasks from Observation
A Graphon Approach to Limiting Spectral Distributions of Wigner-type Matrices
Neural Machine Translation with Key-Value Memory-Augmented Attention
Energy Optimization for Cellular-Connected Multi-UAV Mobile Edge Computing Systems with Multi-Access Schemes
Opinion Dynamics with Stubborn Agents
Path-ZVA: general, efficient and automated importance sampling for highly reliable Markovian systems
Hierarchical Dirichlet Process-based Open Set Recognition
On Hypergraph Lagrangians and Frankl-Füredi’s Conjecture
A Simple Characterization of Proportionally 2-choosable Graphs
Gated Feedback Refinement Network for Coarse-to-Fine Dense Semantic Image Labeling
A 28/37/39GHz Multiband Linear Doherty Power Amplifier in Silicon for 5G Applications
Action Recognition for Depth Video using Multi-view Dynamic Images
Multicasting Energy and Information Simultaneously
Exact persistence exponent for the $2d$-diffusion equation and related Kac polynomials
Generating Connected Random Graphs
Shifted Laplacian multigrid for the elastic Helmholtz equation
Approximation Algorithms for Complex-Valued Ising Models on Bounded Degree Graphs
Availability and Reliability of Wireless Links in 5G Systems: A Space-Time Approach
On The Ruin Problem With Investment When The Risky Asset Is A Semimartingale
Properties of the weighted log-rank test under delayed effects assumption in the design of confirmatory studies with delayed effects
A General Multi-agent Epistemic Planner Based on Higher-order Belief Change
A Low-Latency List Successive-Cancellation Decoding Implementation for Polar Codes
Generate the corresponding Image from Text Description using Modified GAN-CLS Algorithm
Dependence in Propositional Logic: Formula-Formula Dependence and Formula Forgetting — Application to Belief Update and Conservative Extension
Excavate Condition-invariant Space by Intrinsic Encoder
Definable Inapproximability: New Challenges for Duplicator
Guaranteed Deterministic Bounds on the Total Variation Distance between Univariate Mixtures
Hyperspectral Image Dataset\\for Benchmarking on Salient Object Detection
Fake News Identification on Twitter with Hybrid CNN and RNN Models
On the Potential of Multi-Mode Antennas for Direction-of-Arrival Estimation
Bias in Semantic and Discourse Interpretation
On the integrability of strongly regular graphs
Unsupervised Detection and Explanation of Latent-class Contextual Anomalies
A flexible model for training action localization with varying levels of supervision
Posthoc Interpretability of Learning to Rank Models using Secondary Training Data
Knowledge-Based Distant Regularization in Learning Probabilistic Models
Inequivalent embeddings of 3-connected 3-regular planar graphs on the torus
Quantum aspects of high dimensional formal representation of conceptual spaces
Measuring the quality of Synthetic data for use in competitions
Herding behavior in cryptocurrency markets
Ignition: An End-to-End Supervised Model for Training Simulated Self-Driving Vehicles
Centre-of-mass like superposition of Ornstein-Uhlenbeck processes: a pathway to non-autonomous stochastic differential equations and to fractional diffusion
On the phase transition in resource-competition models
Detecting Mammals in UAV Images: Best Practices to address a substantially Imbalanced Dataset with Deep Learning
Bayesian Uncertainty Directed Trial Designs
Personalizing Similar Product Recommendations in Fashion E-commerce
Inhomogeneous Partition Regularity
Socio-economic constraints to maximum human lifespan
Learning from graphs with structural variation
Theory IIIb: Generalization in Deep Networks
Convergence Problems with Generative Adversarial Networks (GANs)
Marginally Parametrized Spatio-Temporal Models and Stepwise Maximum Likelihood Estimation
On embeddings as alternative paradigm for relational learning
Revealing subgroup structure in ranked data using a Bayesian WAND
Uniform tilings of the hyperbolic plane
WEBCA: Weakly-Electric-Fish Bioinspired Cognitive Architecture
Stability conditions for the explicit integration of projection based nonlinear reduced-order and hyper reduced structural mechanics finite element models
$\mathcal{U}$-bootstrap percolation: critical probability, exponential decay and applications
A Probabilistic Modelling Approach to One-Shot Gesture Recogntion
(k,p)-Planarity: A Generalization of Hybrid Planarity
Bounds on the Approximation Power of Feedforward Neural Networks
Discourse-Wizard: Discovering Deep Discourse Structure in your Conversation with RNNs
States-conserving density of states for Altshuler-Aronov effect: Heuristic derivation
Hierarchical Robust Analysis for Identified Systems in Network
Continuity result for the rate function of the simple random walk on supercritical percolation clusters
Certifying Global Optimality of Graph Cuts via Semidefinite Relaxation: A Performance Guarantee for Spectral Clustering
Towards real-time unsupervised monocular depth estimation on CPU
Using General Adversarial Networks for Marketing: A Case Study of Airbnb
Complying with Data Handling Requirements in Cloud Storage Systems
Strong Solutions of Mean-Field Stochastic Differential Equations with irregular drift
MRFusion: A Deep Learning architecture to fuse PAN and MS imagery for land cover mapping
Simulating Ising and Potts models and external fields with non-equilibrium condensates
Investigating Speech Features for Continuous Turn-Taking Prediction Using LSTMs
Bayesian Deep Learning on a Quantum Computer
Neighbor-Locating Colorings in Graphs
Subvector Inference in Partially Identified Models with Many Moment Inequalities
Simplified Active Calibration
SynNet: Structure-Preserving Fully Convolutional Networks for Medical Image Synthesis
Play Duration based User-Entity Affinity Modeling in Spoken Dialog System
Deep Learning and its Application to LHC Physics
Constructions of Locally Recoverable Codes which are Optimal
Comparing Graph Clusterings: Set partition measures vs. Graph-aware measures
Mammographic Image Enhancement using Digital Image Processing Technique
Bayesian Counterfactual Risk Minimization
Learning with minimal information in continuous games
Recognition of Offline Handwritten Devanagari Numerals using Regional Weighted Run Length Features
Sparse Three-parameter Restricted Indian Buffet Process for Understanding International Trade
A Hoeffding inequality for Markov chains
Rapid covariance-based sampling of linear SPDE approximations in the multilevel Monte Carlo method
Counting to Explore and Generalize in Text-based Games
Co-Diffusion of Social Contagions
Facility location under matroid constraints: fixed-parameter algorithms and applications
Understanding the Nonlinear Behavior and Frequency Stability of a Grid-synchronized VSC Under Grid Voltage Dips
Visual Attention and its Intimate Links to Spatial Cognition
The Sphere Packing Bound for DSPCs with Feedback a la Augustin
TextWorld: A Learning Environment for Text-based Games
End-to-end Learning of Multi-sensor 3D Tracking by Detection
Quantized Decentralized Consensus Optimization
Factorizable Net: An Efficient Subgraph-based Framework for Scene Graph Generation
High Dimensional Discrete Integration by Hashing and Optimization
Nonparametric learning from Bayesian models with randomized objective functions
The sharp phase transition for level set percolation of smooth planar Gaussian fields
Algorithmic Pirogov-Sinai theory
Conformal invariance of the loop-erased percolation explorer