Very Efficient Training of Convolutional Neural Networks using Fast Fourier Transform and Overlap-and-Add

Convolutional neural networks (CNNs) are currently state-of-the-art for various classification tasks, but are computationally expensive. Propagating through the convolutional layers is very slow, as each kernel in each layer must sequentially calculate many dot products for a single forward and backward propagation which equates to \mathcal{O}(N^{2}n^{2}) per kernel per layer where the inputs are N \times N arrays and the kernels are n \times n arrays. Convolution can be efficiently performed as a Hadamard product in the frequency domain. The bottleneck is the transformation which has a cost of \mathcal{O}(N^{2}\log_2 N) using the fast Fourier transform (FFT). However, the increase in efficiency is less significant when N\gg n as is the case in CNNs. We mitigate this by using the ‘overlap-and-add’ technique reducing the computational complexity to \mathcal{O}(N^2\log_2 n) per kernel. This method increases the algorithm’s efficiency in both the forward and backward propagation, reducing the training and testing time for CNNs. Our empirical results show our method reduces computational time by a factor of up to 16.3 times the traditional convolution implementation for a 8 \times 8 kernel and a 224 \times 224 image.


A Survey on Artificial Intelligence and Data Mining for MOOCs

Massive Open Online Courses (MOOCs) have gained tremendous popularity in the last few years. Thanks to MOOCs, millions of learners from all over the world have taken thousands of high-quality courses for free. Putting together an excellent MOOC ecosystem is a multidisciplinary endeavour that requires contributions from many different fields. Artificial intelligence (AI) and data mining (DM) are two such fields that have played a significant role in making MOOCs what they are today. By exploiting the vast amount of data generated by learners engaging in MOOCs, DM improves our understanding of the MOOC ecosystem and enables MOOC practitioners to deliver better courses. Similarly, AI, supported by DM, can greatly improve student experience and learning outcomes. In this survey paper, we first review the state-of-the-art artificial intelligence and data mining research applied to MOOCs, emphasising the use of AI and DM tools and techniques to improve student engagement, learning outcomes, and our understanding of the MOOC ecosystem. We then offer an overview of key trends and important research to carry out in the fields of AI and DM so that MOOCs can reach their full potential.


BUbiNG: Massive Crawling for the Masses

Although web crawlers have been around for twenty years by now, there is virtually no freely available, opensource crawling software that guarantees high throughput, overcomes the limits of single-machine systems and at the same time scales linearly with the amount of resources available. This paper aims at filling this gap, through the description of BUbiNG, our next-generation web crawler built upon the authors’ experience with UbiCrawler [Boldi et al. 2004] and on the last ten years of research on the topic. BUbiNG is an opensource Java fully distributed crawler; a single BUbiNG agent, using sizeable hardware, can crawl several thousands pages per second respecting strict politeness constraints, both host- and IP-based. Unlike existing open-source distributed crawlers that rely on batch techniques (like MapReduce), BUbiNG job distribution is based on modern high-speed protocols so to achieve very high throughput.


A Novel Memetic Feature Selection Algorithm

Feature selection is a problem of finding efficient features among all features in which the final feature set can improve accuracy and reduce complexity. In feature selection algorithms search strategies are key aspects. Since feature selection is an NP-Hard problem; therefore heuristic algorithms have been studied to solve this problem. In this paper, we have proposed a method based on memetic algorithm to find an efficient feature subset for a classification problem. It incorporates a filter method in the genetic algorithm to improve classification performance and accelerates the search in identifying core feature subsets. Particularly, the method adds or deletes a feature from a candidate feature subset based on the multivariate feature information. Empirical study on commonly data sets of the university of California, Irvine shows that the proposed method outperforms existing methods.


Sentiment Analysis of Twitter Data :A Survey of Techniques

With the advancement of web technology and its growth, there is a huge volume of data present in the web for internet users and a lot of data is generated too. Internet has become a platform for online learning, exchanging ideas and sharing opinions. Social networking sites like Twitter, Facebook, Google+ are rapidly gaining popularity as they allow people to share and express their views about topics,have discussion with different communities, or post messages across the world. There has been lot of work in the field of sentiment analysis of twitter data. This survey focuses mainly on sentiment analysis of twitter data which is helpful to analyze the information in the tweets where opinions are highly unstructured, heterogeneous and are either positive or negative, or neutral in some cases. In this paper, we provide a survey and a comparative analyses of existing techniques for opinion mining like machine learning and lexicon-based approaches, together with evaluation metrics. Using various machine learning algorithms like Naive Bayes, Max Entropy, and Support Vector Machine, we provide a research on twitter data streams. We have also discussed general challenges and applications of Sentiment Analysis on Twitter


A Distributed and Incremental SVD Algorithm for Agglomerative Data Analysis on Large Networks

In this paper, we show that the SVD of a matrix can be constructed efficiently in a hierarchical approach. Our algorithm is proven to recover the singular values and left singular vectors if the rank of the input matrix A is known. Further, the hierarchical algorithm can be used to recover the d largest singular values and left singular vectors with bounded error. We also show that the proposed method is stable with respect to roundoff errors or corruption of the original matrix entries. Numerical experiments validate the proposed algorithms and parallel cost analysis.


Intelligent Conversational Bot for Massive Online Open Courses (MOOCs)

Massive Online Open Courses (MOOCs) which were introduced in 2008 has since drawn attention around the world for both its advantages as well as criticism on its drawbacks. One of the issues in MOOCs which is the lack of interactivity with the instructor has brought conversational bot into the picture to fill in this gap. In this study, a prototype of MOOCs conversational bot, MOOC-bot is being developed and integrated into MOOCs website to respond to the learner inquiries using text or speech input. MOOC-bot is using the popular Artificial Intelligence Markup Language (AIML) to develop its knowledge base, leverage from AIML capability to deliver appropriate responses and can be quickly adapted to new knowledge domains. The system architecture of MOOC-bot consists of knowledge base along with AIML interpreter, chat interface, MOOCs website and Web Speech API to provide speech recognition and speech synthesis capability. The initial MOOC-bot prototype has the general knowledge from the past Loebner Prize winner – ALICE, frequent asked questions, and a content offered by Universiti Teknikal Malaysia Melaka (UTeM). The evaluation of MOOC-bot based on the past competition questions from Chatterbox Challenge (CBC) and Loebner Prize has shown that it was able to provide correct answers most of the time during the test and demonstrated the capability to prolong the conversation. The advantages of MOOC-bot such as able to provide 24-hour service that can serve different time zones, able to have knowledge in multiple domains, and can be shared by multiple sites simultaneously have outweighed its existing limitations.


Stochastic Airy semigroup through tridiagonal matrices

P-values: misunderstood and misused

Variational formulas for the power of the binary hypothesis testing problem with applications

New Pivot Selection for Sparse Symmetric Indefinite Factorization

Dynamics of Disordered States in the Bose-Hubbard Model with Confinement

Survey on the attention based RNN model and its applications in computer vision

Combinatorial algorithm for counting small induced graphs and orbits

Multivariate convex regression: global risk bounds and adaptation

Chip firing on Dynkin diagrams and McKay quivers

On distributionally robust extreme value analysis

Quickest Change Detection with Mismatched Post-Change Models

Spectral analogues of Moon-Moser’s theorem on Hamilton paths in bipartite graphs

Functional archetype and archetypoid analysis

Edge-Disjoint Node-Independent Spanning Trees in Dense Gaussian Networks

Vertex Colorings without Rainbow or Monochromatic Subgraphs

Identification and classification of TCM syndrome types among patients with vascular mild cognitive impairment using latent tree analysis

Fisher Motion Descriptor for Multiview Gait Recognition

Simple and Efficient Fully-Functional Succinct Trees

Structure trees, networks and almost invariant sets

Robust Optimal Risk Sharing and Risk Premia in Expanding Pools

Synchronized, Chimera and Multichimera States in Simple Cellular Automata Model of Coupled Oscillators

On the Cycle Structure of Mallows Permutations

Moment explosions, implied volatility and local volatility at extreme strikes

Integral equation for the transition density of the multidimensional Markov random flight

Suppression of neuronal phase synchronisation in cat cerebral cortex

Spectral characterizations of two families of nearly complete bipartite graphs

A Cost-Effective Strategy for Storing Scientific Datasets with Multiple Service Providers in the Cloud

Darning and gluing of diffusions

Bidding policies for market-based HPC workflow scheduling

Trapezoidal numbers, divisor functions, and a partition theorem of Sylvester

Permutation codes, source coding and a generalisation of Bollobás-Lubell-Yamamoto-Meshalkin and Kraft inequalities

Holistic Approach for Fault-Tolerant Network-on-Chip based Many-Core Systems

Absolute continuity of the invariant measure in Piecewise Deterministic Markov Processes having degenerate jumps

LIA-RAG: a system based on graphs and divergence of probabilities applied to Speech-To-Text Summarization

Instrumental variable approaches for estimating complier average causal effects on bivariate outcomes in randomised trials with non-compliance

Energy Efficiency of Many-Soft-Core Processors

Sparse exchangeable graphs and their limits via graphon processes

On the robustness to small trends of parameter estimation for continuous-time stationary models with memory

Inducibility in binary trees and crossings in random tanglegrams

Core partitions into distinct parts and an analog of Euler’s theorem

Total flooding time and rumor propagation on graphs