Maximum Likelihood Estimation for Generalized Linear Mixed Models (mcemGLM)
Maximum likelihood estimation for generalized linear mixed models via Monte Carlo EM.
File-Backed Matrix Class with Convenient Read and Write Access (filematrix)
Interface for working with large matrices stored in files, not in computer memory. Supports multiple data types (double, integer, logical and raw) of different sizes (e.g. 4, 2, or 1 byte integers). Access to parts of the matrix is done by indexing, exactly as with usual R matrices. Supports very large matrices (tested on 1 terabyte matrix), allowing for more than 2^32 rows or columns. Cross-platform as the package has R code only, no C/C++.
A Resource Manager for R Objects (repo)
This is an R data manager meant to avoid manual storage/retrieval of R data to/from the file system. It builds one (or more) centralized repository where R objects are stored with annotations, tags, dependency notes, provenance traces. It also provides navigation tools to easily locate, load and edit previously stored resources.
Density Based Clustering of Applications with Noise (DBSCAN) (dbscan)
A fast reimplementation of the DBSCAN clustering algorithm using the kd-tree data structure for speedup.
Software Tools to Quantify Structural Importance of Nodes in a Network (influenceR)
Provides functionality to compute various node centrality measures on networks. Included are functions to compute betweenness centrality (by utilizing Madduri and Bader’s SNAP library), implementations of Burt’s constraint and effective network size (ENS) metrics, Borgatti’s algorithm to identify key players, and Valente’s bridging metric. On Unix systems, the betweenness, Key Players, and bridging implementations are parallelized with OpenMP, which may run faster on systems which have OpenMP configured.