kpodclustr


An R package for implementing $k$-POD, a method for clustering partially observed data


(Note: Python implementation by Ishaan Radia and Eric C. Chi can be found at https://pypi.org/project/kPOD/.)

About the package

The ${\tt kpodclustr}$ package contains R code for implementing $k$-POD, a method for $k$-means clustering on partially observed data.

Requirements

You’ll need R to use this package. If you don’t already have R, you can download it at the following links for Mac OS X or Windows. We also highly recommend the RStudio IDE.

Installation

You can install the package by typing the following in the R console.

install.packages("kpodclustr")

Documentation

After installation and loading the package in R with library(kpodclustr), you can browse the documentation for function usage and examples via the help pages with help(kpod).

The package manual and source files can also be found on CRAN.

Citation

To cite ${\tt kpodclustr}$ in publications, please use citation('kpodclustr') or the following.

Jocelyn T. Chi, Eric C. Chi, and Richard G. Baraniuk, “$k$-POD: A Method for $k$-Means Clustering of Missing Data,” The American Statistician, vol. 70, iss. 1, pp. 91-99, 2016.

A BibTeX entry for LaTeX users is

@ARTICLE{ChiChiBaraniuk2016,
  author = {Jocelyn T. Chi, Eric C. Chi, and Richard G. Baraniuk},
  title = {$k$-POD: A Method for $k$-Means Clustering of Missing Data},
  journal = {The American Statistician},
  year = {2016},
  volume = {70},
  pages = {91-99},
  doi = {10.1080/00031305.2015.1086685},
  issue = {1},
  url = {http://www.tandfonline.com/doi/abs/10.1080/00031305.2015.1086685?journalCode=utas20&}
}