Author/Maintainer : Ananda Mahto Email : ananda@mahto.info URL : http://github.com/mrdwab/splitstackshape BugReports : http://github.com/mrdwab/splitstackshape/issues
Updated to pass CRAN tests due to changes in the RNG sample kinds.
22 July 2018
Interim release to help the data.table team with reverse
dependency checks.
05 April 2018
Preparing for transition to V2 of the splitstackshape package.
.Deprecated(). These include
concat.split.multiple and
concat.split.compact, both of which can now just directly
use cSplit.cSplit_f has been removed as it would no longer be
relevant in V2 of the package and isn’t entirely reliable the way it has
been written. fread, on which the function was based, has
underwent many changes since the function was written.Tests have been added covering most basic cases, but not for all potential bugs that have been fixed in V2 of the package.
stratified has been fixed.cSplit_f has been removed.29 March 2018
Reshape() bugfix. Reported at https://stackoverflow.com/q/49281838/1270695.listCol_w() bugfix. Thanks to @jazzurro.cSplit_e() bugfix. Reported at https://stackoverflow.com/q/48576331/127069520 March 2018
23 October 2014
listCol_l and listCol_w added as utilities
for unlisting or flattening columns stored as lists in
data.frames and data.tables.18 October 2014
Bug in :::.stripWhite when using "|" as a
delimiter fixed.
13 October 2014
See 1.3.0 – 1.3.8 for details of changes.
cSplit now replaces concat.split.compact
and concat.split.multiple in concat.split;
cSplit_f has been introduced as a related function. Other
new functions are stratified and
expandRows.
12 October 2014
cSplit_f
The “_f” is both representative of fread, which this
function uses to split the concatenated cells, and “fixed”, which is
indicative of the fact that this function would only work if the number
of resulting columns is the same for each row in the input.
expandRows
“Expand” the rows of a data.frame or a
data.table either by values specified in a column of the
input dataset or by a vector specifying the number of times to repeat
each row.
Reshape, Stacked, and
merged.stack now try to guess the “id.vars”
values based on the values in “var.stubs”. The values can
still be specified manually.08/10 October 2014
Incremental cleanups and additions to get ready for V1.4.0.
concat.split.compact and
concat.split.multiple are now simply wrappers for
cSplit and no longer use :::read.concat to
split up the values.concat.split.expanded and
concat.split.list now made data.table
compatable.concat.split.list and
concat.split.expanded given short name forms
(cSplit_l and cSplit_e).Added functions:
cSplit
Before the release of 1.4.0, the basic concat.split*
functions would become simple wrappers for cSplit, which is
much more efficient than the previous implementations. The earlier
functions will remain for compatability purposes. Since
cSplit is already in use, it will be an exported
function.
stratified
A function to take fixed or proportional samples by group from a
data.frame or data.table.
Non-exported additions:
:::.collapseMe:::.stripWhite:::Names:::trim:::vGrep27 October 2013
numMat and charMat,
concat.split.expanded and concat.split now
have an additional argument, type, which takes a value of
either "numeric" or "character". It is set to
a default of type = "numeric" in the case of
concat.split.expanded and type = NULL in the
case of concat.split.Added functions:
:::numMat
numMat replaces binaryMat and
valueMat for numeric data.
:::charMat
charMat replaces charBinaryMat for string
data.
Dropped functions:
Due to changes introduced after recommendations by @flodel, the following
functions have been rewritten as numMat and
charMat
20 October 2013
New function added:
:::charBinaryMat
concat.split.expanded did not previously support
expanding “character” data. Due to prompting by @juba, charBinaryMat has been
included to handle such cases.
27 August 2013
Stacked and
merge.stack. merge.stack is now faster than
Reshape, at least for large datasets.18 August 2013
Stacked and merge.stack now made MUCH
faster using almost a pure data.table solution.17 August 2013
Stacked results in a list of length 1, it is
“unlisted” before being returned.Reshape (and as a result,
concat.split.multiple(..., direction = "long")) has been
enhanced by the addition of a feature to automatically add an ID
variable if the present “IDs” are not unique.New functions added:
16 August 2013
read.concat updated to use count.fields to
determine the correct number of columns that the resulting
data.frame should have.Reshape now has an option to remove the
rownames from the output, set to TRUE by
default.12 August 2013
Initial commit of splitstacshape with the following main functions:
concat.split (plus: concat.split.compact,
concat.split.expanded, concat.split.list, and
concat.split.multiple) – To split concatenated data into
more manageable data formats.Reshape – To help base R’s reshape function handle
unbalanced data and simplify the reshape syntax (wide to long
only).Stacked – To selectively stack columns of a
data.frame.Non-exported functions are indicated with ::: before
their names.
concat.split.compactconcat.split.expandedconcat.split.listconcat.split.multipleconcat.splitmerged.stackReshapeStacked:::binaryMat:::FacsToChars:::NoSep:::othernames:::read.concat:::valueMat