- [ ] check that plot does the right thing, i.e. ordering of the categories
- [ ] Make a `utils.testing.makeCategorical()`: `Series(np.array(list('abcdefghij')).take(np.random.randint(0,10,size=len(df)))).astype('category')` + NaN + type + size + unique kword?
- [ ] check for `self.values.xxx` in `Series` and check that the methods using such calls work when values is a `Categorical`. See https://github.com/pydata/pandas/pull/7447
- [ ] Apply and Categoricals: https://github.com/pydata/pandas/issues/8065
- [ ] Categorical.searchsorted() #8420
- [x] docs: "[...] number of possible values (categories; -categories-+levels+ in R). " https://github.com/pydata/pandas/pull/8561
- [x] move assert_categorical_equal to testing.py and fix "categories" failure msg
- [x] check whatsnew: In the note change "levels" to "categories" and s/Schultz/Schulz/
- [x] docstring in #8077 (comment)
- [x] add a link to "comparison with R" section from http://pandas-docs.github.io/pandas-docs-travis/comparison_with_r.html
- [x] check that the memory info is doing the right thing for categorical (https://github.com/pydata/pandas/pull/7619)
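
For the `makeCategorical()` item above, a minimal sketch of what such a test helper might look like. The name `make_categorical` and the parameters `size`, `categories`, `nan_frac` and `seed` are assumptions made for illustration, not an existing pandas API; it only extends the one-liner from the checklist with optional NaNs and a configurable size.

```python
import numpy as np
import pandas as pd


def make_categorical(size=10, categories=tuple("abcdefghij"), nan_frac=0.0, seed=None):
    """Hypothetical test helper: build a random categorical Series.

    size       -- number of values to generate
    categories -- the allowed categories (all are kept, even if unused)
    nan_frac   -- approximate fraction of values replaced by NaN
    seed       -- seed for reproducible output
    """
    rng = np.random.RandomState(seed)
    cats = np.array(categories, dtype=object)
    # Draw random positions and map them to category labels.
    values = cats.take(rng.randint(0, len(cats), size=size))
    if nan_frac > 0:
        # Knock out a random subset of values to exercise missing-data paths.
        values[rng.rand(size) < nan_frac] = np.nan
    return pd.Series(pd.Categorical(values, categories=list(categories)))


s = make_categorical(size=50, nan_frac=0.2, seed=0)
```

Passing `categories` explicitly to `pd.Categorical` keeps the full category set fixed regardless of which labels happen to be drawn, which is usually what a test fixture wants.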
Comment From: jreback
https://github.com/ContinuumIO/dynd-python/blob/master/docs/source/type.rst
@JanSchulz FYI ndt is libdynd, essentially a modern version of numpy, supporting among other things: categorical, missing values in int, var len strings
going to integrate some of this for 0.16 (it coexists with numpy)
Comment From: jankatins
IMO this Bug can be set to 0.15.1 or later, the more urgent ones are done in the individual PRs.
Comment From: jreback
why don't u close this one and roll a new issue for the remaining items
Comment From: jreback
@JanSchulz what's left on this one?
Comment From: jankatins
IMO the rest of the ToDos from above can wait and categorical is ready for a release if nothing new pops up. I'm away from my computer for the next week, so hopefully that won't happen...
Comment From: jreback
@JanSchulz optimistic :) hahah
Comment From: jankatins
"move assert_categorical_equal to testing.py" is done in https://github.com/pydata/pandas/commit/56dbb8cc0cf810bdf9af472f16f6f14c5393e8ee
Nice :-) Thanks to @immerrr
Comment From: shoyer
Just read the v0.15.0 release notes and noticed that they still use "levels" in a couple of spots instead of "categories" :(.
see that third warning message: http://pandas.pydata.org/pandas-docs/stable/whatsnew.html#v0-15-0-october-18-2014
Comment From: jreback
@jorisvandenbossche can you address @shoyer comment when you repush the docs? thxs
@shoyer joris is rebuilding the docs (and repushing) anyhow...thxs
Comment From: shoyer
also see the second bullet here: http://pandas.pydata.org/pandas-docs/stable/whatsnew.html#categoricals-in-series-dataframe
Comment From: jorisvandenbossche
@shoyer thanks for noting! I added fixes for them in #8586, and also added a bullet point for the rename levels -> categories (which was actually missing as far as I could see)
Comment From: TomAugspurger
Most of these have been addressed / aren't a priority now.