xref https://github.com/pandas-dev/pandas/pull/14859
- [ ]
.value_counts()
- [ ]
.duplicated()
Comment From: jbrockmendel
@mroeschke do we already do this for string[python]? i assume string[pyarrow] has a separate more-performant implementation?
Comment From: mroeschke
From a quick skim of pandas/_libs/hashtable_func_helper.pxi.in
looks like we don't use the StringHashTable
yet, still the object one.
Yeah the string[pyarrow] one we want to avoid going through here I imagine