The option was added in https://github.com/pandas-dev/pandas/pull/50750 to be ready for 2.0. We should do the dtype inference before actually creating the DataFrame/Series, this should improve performance
Comment From: kostyafarber
Would be interesting in working on this. What would be a good start?
Comment From: phofl
The main idea is the following: We create numpy-arrays before converting them to a DataFrame. Right now we are converting to extension arrays after creating the DataFrame. Ideally, we want to convert to Extension Arrays before even creating the DataFrame
Comment From: kostyafarber
Will this need tests? or are the tests you did in https://github.com/pandas-dev/pandas/pull/50750 enough?
Comment From: phofl
Probably enough except if you discover things that are not covered