KeyError when calling to_coo() on SparseDataFrame #18414

ediphy-azorab · 2017-11-21T17:20:31Z

Code Sample, a copy-pastable example if possible

In [45]: t_df = idx = pd.Int64Index([2,3,4])
    ...: t_df = pd.DataFrame(data=0, columns=idx, index=idx)
    ...: t_df.apply(pd.SparseArray).sparse.to_coo() # This line blows up in find_common_types

Problem description

Currently, a KeyError is raised while trying to get the first column type (cast.py#1070).

Expected Output

A (very) sparse matrix

Output of `pd.show_versions()`

INSTALLED VERSIONS ------------------ commit: None python: 3.6.3.final.0 python-bits: 64 OS: Linux OS-release: 4.10.0-38-generic machine: x86_64 processor: byteorder: little LC_ALL: None LANG: C.UTF-8 LOCALE: en_US.UTF-8

pandas: 0.21.0
pytest: None
pip: 10.0.0.subpip_fix
setuptools: 36.5.0
Cython: None
numpy: 1.13.3
scipy: 1.0.0
pyarrow: None
xarray: None
IPython: 6.2.1
sphinx: None
patsy: None
dateutil: 2.6.1
pytz: 2017.3
blosc: None
bottleneck: None
tables: None
numexpr: None
feather: None
matplotlib: 2.1.0
openpyxl: None
xlrd: None
xlwt: None
xlsxwriter: None
lxml: 4.1.0
bs4: 4.6.0
html5lib: 1.0b10
sqlalchemy: None
pymysql: None
psycopg2: None
jinja2: 2.9.6
s3fs: None
fastparquet: None
pandas_gbq: None
pandas_datareader: 0.5.0

The text was updated successfully, but these errors were encountered:

mukherjees · 2018-05-03T14:04:18Z

Same issue exists with Pandas 0.22.0 and Python 3.6.4.

h4ste · 2018-07-23T20:57:21Z

Exists with Pandas 0.23.3 and Python 3.6.6

INSTALLED VERSIONS

commit: None
python: 3.6.6.final.0
python-bits: 64
OS: Linux
OS-release: 4.4.0-128-generic
machine: x86_64
processor: x86_64
byteorder: little
LC_ALL: None
LANG: en_US.UTF-8
LOCALE: en_US.UTF-8

pandas: 0.23.3
pytest: None
pip: 10.0.1
setuptools: 39.1.0
Cython: None
numpy: 1.14.5
scipy: 1.1.0
pyarrow: None
xarray: None
IPython: 6.4.0
sphinx: None
patsy: 0.5.0
dateutil: 2.7.3
pytz: 2018.5
blosc: None
bottleneck: None
tables: None
numexpr: None
feather: None
matplotlib: 2.2.2
openpyxl: None
xlrd: None
xlwt: None
xlsxwriter: None
lxml: None
bs4: None
html5lib: 1.0.1
sqlalchemy: None
pymysql: None
psycopg2: None
jinja2: 2.10
s3fs: None
fastparquet: None
pandas_gbq: None
pandas_datareader: None

jorisvandenbossche · 2019-05-29T07:44:42Z

We should be using positional indexing here (so types.iloc[0] instead of types[0])

TomAugspurger · 2019-09-17T21:04:27Z

Still applies to DataFrame[sparse]. Updated the original post.

khalludi · 2020-02-07T03:35:04Z

take

jbrockmendel added the Sparse Sparse Data Type label Jul 30, 2018

jorisvandenbossche added Bug good first issue labels May 29, 2019

jorisvandenbossche added this to the 0.25.0 milestone May 29, 2019

jorisvandenbossche modified the milestones: 0.25.0, Contributions Welcome May 29, 2019

TomAugspurger mentioned this issue Sep 16, 2019

Remove SparseSeries and SparseDataFrame #28425

Merged

github-actions bot assigned khalludi Feb 7, 2020

khalludi added a commit to khalludi/pandas that referenced this issue Feb 7, 2020

BUG: Fixes index bug in cast.py for issue pandas-dev#18414

4a3e915

SpectrumWings mentioned this issue Mar 7, 2020

Find open issues CSCD01/team_24-project#1

Closed

3 tasks

mzeitlin11 mentioned this issue Dec 18, 2020

BUG: .sparse.to_coo() with numeric col index without a 0 #38567

Merged

5 tasks

jreback modified the milestones: Contributions Welcome, 1.3 Dec 22, 2020

jreback closed this as completed in #38567 Dec 22, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

KeyError when calling to_coo() on SparseDataFrame #18414

KeyError when calling to_coo() on SparseDataFrame #18414

ediphy-azorab commented Nov 21, 2017 •

edited by TomAugspurger

Loading

mukherjees commented May 3, 2018

h4ste commented Jul 23, 2018 •

edited

Loading

jorisvandenbossche commented May 29, 2019

TomAugspurger commented Sep 17, 2019

khalludi commented Feb 7, 2020

KeyError when calling to_coo() on SparseDataFrame #18414

KeyError when calling to_coo() on SparseDataFrame #18414

Comments

ediphy-azorab commented Nov 21, 2017 • edited by TomAugspurger Loading

Code Sample, a copy-pastable example if possible

Problem description

Expected Output

Output of pd.show_versions()

mukherjees commented May 3, 2018

h4ste commented Jul 23, 2018 • edited Loading

INSTALLED VERSIONS

jorisvandenbossche commented May 29, 2019

TomAugspurger commented Sep 17, 2019

khalludi commented Feb 7, 2020

ediphy-azorab commented Nov 21, 2017 •

edited by TomAugspurger

Loading

Output of `pd.show_versions()`

h4ste commented Jul 23, 2018 •

edited

Loading