Pandas version checks

  • [X] I have checked that this issue has not already been reported.

  • [X] I have confirmed this bug exists on the latest version of pandas.

  • [ ] I have confirmed this bug exists on the main branch of pandas.

Reproducible Example

import pandas as pd
from io import StringIO

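# The second row has malformed quoting and a trailing delimiter.
csv = 'col1,col2\n""malformed","quotes"",'
buffer = StringIO(csv)

# The same error occurs with on_bad_lines="warn".
pd.read_csv(buffer, on_bad_lines="skip")

# Raises:
# ParserError: Error tokenizing data. C error: EOF inside string starting at row 1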

Issue Description

The combination of

  • a row with malformed quoting and a delimiter hidden between the quotes
  • doublequote=True (the default)
  • on_bad_lines="skip" (or "warn")

raises a ParserError instead of skipping the malformed row.
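
For illustration only, here is a minimal sketch of how a doublequote-aware parser walks over the same payload. It uses Python's standard-library csv module, not pandas' C tokenizer, so it is an analogy under the assumption that both follow similar quoting rules. The helper name parse is hypothetical. In lenient mode the trailing delimiter ends up inside the last quoted field, which is never closed before the input ends; in strict mode the reader rejects the row.

```python
import csv
from io import StringIO

# Same payload as in the reproducible example.
data = 'col1,col2\n""malformed","quotes"",'


def parse(text, strict):
    """Parse `text` with the stdlib csv reader.

    Returns (rows parsed so far, error message or None).
    """
    rows = []
    try:
        for row in csv.reader(StringIO(text), doublequote=True, strict=strict):
            rows.append(row)
    except csv.Error as exc:
        return rows, str(exc)
    return rows, None


# Lenient mode: the last field swallows the trailing delimiter because
# the doubled quote ("") is read as an escaped quote, not a closing one.
lenient_rows, lenient_error = parse(data, strict=False)
print(lenient_rows)

# Strict mode: the malformed row is rejected with a csv.Error.
strict_rows, strict_error = parse(data, strict=True)
print(strict_rows, strict_error)
```

The point of the sketch is only that the last field is still "open" when the input ends, which is consistent with the "EOF inside string" message above. Whether pandas reaches that state by the same path is an assumption.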

Expected Behavior

The reader skips the malformed row and reads the rest of the CSV successfully.

Installed Versions

commit           : 4bfe3d07b4858144c219b9346329027024102ab6
python           : 3.8.10.final.0
python-bits      : 64
OS               : Linux
OS-release       : 5.4.0-89-generic
Version          : #100-Ubuntu SMP Fri Sep 24 14:50:10 UTC 2021
machine          : x86_64
processor        : x86_64
byteorder        : little
LC_ALL           : None
LANG             : pl_PL.UTF-8
LOCALE           : pl_PL.UTF-8

pandas           : 1.4.2
numpy            : 1.22.3
pytz             : 2022.1
dateutil         : 2.8.2
pip              : 20.0.2
setuptools       : 45.2.0
Cython           : None
pytest           : None
hypothesis       : None
sphinx           : None
blosc            : None
feather          : None
xlsxwriter       : None
lxml.etree       : None
html5lib         : None
pymysql          : None
psycopg2         : None
jinja2           : None
IPython          : 8.3.0
pandas_datareader: None
bs4              : 4.8.2
bottleneck       : None
brotli           : None
fastparquet      : None
fsspec           : None
gcsfs            : None
markupsafe       : 1.1.0
matplotlib       : None
numba            : None
numexpr          : None
odfpy            : None
openpyxl         : None
pandas_gbq       : None
pyarrow          : None
pyreadstat       : None
pyxlsb           : None
s3fs             : None
scipy            : None
snappy           : None
sqlalchemy       : None
tables           : None
tabulate         : None
xarray           : None
xlrd             : None
xlwt             : None
zstandard        : None