0x43 / eBookHub-backend / Issues / #23

Created May 02, 2019 by Patrick van der Leer (@pat.vdleer, Owner)

Non-ASCII character in XML from external source

Value in the XML from Goodreads: Ausz\xc3\xbcge (the bytes \xc3\xbc are the UTF-8 encoding of "ü", so the value is "Auszüge")
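
To make the byte sequence above concrete, here is a small sketch of how it decodes. The variable names are only illustrative; nothing here is taken from the eBookHub code.

```python
raw = b"Ausz\xc3\xbcge"  # byte sequence quoted in this issue

# Decoded as UTF-8, the two bytes \xc3\xbc form a single character (u-umlaut).
as_utf8 = raw.decode("utf-8")

# Decoded as Latin-1, each byte becomes its own character (mojibake).
as_latin1 = raw.decode("latin-1")

print(len(as_utf8), len(as_latin1))  # 7 8
```

If the value shows up one character longer than expected somewhere in the pipeline, that would hint at a decode with the wrong charset rather than a problem in the data itself.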

xml.parsers.expat.ExpatError: not well-formed (invalid token): line 41, column 1158
Job "BlackHole processor (trigger: interval[0:01:00], next run at: 2019-05-02 15:59:10 CEST)" raised an exception
Traceback (most recent call last):
  File "/home/patrick/PycharmProjects/ebookhub-backend/venv/lib/python3.6/site-packages/apscheduler/executors/base.py", line 125, in run_job
    retval = job.func(*job.args, **job.kwargs)
  File "/home/patrick/PycharmProjects/ebookhub-backend/eBookHub/jobs/blackhole.py", line 157, in process_blackhole_job
    list(process_blackhole(os.path.abspath(blackhole_path)))
  File "/home/patrick/PycharmProjects/ebookhub-backend/eBookHub/jobs/blackhole.py", line 143, in process_blackhole
    saved, new, book_edition_file = handle(blackhole_path, file)
  File "/home/patrick/PycharmProjects/ebookhub-backend/eBookHub/jobs/blackhole.py", line 130, in handle
    resultset = process_file(filename)
  File "/home/patrick/PycharmProjects/ebookhub-backend/eBookHub/jobs/blackhole.py", line 27, in process_file
    result = source.parse(book_data_raw['title'], book_data_raw['authors'])
  File "/home/patrick/PycharmProjects/ebookhub-backend/eBookHub/source/abstract.py", line 58, in parse
    return self.search_title_and_author(title, authors)
  File "/home/patrick/PycharmProjects/ebookhub-backend/venv/lib/python3.6/site-packages/flask_caching/__init__.py", line 769, in decorated_function
    rv = f(*args, **kwargs)
  File "/home/patrick/PycharmProjects/ebookhub-backend/eBookHub/source/goodreads.py", line 40, in search_title_and_author
    return self.parse_search_result(self.client.search_book("{} - {}".format(title, author)))
  File "/home/patrick/PycharmProjects/ebookhub-backend/eBookHub/source/goodreads.py", line 51, in parse_search_result
    result.append(self.book_id(entry['id']["#text"]))
  File "/home/patrick/PycharmProjects/ebookhub-backend/venv/lib/python3.6/site-packages/flask_caching/__init__.py", line 769, in decorated_function
    rv = f(*args, **kwargs)
  File "/home/patrick/PycharmProjects/ebookhub-backend/eBookHub/source/goodreads.py", line 24, in book_id
    return self.transformer.convert_book(self.client.Book.show(eid))
  File "/home/patrick/PycharmProjects/ebookhub-backend/venv/lib/python3.6/site-packages/goodreads_api_client/resources/book.py", line 31, in show
    return self._show_single_resource(id_)
  File "/home/patrick/PycharmProjects/ebookhub-backend/venv/lib/python3.6/site-packages/goodreads_api_client/resources/base.py", line 18, in _show_single_resource
    res = self._transport.req(endpoint=endpoint, uses_oauth=uses_oauth)
  File "/home/patrick/PycharmProjects/ebookhub-backend/venv/lib/python3.6/site-packages/goodreads_api_client/transport.py", line 152, in req
    return Transport._transform_res(res, transform)
  File "/home/patrick/PycharmProjects/ebookhub-backend/venv/lib/python3.6/site-packages/goodreads_api_client/transport.py", line 140, in _transform_res
    content = xmltodict.parse(res.text)
  File "/home/patrick/PycharmProjects/ebookhub-backend/venv/lib/python3.6/site-packages/xmltodict.py", line 330, in parse
    parser.Parse(xml_input, True)
xml.parsers.expat.ExpatError: not well-formed (invalid token): line 6, column 382
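
The ExpatError above only gives a line and column, so it may help to look at the bytes around that position before concluding that the non-ASCII value is the culprit. Below is a minimal diagnostic sketch using only the standard library. The function name `describe_xml_error` and the `context` parameter are hypothetical and not part of eBookHub, xmltodict or goodreads_api_client.

```python
from typing import Optional
from xml.parsers import expat


def describe_xml_error(xml_bytes: bytes, context: int = 20) -> Optional[str]:
    """Return None if xml_bytes is well-formed, else the error plus nearby bytes."""
    # Without an explicit encoding, expat uses the XML declaration (UTF-8 by default).
    parser = expat.ParserCreate()
    try:
        parser.Parse(xml_bytes, True)
    except expat.ExpatError as exc:
        # exc.lineno is 1-based and exc.offset is the column within that line.
        # Slicing bytes by column is only approximate when the line contains
        # multi-byte characters before the failing position.
        lines = xml_bytes.split(b"\n")
        line = lines[min(exc.lineno - 1, len(lines) - 1)]
        start = max(exc.offset - context, 0)
        snippet = line[start:exc.offset + context]
        return "{} near {!r}".format(exc, snippet)
    return None


# Valid UTF-8: parses cleanly.
print(describe_xml_error(b"<a>Ausz\xc3\xbcge</a>"))  # None

# The same word as a single Latin-1 byte is not valid UTF-8, so expat rejects it.
print(describe_xml_error(b"<a>Ausz\xfcge</a>"))
```

Running this against the raw bytes of the failing Goodreads response (rather than the already decoded text) would show what actually sits at the reported column. Note that the correctly encoded UTF-8 value parses without error here, so the failure may come from a different byte or from how the response was decoded.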
Edited May 02, 2019 by Patrick van der Leer