Converting a complex PDF schedule table to JSON in Python, handling empty slots and multi-line names [closed]

I am trying to convert a PDF containing a university department's class schedule into JSON format using Python. The table in the PDF is similar to the one shown in the attached image. I have tried using simple text extraction and parsing techniques, but I am encountering two main problems:

Empty slots: The script is not correctly identifying and representing empty slots in the schedule.Multi-line teacher names: If a teacher's name spans multiple lines in the PDF, the script is not capturing the full name correctly.I am wondering if it is possible to achieve this conversion using basic Python libraries or any other language and techniques, or if I need to resort to training an AI model for more sophisticated text understanding.

Single Timetable in pdf looks like this:

single Timetable in pdf looks like this

I have already tried using libraries like PyPDF2 and pdfplumber for text extraction. I am open to using other libraries or tools if they can help solve the problems I am facing. The ultimate goal is to be able to process the JSON data and use it for further analysis or visualization. How to approach this task effectively?

Converting a complex PDF schedule table to JSON in Python, handling empty slots and multi-line names [closed]

Trending Articles

Teen Shot In Miami Drive-By Dies From Injuries

Police confirm man stabbed to death in Selsdon was Andrew David Else of Croydon

Charlotte de Witte – One Mind – EP [iTunes Plus M4A]

Obituaries for Friday, June 27, 2025

A/L Technology Stream – Subject combinations, Syllabuses and Teacher guides

Best Suvichar in Hindi |बेस्ट सुविचार |शुभ विचार हिंदी में

Re: My Sisters Plan For Me To Smell Her Feet (Fiction): Part 1,2,3 and 4!!!

Srinagar Kitty’s brother dies at 67 due to Covid-19

Re: How to fix error on printer HP Color LaserJet Pro MFP 3303 with event...

TAPERED BEAM DESIGN OUTPUT

Black Angus Grilled Artichokes

Download: Mirraj Malifah – Chance Yako (Prod by_Bicko @Musiqhouse)

D16 Group Phoscyon v1.9.5 Incl.Keygen WiN/MAC-R2R

Mace 6 FEN Loot 2.0 +60k pureped BO

Raj Panchayat 3rd / Third Grade Teacher Revised Result 2012 Level 1-2...

NFL Player’s Father Went Down In Feds’‘Operation FedEx’ Bust In Detroit Back...

Current scandal has roots in NPF saga

spreading clines

HResult: 0x80240033 Context: uecGeneral Msg: The license terms of one or more...

Procedure for conduct of supplementary DPC