I have contract clauses in Words (.docx) format that needs to be frequently copy and pasted into Excel (.xlsx) to be sent to the third party. The clauses are often updated hence there's always a need to copy and paste these clauses over. I only need to copy and paste all the paragraphs and images after the contents page. Here is a sample of the Clause document (https://drive.google.com/open?id=1ZzV29R6y2q0oU3HAVrqsFa158OhvpxEK).
I have tried doing up a code using Python to achieve this outcome. Here is the code that I have done so far: !pip install python-docx import docx import xlsxwriter document = docx.Document("Clauses Sample.docx") wb = xlsxwriter.Workbook('C:/xxxx/xxxxxx/xxxx/clauses sample.xlsx') docText = [] index_row = 0 Sheet1 = wb.add_worksheet("Sheetttt") for paragraph in document.paragraphs: if paragraph.text: docText.append(paragraph.text) xx = '\n'.join(docText) Sheet1.write(index_row,0, xx) index_row = index_row+1 wb.close() #print(xx) However, my Excel file output looks like this: I can't seem to paste pictures into this discussion so please see both my current and desired Excel output here: https://stackoverflow.com/questions/60800494/how-to-copy-paragraphs-with-number-formatting-and-images-from-words-docx-an -- https://mail.python.org/mailman/listinfo/python-list