BUG : Fix Excel header NaN duplication with merged MultiIndex columns in to_excel #62576

justine202429 · 2025-10-04T11:28:17Z

closes BUG: NaN categorical in multi-level column gets replaced in to_excel output #62340
Tests added and passed if fixing a bug or adding a new feature
All code checks passed.
Added type annotations to new arguments/methods/functions.
Added an entry in the latest doc/source/whatsnew/vX.X.X.rst file if fixing a bug or adding a new feature.

Alvaro-Kothe

It needs tests and an entry in whatsnew.

Alvaro-Kothe · 2025-10-19T15:54:29Z

pandas/tests/io/excel/test_writers.py

+        with ExcelWriter("test.xlsx", engine="openpyxl") as writer:
+            df.to_excel(writer, sheet_name="Sheet1", merge_cells=merge_cells)
+
+        reader = ExcelFile("test.xlsx")


Don't read and write to CWD, either use a temporary file, or use an in-memory buffer.

Alvaro-Kothe · 2025-10-19T15:56:13Z

pandas/tests/io/excel/test_writers.py

        DummyClass.assert_called_and_reset()

+    @td.skip_if_no("openpyxl")
+    def test_to_excel_multiindex_nan_in_columns(self, merge_cells):


Test all excel engines.

Alvaro-Kothe · 2025-10-19T15:58:07Z

pandas/tests/io/excel/test_writers.py

+        result = pd.read_excel(reader, index_col=0, header=[0, 1])
+
+        original_values = df.to_numpy()
+        result_values = result.to_numpy()
+        tm.assert_numpy_array_equal(original_values, result_values)


Create an expected DataFrame and use tm.assert_frame_equal

mathbruu · 2025-11-18T08:16:27Z

pre-commit.ci autofix

mathbruu · 2025-11-24T09:54:58Z

done

Alvaro-Kothe · 2025-11-25T01:09:11Z

pandas/tests/io/excel/test_writers.py

+        with ExcelFile(tmp_excel) as reader:
+            result = pd.read_excel(reader, index_col=0, header=[0, 1])
+
+        tm.assert_numpy_array_equal(result.to_numpy(), df.to_numpy())


This doesn't test the header.

The test validates that data survives the Excel round-trip. NaN in headers are written correctly (verified with openpyxl) but cannot be read back due to Excel treating empty cells as blanks. This is an Excel limitation, not a code bug.

Alvaro-Kothe · 2025-11-25T01:14:37Z

pandas/tests/io/excel/test_writers.py

            buf = BytesIO()
            df.to_excel(buf)

+    def test_to_excel_multiindex_nan_in_columns(self, merge_cells, tmp_excel):


This test passes on main

I don't understand is it not supposed to passed ?

The tests are expected to fail on main without the patch. If they’re passing, it means the bug isn’t actually being reproduced, so you are not truly verifying the fix.

I’m a bit confused: this test case doesn’t exist on main at all.
I only created it in this branch, so I don’t understand how it could be “passing on main.”
Is there something I’m missing?

If you remove your patch with

git restore --source=upstream/main -- pandas/io/formats/excel.py

and run the tests with

pytest pandas/tests/io/excel/test_writers.py

The test that you created still passes. Hence, it's not testing your fix.

GH#62340: Use original column values (with NaN) instead of NBSP-filled values when writing MultiIndex headers to Excel. - Modify _format_header_mi() to use columns.get_level_values() to get the original column values with NaN preserved - Add test to verify MultiIndex structure and data integrity are preserved during Excel round-trip - Note: read_excel() limitation means NaN in headers become empty cells in Excel and cannot be reconstructed on read, but data values are correctly preserved

…stency

justine202429 force-pushed the bugfix-Issue62340 branch from 6703b44 to a5784f5 Compare October 4, 2025 11:38

mathbruu force-pushed the bugfix-Issue62340 branch from a5784f5 to b34c5d6 Compare October 9, 2025 09:04

justine202429 force-pushed the bugfix-Issue62340 branch from af60451 to a85715d Compare October 13, 2025 08:32

Fix Excel header NaN

5f70c16

justine202429 force-pushed the bugfix-Issue62340 branch from a85715d to 5f70c16 Compare October 13, 2025 08:50

justine202429 marked this pull request as ready for review October 13, 2025 09:26

Alvaro-Kothe reviewed Oct 14, 2025

View reviewed changes

Alvaro-Kothe added the IO Excel read_excel, to_excel label Oct 14, 2025

mathbruu force-pushed the bugfix-Issue62340 branch from fbdd7dc to 1eaf805 Compare October 15, 2025 11:40

add test and update whatsnews

1b5fe97

mathbruu force-pushed the bugfix-Issue62340 branch from 1eaf805 to 1b5fe97 Compare October 15, 2025 11:49

Alvaro-Kothe suggested changes Oct 19, 2025

View reviewed changes

update test

77422bf

mathbruu force-pushed the bugfix-Issue62340 branch from 2dad8cc to 77422bf Compare November 18, 2025 08:28

mathbruu and others added 2 commits November 24, 2025 08:50

Merge branch 'main' into bugfix-Issue62340

4d2b7af

format file

04cd920

Alvaro-Kothe reviewed Nov 25, 2025

View reviewed changes

justine202429 force-pushed the bugfix-Issue62340 branch from 6b9525e to 9b84372 Compare December 1, 2025 10:38

Justine Kosinski added 2 commits December 1, 2025 11:45

STYLE: Fix whitespace in ExcelFormatter and TestExcelWriter for consi…

0a277de

…stency

fix format

b7f867a

Uh oh!

BUG : Fix Excel header NaN duplication with merged MultiIndex columns in to_excel #62576

Are you sure you want to change the base?

BUG : Fix Excel header NaN duplication with merged MultiIndex columns in to_excel #62576

Uh oh!

Conversation

justine202429 commented Oct 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Alvaro-Kothe left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mathbruu commented Nov 18, 2025

Uh oh!

mathbruu commented Nov 24, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

justine202429 commented Oct 4, 2025 •

edited

Loading