Selenium vs. Playwright

Sat, 05 Feb 2022 17:31:35 -0500

Recently I developed a Python program that scrapes a single-page application with Selenium, and then reimplemented it with Playwright. This gave me firsthand knowledge of the differences between the two. In this article, I explain why I prefer Playwright to Selenium and share some general experience of scraping single-page applications.

The reason for and result of the reimplementation

My scraping program needs to click an anchor element on the target webpage to trigger a file download in the browser, wait for the download to finish, and then do something with the downloaded file. With Selenium, surprising as it may seem, waiting for a file download to finish is not straightforward. According to its official documentation:
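
To make the difficulty concrete, here is a minimal sketch of one workaround that is often used with Selenium. It is not a quote from the Selenium documentation and not necessarily what my program did. The idea is to record the contents of the download directory before the click, then poll the directory until a new file appears that no longer carries a temporary suffix. The function names, the timeout values, and the suffix list (`.crdownload` for Chrome, `.part` for Firefox) are assumptions made for illustration.

```python
import time
from pathlib import Path

# Assumption: in-progress downloads carry one of these temporary suffixes
# (".crdownload" in Chrome, ".part" in Firefox).
PARTIAL_SUFFIXES = (".crdownload", ".part")


def snapshot(download_dir):
    """Return the set of file names currently present in download_dir."""
    return {p.name for p in Path(download_dir).iterdir()}


def wait_for_download(download_dir, existing, timeout=30.0, poll_interval=0.5):
    """Poll download_dir until a new, finished file shows up.

    `existing` is the snapshot taken before the click that triggers the
    download. Returns the path of the new file, or raises TimeoutError.
    """
    deadline = time.monotonic() + timeout
    while True:
        new_names = snapshot(download_dir) - existing
        in_progress = [n for n in new_names if n.endswith(PARTIAL_SUFFIXES)]
        finished = sorted(n for n in new_names if not n.endswith(PARTIAL_SUFFIXES))
        # Done only when a new file exists and no partial file is left over.
        if finished and not in_progress:
            return Path(download_dir) / finished[0]
        if time.monotonic() >= deadline:
            raise TimeoutError(f"no finished download in {download_dir}")
        time.sleep(poll_interval)
```

In a scraper, one would call `snapshot()` just before clicking the anchor and `wait_for_download()` right after. The sketch relies on browser-specific file naming and on the download directory being known in advance, which is part of why this approach feels fragile.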
