Closes scrapy#6342. Setting METAREFRESH_IGNORE_TAGS to ['noscript'] by default
aisha-partha committed May 8, 2024
1 parent 4300a1d commit 180bc9b
Showing 3 changed files with 5 additions and 6 deletions.
4 changes: 2 additions & 2 deletions docs/topics/downloader-middleware.rst
@@ -876,13 +876,13 @@ Whether the Meta Refresh middleware will be enabled.
METAREFRESH_IGNORE_TAGS
^^^^^^^^^^^^^^^^^^^^^^^

-Default: ``[]``
+Default: ``['noscript']``

Meta tags within these tags are ignored.

 .. versionchanged:: 2.0
    The default value of :setting:`METAREFRESH_IGNORE_TAGS` changed from
-   ``['script', 'noscript']`` to ``[]``.
+   ``[]`` to ``['noscript']``.

.. setting:: METAREFRESH_MAXDELAY

2 changes: 1 addition & 1 deletion scrapy/settings/default_settings.py
@@ -239,7 +239,7 @@
MEMUSAGE_WARNING_MB = 0

METAREFRESH_ENABLED = True
-METAREFRESH_IGNORE_TAGS = []
+METAREFRESH_IGNORE_TAGS = ["noscript"]
METAREFRESH_MAXDELAY = 100

NEWSPIDER_MODULE = ""
5 changes: 2 additions & 3 deletions tests/test_downloadermiddleware_redirect.py
@@ -395,9 +395,8 @@ def test_ignore_tags_default(self):
             """content="0;URL='http://example.org/newpage'"></noscript>"""
         )
         rsp = HtmlResponse(req.url, body=body.encode())
-        req2 = self.mw.process_response(req, rsp, self.spider)
-        assert isinstance(req2, Request)
-        self.assertEqual(req2.url, "http://example.org/newpage")
+        response = self.mw.process_response(req, rsp, self.spider)
+        assert isinstance(response, Response)

     def test_ignore_tags_1_x_list(self):
         """Test that Scrapy 1.x behavior remains possible"""
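
The behavior change in this commit can be illustrated with a minimal standard-library sketch. It is not Scrapy's implementation: `MetaRefreshFinder` and `find_meta_refresh` are hypothetical names introduced here, and the parsing of the `content` attribute is deliberately simplified. It only shows the idea that a meta refresh nested inside an ignored tag (by default `noscript`) is skipped, while an empty ignore list reproduces the previous default.

```python
from html.parser import HTMLParser


class MetaRefreshFinder(HTMLParser):
    """Collect meta-refresh URLs, skipping any nested inside ignored tags.

    Hypothetical helper for illustration only; Scrapy does not use this class.
    """

    def __init__(self, ignore_tags=("noscript",)):
        super().__init__()
        self.ignore_tags = {tag.lower() for tag in ignore_tags}
        self.depth = 0  # number of ignored tags currently open
        self.targets = []

    def handle_starttag(self, tag, attrs):
        if tag in self.ignore_tags:
            self.depth += 1
            return
        if tag != "meta" or self.depth:
            return  # not a meta tag, or nested inside an ignored tag
        attrs = dict(attrs)
        if (attrs.get("http-equiv") or "").lower() != "refresh":
            return
        # Simplified parsing of content such as "0;URL='http://example.org/newpage'"
        _, _, url_part = (attrs.get("content") or "").partition(";")
        key, _, value = url_part.strip().partition("=")
        if key.strip().lower() == "url":
            self.targets.append(value.strip().strip("'\""))

    def handle_endtag(self, tag):
        if tag in self.ignore_tags and self.depth:
            self.depth -= 1


def find_meta_refresh(html, ignore_tags=("noscript",)):
    """Return the meta-refresh URLs found outside the ignored tags."""
    finder = MetaRefreshFinder(ignore_tags)
    finder.feed(html)
    finder.close()
    return finder.targets


body = (
    """<noscript><meta http-equiv="refresh" """
    """content="0;URL='http://example.org/newpage'"></noscript>"""
)
new_default = find_meta_refresh(body)                  # refresh inside <noscript> is ignored
old_default = find_meta_refresh(body, ignore_tags=())  # nothing ignored, refresh is found
print(new_default, old_default)
```

With the new default, the middleware returns the original response for this body instead of a redirect request, which is what the updated `test_ignore_tags_default` asserts. A project that depends on the previous behavior can set `METAREFRESH_IGNORE_TAGS = []` in its own settings.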