PlaywrightCrawler error_handler cannot access Page object #1482

In the JavaScript version, the error handler can access the `Page` object via `PlaywrightCrawlingContext.page`. While porting the `ContextPipeline` to JavaScript, I discovered that the Python version doesn't support this.

### Test case

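```py
from unittest import mock

from yarl import URL

from crawlee.crawlers import BasicCrawlingContext, PlaywrightCrawler, PlaywrightCrawlingContext

# `server_url` and `HELLO_WORLD` are assumed to come from the project's test fixtures and helpers.


async def test_error_handler_can_access_page(server_url: URL) -> None:
    crawler = PlaywrightCrawler(max_request_retries=2)

    # The request handler always fails, so every attempt goes through the error handler.
    request_handler = mock.AsyncMock(side_effect=RuntimeError('Intentional crash'))
    crawler.router.default_handler(request_handler)

    error_handler_calls: list[str | None] = []

    @crawler.error_handler
    async def error_handler(context: BasicCrawlingContext | PlaywrightCrawlingContext, _error: Exception) -> None:
        # Record the page content if the page is reachable, otherwise `None`.
        error_handler_calls.append(
            await context.page.content() if isinstance(context, PlaywrightCrawlingContext) else None
        )

    await crawler.run([str(server_url / 'hello-world')])

    assert error_handler_calls == [HELLO_WORLD, HELLO_WORLD, HELLO_WORLD]
```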

### Possible solutions

1. Run the error handlers before the cleanup step of the context pipeline.
    - This is a fairly big change, and we probably want to do it after #1474.
    - Changing this in the adaptive Playwright crawler will be especially tricky.
2. Add a "deferred cleanup" step to the context pipeline and call _that_ after the error handlers are done.
    - It's unclear how this would fit into the current async-generator-based middleware model.
    - Considerable refactoring of `_run_request_handler` and `__run_task_function` would still be necessary: error handlers are called by the latter, while the context pipeline is only handled in the former.
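
To make the ordering problem concrete, here is a minimal, self-contained sketch of option 1. It does not use Crawlee's real internals: `FakePage`, `page_middleware`, and `run_pipeline` are hypothetical stand-ins, and the only assumption carried over from above is that middleware is an async generator whose cleanup runs after its `yield`.

```python
import asyncio
from collections.abc import AsyncGenerator, Awaitable, Callable
from typing import Optional


class FakePage:
    """Hypothetical stand-in for a Playwright page (not the real API)."""

    def __init__(self) -> None:
        self.closed = False

    async def content(self) -> str:
        if self.closed:
            raise RuntimeError('page is closed')
        return '<html>hello</html>'


async def page_middleware(page: FakePage) -> AsyncGenerator[FakePage, None]:
    """Async-generator middleware: setup before `yield`, cleanup after it."""
    try:
        yield page
    finally:
        page.closed = True  # cleanup step of the (toy) pipeline


Handler = Callable[[FakePage], Awaitable[None]]


async def run_pipeline(
    page: FakePage,
    request_handler: Handler,
    error_handler: Optional[Handler] = None,
) -> None:
    """Run the handler inside the middleware; optionally handle errors before cleanup."""
    middleware = page_middleware(page)
    context = await middleware.__anext__()
    try:
        await request_handler(context)
    except Exception:
        if error_handler is not None:
            # Option 1: the error handler runs while the page is still open.
            await error_handler(context)
        raise
    finally:
        # Closing the generator triggers its `finally` block, i.e. the cleanup.
        await middleware.aclose()


async def crashing_handler(_page: FakePage) -> None:
    raise RuntimeError('Intentional crash')


async def current_order() -> str:
    """Error handling happens outside the pipeline, after cleanup."""
    page = FakePage()
    try:
        await run_pipeline(page, crashing_handler)
    except RuntimeError:
        try:
            return await page.content()
        except RuntimeError as exc:
            return f'error: {exc}'
    return 'no error'


async def handlers_before_cleanup() -> str:
    """Error handling happens inside the pipeline, before cleanup."""
    page = FakePage()
    seen: list[str] = []

    async def error_handler(ctx: FakePage) -> None:
        seen.append(await ctx.content())

    try:
        await run_pipeline(page, crashing_handler, error_handler)
    except RuntimeError:
        pass
    return seen[0]


if __name__ == '__main__':
    print(asyncio.run(current_order()))
    print(asyncio.run(handlers_before_cleanup()))
```

In this toy model, `current_order` mirrors the reported behavior (the page is already closed when the error is handled), while `handlers_before_cleanup` shows that moving the error handler call inside the pipeline keeps the page usable. How this maps onto `_run_request_handler` and `__run_task_function` in the real code is left open here.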

