Wrong url in driver #22

Tamplier · 2018-12-21T07:59:05Z

Sometimes I receive different URL addresses in response and driver.

self.logger.warning("Request response url: %s" % response.url)
self.logger.warning("Request driver url: %s" % response.meta['driver'].current_url)

And more than that, driver url sometimes duplicates urls from other responces (at the same time response urls are unique).

The text was updated successfully, but these errors were encountered:

Tamplier · 2018-12-21T11:57:23Z

Oh, I understand. It's a bad idea to share a single instance of WebDriver because parse happens parallel with new requests.

response.meta['driver'].find_element_by_css_selector('button.btn.btn-primary').click()
response = response.replace(body=response.meta['driver'].page_source)

It's how I tried to use driver. It leads to a situation when response will be replaced with body from other url (which is in the driver at the moment)

clemfromspace · 2019-01-01T12:31:44Z

Hi @Tamplier,

Thanks for opening this!
What you experiencing looks similar to #21, and you are right about the single instance of the webdriver.
Exposing the driver in the response meta is not working as expected since many requests are happening at the same time. Unless I find a workaround (creating one webdriver for each parallel request?) I think I will have to stop exposing the driver from the meta.

Another problem from another issue is that since only one instance of webdriver is created, it can slow down the entire Scrapy request / response processing...

I don't have any solution right now for theses problems, but I am currently working on another project who is at least solving the second one: https://github.com/clemfromspace/scrapy-puppeteer (Fully asynchronous webdriver using puppeteer instead of Selenium).

xtan9 · 2021-08-18T19:08:25Z

I'm facing the same problem. Any updates?

ospaarmann · 2022-03-01T19:15:09Z

I am facing the same issue. I created an issue before I found this one, so I'm going to link it in order to focus the discussion about the topic: #111

clemfromspace mentioned this issue Jan 1, 2019

meta data not in sync with driver #21

Closed

xtan9 mentioned this issue Aug 18, 2021

How to perform a click button with scrapy-selenium? #85

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Wrong url in driver #22

Wrong url in driver #22

Tamplier commented Dec 21, 2018

Tamplier commented Dec 21, 2018

clemfromspace commented Jan 1, 2019

xtan9 commented Aug 18, 2021

ospaarmann commented Mar 1, 2022

Wrong url in driver #22

Wrong url in driver #22

Comments

Tamplier commented Dec 21, 2018

Tamplier commented Dec 21, 2018

clemfromspace commented Jan 1, 2019

xtan9 commented Aug 18, 2021

ospaarmann commented Mar 1, 2022