• LogParser v0.8.2, last updated at 2026-03-13 15:05:41, http://scrapyd-0:6800/logs/sourcing_v2/manheim.gb/2026-03-13T14_52_48.json

PROJECT (sourcing_v2), SPIDER (manheim.gb)

  • project             sourcing_v2
    spider              manheim.gb
    job                 2026-03-13T14_52_48
    first_log_time      2026-03-13 14:52:53
    latest_log_time     2026-03-13 15:05:39
    runtime             0:12:46
    crawled_pages       1243
    scraped_items       154
    shutdown_reason     Received SIGTERM twice
    finish_reason       N/A
    log_critical_count  0
    log_error_count     181
    log_warning_count   23
    log_redirect_count  0
    log_retry_count     0
    log_ignore_count    0
    latest_item         N/A
    • WARNING+

    • error_logs
      last 10 of 181

      2026-03-13 15:04:43 [crawlers.pipelines.post_to_api] ERROR: manheim.gb, item_id a548c274-5845-5e3f-9680-69b9b24f8dfc: Failed to post item to https://api.alx.dev-cluster.alx.tech/api/command/Alx.Cars.Contracts.Internal.Sourcing.AddListing: 400 Client Error: Bad Request for url: https://api.alx.dev-cluster.alx.tech/api/command/Alx.Cars.Contracts.Internal.Sourcing.AddListing
       | Status Code: 400
       | Response Content: {"error":"Command deserialization failed. Check field types \u2014 a null value may have been sent for a non-nullable field.","path":"/command/Alx.Cars.Contracts.Internal.Sourcing.AddListing","requestBody":"{\u0022Color\u0022: \u0022BLACK\u0022, \u0022OriginalPhotoCount\u0022: 0, \u0022SellerName\u0022: \u0022Gates Group\u0022, \u0022PriceIncludesVat\u0022: false, \u0022Title\u0022: \u0022FORD B-MAX 1.4 Zetec 5dr\u0022, \u0022Price\u0022: {\u0022AmountSmallestUnit\u0022: 0, \u0022CurrencyCode\u0022: \u0022GBP\u0022}, \u0022Make\u0022: \u0022FORD\u0022, \u0022Model\u0022: \u0022B-MAX\u0022, \u0022Transmission\u0022: \u0022manual\u0022, \u0022Engine\u0022: \u00221.4L\u0022, \u0022FuelType\u0022: \u0022Petrol\u0022, \u0022Odometer\u0022: {\u0022Value\u0022: null, \u0022Unit\u0022: 2}, \u0022RegistrationDate\u0022: \u00222015-03-31\u0022, \u0022LinkToWebOffer\u0022: \u0022https://www.manheim.co.uk/vehicle-detail/FORD/B-MAX?id=4084882\u0022, \u0022Location\u0022: \u0022Northampton - In Transit\u0022, \u0022OriginCountryCode\u0022: \u0022GB\u0022, \u0022Trim\u0022: \u00221.4 Zetec 5dr\u0022, \u0022LicensePlate\u0022: \u0022AK15HDX\u0022, \u0022Year\u0022: 2015, \u0022Listing\u0022: \u0022auction\u0022, \u0022CO2EmissionValue\u0022: 139, \u0022CarExpertReportSourceUrl\u0022: null, \u0022ExpirationDate\u0022: \u00222026-03-17T09:00:00Z\u0022, \u0022IsVatDeductible\u0022: false, \u0022IsVatIncluded\u0022: false, \u0022Id\u0022: \u0022a548c274-5845-5e3f-9680-69b9b24f8dfc\u0022}"}
       | Response Headers: {'Content-Type': 'application/json', 'Date': 'Fri, 13 Mar 2026 15:04:42 GMT', 'Request-Context': 'appId=cid-v1:a0fd5208-3a60-4573-91fc-d1324e0cd36d', 'Server': 'Kestrel', 'Strict-Transport-Security': 'max-age=31536000; includeSubDomains; preload', 'Transfer-Encoding': 'chunked'}
      
      2026-03-13 15:04:43 [crawlers.pipelines.post_to_api] ERROR: 400 Client Error: Bad Request for url: https://api.alx.dev-cluster.alx.tech/api/command/Alx.Cars.Contracts.Internal.Sourcing.AddListing
      Traceback (most recent call last):
        File "/usr/src/app/crawlers/pipelines/post_to_api.py", line 94, in process_item
          response.raise_for_status()
        File "/usr/local/lib/python3.11/dist-packages/requests/models.py", line 1021, in raise_for_status
          raise HTTPError(http_error_msg, response=self)
      requests.exceptions.HTTPError: 400 Client Error: Bad Request for url: https://api.alx.dev-cluster.alx.tech/api/command/Alx.Cars.Contracts.Internal.Sourcing.AddListing
      2026-03-13 15:04:43 [scrapy.core.scraper] ERROR: Error processing {'auction_title': '',
       'c_o_2_emission_value': 139,
       'car_expert_report_source_url': None,
       'color': 'BLACK',
       'currency': 'GBP',
       'engine': '1.4L',
       'expiration_date': '2026-03-17T09:00:00Z',
       'fuel_type': 'Petrol',
       'id': 'a548c274-5845-5e3f-9680-69b9b24f8dfc',
       'is_damaged': False,
       'is_operable': True,
       'is_vat_deductible': False,
       'is_vat_included': False,
       'license_plate': 'AK15HDX',
       'link_to_web_offer': 'https://www.manheim.co.uk/vehicle-detail/FORD/B-MAX?id=4084882',
       'listing': 'auction',
       'location': 'Northampton - In Transit',
       'make': 'FORD',
       'model': 'B-MAX',
       'odometer': None,
       'odometer_reading_unit': 'mi',
       'origin_country_code': 'GB',
       'original_photo_count': 0,
       'price': 0,
       'price_includes_vat': False,
       'registration_date': '2015-03-31',
       'seller_name': 'Gates Group',
       'title': 'FORD B-MAX 1.4 Zetec 5dr',
       'transmission': 'manual',
       'trim': '1.4 Zetec 5dr',
       'year': 2015}
      Traceback (most recent call last):
        File "/usr/local/lib/python3.11/dist-packages/twisted/internet/defer.py", line 1078, in _runCallbacks
          current.result = callback(  # type: ignore[misc]
        File "/usr/local/lib/python3.11/dist-packages/scrapy/utils/defer.py", line 340, in f
          return deferred_from_coro(coro_f(*coro_args, **coro_kwargs))
        File "/usr/src/app/crawlers/pipelines/post_to_api.py", line 114, in process_item
          raise e
        File "/usr/src/app/crawlers/pipelines/post_to_api.py", line 94, in process_item
          response.raise_for_status()
        File "/usr/local/lib/python3.11/dist-packages/requests/models.py", line 1021, in raise_for_status
          raise HTTPError(http_error_msg, response=self)
      requests.exceptions.HTTPError: 400 Client Error: Bad Request for url: https://api.alx.dev-cluster.alx.tech/api/command/Alx.Cars.Contracts.Internal.Sourcing.AddListing
      2026-03-13 15:04:53 [crawlers.pipelines.post_to_api] ERROR: manheim.gb, item_id 3e6af9fb-e0b9-5e69-a355-1523efc68d7b: Failed to post item to https://api.alx.dev-cluster.alx.tech/api/command/Alx.Cars.Contracts.Internal.Sourcing.AddListing: 400 Client Error: Bad Request for url: https://api.alx.dev-cluster.alx.tech/api/command/Alx.Cars.Contracts.Internal.Sourcing.AddListing
       | Status Code: 400
       | Response Content: {"error":"Command deserialization failed. Check field types \u2014 a null value may have been sent for a non-nullable field.","path":"/command/Alx.Cars.Contracts.Internal.Sourcing.AddListing","requestBody":"{\u0022Color\u0022: \u0022GREY\u0022, \u0022OriginalPhotoCount\u0022: 0, \u0022SellerName\u0022: \u0022Allen Ford\u0022, \u0022PriceIncludesVat\u0022: false, \u0022Title\u0022: \u0022FORD TRANSIT 2.0 EcoBlue 185ps Low Roof D/Cab Sport Van Auto\u0022, \u0022Price\u0022: {\u0022AmountSmallestUnit\u0022: 0, \u0022CurrencyCode\u0022: \u0022GBP\u0022}, \u0022Make\u0022: \u0022FORD\u0022, \u0022Model\u0022: \u0022TRANSIT\u0022, \u0022Transmission\u0022: \u0022automatic\u0022, \u0022Engine\u0022: \u00222.0L\u0022, \u0022FuelType\u0022: \u0022Diesel\u0022, \u0022Odometer\u0022: {\u0022Value\u0022: null, \u0022Unit\u0022: 2}, \u0022RegistrationDate\u0022: \u00222021-01-29\u0022, \u0022LinkToWebOffer\u0022: \u0022https://www.manheim.co.uk/vehicle-detail/FORD/TRANSIT?id=4085056\u0022, \u0022Location\u0022: \u0022Northampton - In Transit\u0022, \u0022OriginCountryCode\u0022: \u0022GB\u0022, \u0022Trim\u0022: \u00222.0 EcoBlue 185ps Low Roof D/Cab Sport Van Auto\u0022, \u0022LicensePlate\u0022: \u0022YL70FJY\u0022, \u0022Year\u0022: 2021, \u0022Listing\u0022: \u0022auction\u0022, \u0022CO2EmissionValue\u0022: 171, \u0022CarExpertReportSourceUrl\u0022: null, \u0022ExpirationDate\u0022: \u00222026-03-19T09:00:00Z\u0022, \u0022IsVatDeductible\u0022: true, \u0022IsVatIncluded\u0022: false, \u0022Id\u0022: \u00223e6af9fb-e0b9-5e69-a355-1523efc68d7b\u0022}"}
       | Response Headers: {'Content-Type': 'application/json', 'Date': 'Fri, 13 Mar 2026 15:04:52 GMT', 'Request-Context': 'appId=cid-v1:a0fd5208-3a60-4573-91fc-d1324e0cd36d', 'Server': 'Kestrel', 'Strict-Transport-Security': 'max-age=31536000; includeSubDomains; preload', 'Transfer-Encoding': 'chunked'}
      
      2026-03-13 15:04:53 [crawlers.pipelines.post_to_api] ERROR: 400 Client Error: Bad Request for url: https://api.alx.dev-cluster.alx.tech/api/command/Alx.Cars.Contracts.Internal.Sourcing.AddListing
      Traceback (most recent call last):
        File "/usr/src/app/crawlers/pipelines/post_to_api.py", line 94, in process_item
          response.raise_for_status()
        File "/usr/local/lib/python3.11/dist-packages/requests/models.py", line 1021, in raise_for_status
          raise HTTPError(http_error_msg, response=self)
      requests.exceptions.HTTPError: 400 Client Error: Bad Request for url: https://api.alx.dev-cluster.alx.tech/api/command/Alx.Cars.Contracts.Internal.Sourcing.AddListing
      2026-03-13 15:04:53 [scrapy.core.scraper] ERROR: Error processing {'auction_title': '',
       'c_o_2_emission_value': 171,
       'car_expert_report_source_url': None,
       'color': 'GREY',
       'currency': 'GBP',
       'engine': '2.0L',
       'expiration_date': '2026-03-19T09:00:00Z',
       'fuel_type': 'Diesel',
       'id': '3e6af9fb-e0b9-5e69-a355-1523efc68d7b',
       'is_damaged': False,
       'is_operable': True,
       'is_vat_deductible': True,
       'is_vat_included': False,
       'license_plate': 'YL70FJY',
       'link_to_web_offer': 'https://www.manheim.co.uk/vehicle-detail/FORD/TRANSIT?id=4085056',
       'listing': 'auction',
       'location': 'Northampton - In Transit',
       'make': 'FORD',
       'model': 'TRANSIT',
       'odometer': None,
       'odometer_reading_unit': 'mi',
       'origin_country_code': 'GB',
       'original_photo_count': 0,
       'price': 0,
       'price_includes_vat': False,
       'registration_date': '2021-01-29',
       'seller_name': 'Allen Ford',
       'title': 'FORD TRANSIT 2.0 EcoBlue 185ps Low Roof D/Cab Sport Van Auto',
       'transmission': 'automatic',
       'trim': '2.0 EcoBlue 185ps Low Roof D/Cab Sport Van Auto',
       'year': 2021}
      Traceback (most recent call last):
        File "/usr/local/lib/python3.11/dist-packages/twisted/internet/defer.py", line 1078, in _runCallbacks
          current.result = callback(  # type: ignore[misc]
        File "/usr/local/lib/python3.11/dist-packages/scrapy/utils/defer.py", line 340, in f
          return deferred_from_coro(coro_f(*coro_args, **coro_kwargs))
        File "/usr/src/app/crawlers/pipelines/post_to_api.py", line 114, in process_item
          raise e
        File "/usr/src/app/crawlers/pipelines/post_to_api.py", line 94, in process_item
          response.raise_for_status()
        File "/usr/local/lib/python3.11/dist-packages/requests/models.py", line 1021, in raise_for_status
          raise HTTPError(http_error_msg, response=self)
      requests.exceptions.HTTPError: 400 Client Error: Bad Request for url: https://api.alx.dev-cluster.alx.tech/api/command/Alx.Cars.Contracts.Internal.Sourcing.AddListing
      2026-03-13 15:04:54 [crawlers.pipelines.post_to_api] ERROR: manheim.gb, item_id 89398dd7-1067-504c-b46a-1e46fd0124e7: Failed to post item to https://api.alx.dev-cluster.alx.tech/api/command/Alx.Cars.Contracts.Internal.Sourcing.AddListing: 400 Client Error: Bad Request for url: https://api.alx.dev-cluster.alx.tech/api/command/Alx.Cars.Contracts.Internal.Sourcing.AddListing
       | Status Code: 400
       | Response Content: {"error":"Command deserialization failed. Check field types \u2014 a null value may have been sent for a non-nullable field.","path":"/command/Alx.Cars.Contracts.Internal.Sourcing.AddListing","requestBody":"{\u0022Color\u0022: \u0022BRONZE\u0022, \u0022OriginalPhotoCount\u0022: 0, \u0022SellerName\u0022: \u0022Vindis Group\u0022, \u0022PriceIncludesVat\u0022: false, \u0022Title\u0022: \u0022PEUGEOT 3008 1.2 PureTech Allure 5dr\u0022, \u0022Price\u0022: {\u0022AmountSmallestUnit\u0022: 0, \u0022CurrencyCode\u0022: \u0022GBP\u0022}, \u0022Make\u0022: \u0022PEUGEOT\u0022, \u0022Model\u0022: \u00223008\u0022, \u0022Transmission\u0022: \u0022manual\u0022, \u0022Engine\u0022: \u00221.2L\u0022, \u0022FuelType\u0022: \u0022Petrol\u0022, \u0022Odometer\u0022: {\u0022Value\u0022: null, \u0022Unit\u0022: 2}, \u0022RegistrationDate\u0022: \u00222019-05-10\u0022, \u0022LinkToWebOffer\u0022: \u0022https://www.manheim.co.uk/vehicle-detail/PEUGEOT/3008?id=4085070\u0022, \u0022Location\u0022: \u0022Northampton - In Transit\u0022, \u0022OriginCountryCode\u0022: \u0022GB\u0022, \u0022Trim\u0022: \u00221.2 PureTech Allure 5dr\u0022, \u0022LicensePlate\u0022: \u0022OE19RMZ\u0022, \u0022Year\u0022: 2019, \u0022Listing\u0022: \u0022auction\u0022, \u0022CO2EmissionValue\u0022: 120, \u0022CarExpertReportSourceUrl\u0022: null, \u0022ExpirationDate\u0022: \u00222026-03-19T09:00:00Z\u0022, \u0022IsVatDeductible\u0022: false, \u0022IsVatIncluded\u0022: false, \u0022Id\u0022: \u002289398dd7-1067-504c-b46a-1e46fd0124e7\u0022}"}
       | Response Headers: {'Content-Type': 'application/json', 'Date': 'Fri, 13 Mar 2026 15:04:53 GMT', 'Request-Context': 'appId=cid-v1:a0fd5208-3a60-4573-91fc-d1324e0cd36d', 'Server': 'Kestrel', 'Strict-Transport-Security': 'max-age=31536000; includeSubDomains; preload', 'Transfer-Encoding': 'chunked'}
      
      2026-03-13 15:04:54 [crawlers.pipelines.post_to_api] ERROR: 400 Client Error: Bad Request for url: https://api.alx.dev-cluster.alx.tech/api/command/Alx.Cars.Contracts.Internal.Sourcing.AddListing
      Traceback (most recent call last):
        File "/usr/src/app/crawlers/pipelines/post_to_api.py", line 94, in process_item
          response.raise_for_status()
        File "/usr/local/lib/python3.11/dist-packages/requests/models.py", line 1021, in raise_for_status
          raise HTTPError(http_error_msg, response=self)
      requests.exceptions.HTTPError: 400 Client Error: Bad Request for url: https://api.alx.dev-cluster.alx.tech/api/command/Alx.Cars.Contracts.Internal.Sourcing.AddListing
      2026-03-13 15:04:54 [scrapy.core.scraper] ERROR: Error processing {'auction_title': '',
       'c_o_2_emission_value': 120,
       'car_expert_report_source_url': None,
       'color': 'BRONZE',
       'currency': 'GBP',
       'engine': '1.2L',
       'expiration_date': '2026-03-19T09:00:00Z',
       'fuel_type': 'Petrol',
       'id': '89398dd7-1067-504c-b46a-1e46fd0124e7',
       'is_damaged': False,
       'is_operable': True,
       'is_vat_deductible': False,
       'is_vat_included': False,
       'license_plate': 'OE19RMZ',
       'link_to_web_offer': 'https://www.manheim.co.uk/vehicle-detail/PEUGEOT/3008?id=4085070',
       'listing': 'auction',
       'location': 'Northampton - In Transit',
       'make': 'PEUGEOT',
       'model': '3008',
       'odometer': None,
       'odometer_reading_unit': 'mi',
       'origin_country_code': 'GB',
       'original_photo_count': 0,
       'price': 0,
       'price_includes_vat': False,
       'registration_date': '2019-05-10',
       'seller_name': 'Vindis Group',
       'title': 'PEUGEOT 3008 1.2 PureTech Allure 5dr',
       'transmission': 'manual',
       'trim': '1.2 PureTech Allure 5dr',
       'year': 2019}
      Traceback (most recent call last):
        File "/usr/local/lib/python3.11/dist-packages/twisted/internet/defer.py", line 1078, in _runCallbacks
          current.result = callback(  # type: ignore[misc]
        File "/usr/local/lib/python3.11/dist-packages/scrapy/utils/defer.py", line 340, in f
          return deferred_from_coro(coro_f(*coro_args, **coro_kwargs))
        File "/usr/src/app/crawlers/pipelines/post_to_api.py", line 114, in process_item
          raise e
        File "/usr/src/app/crawlers/pipelines/post_to_api.py", line 94, in process_item
          response.raise_for_status()
        File "/usr/local/lib/python3.11/dist-packages/requests/models.py", line 1021, in raise_for_status
          raise HTTPError(http_error_msg, response=self)
      requests.exceptions.HTTPError: 400 Client Error: Bad Request for url: https://api.alx.dev-cluster.alx.tech/api/command/Alx.Cars.Contracts.Internal.Sourcing.AddListing
      2026-03-13 15:04:55 [manheim.gb] ERROR: Error extracting listing 4085356: Field transmission is not nullable
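
      Note on the 400 responses above: each rejected request body carries "Odometer": {"Value": null, ...}, while the item posted successfully in the Tail section has an integer odometer and still sends "CarExpertReportSourceUrl": null. That suggests, but does not prove, that the null odometer value triggers the deserialization failure. The sketch below is a minimal pre-flight check a pipeline could run before posting. The names find_null_paths, blocking_nulls and NULLABLE_PATHS are hypothetical and not taken from crawlers.pipelines.post_to_api, and the allow-list is an assumption inferred from this log only.

```python
from typing import Any, Iterator, List

# Assumption: fields the API appears to accept as null, inferred from the one
# successful post in this log. The real contract may differ.
NULLABLE_PATHS = {"CarExpertReportSourceUrl"}


def find_null_paths(payload: Any, prefix: str = "") -> Iterator[str]:
    """Yield the dotted path of every None value in a nested dict/list payload."""
    if payload is None:
        yield prefix or "<root>"
    elif isinstance(payload, dict):
        for key, value in payload.items():
            yield from find_null_paths(value, f"{prefix}.{key}" if prefix else key)
    elif isinstance(payload, list):
        for index, value in enumerate(payload):
            yield from find_null_paths(value, f"{prefix}[{index}]")


def blocking_nulls(payload: Any) -> List[str]:
    """Return null paths that are not on the allow-list."""
    return [path for path in find_null_paths(payload) if path not in NULLABLE_PATHS]


# Trimmed copy of the first rejected request body shown above.
payload = {
    "Make": "FORD",
    "Model": "B-MAX",
    "Odometer": {"Value": None, "Unit": 2},
    "CarExpertReportSourceUrl": None,
}

print(blocking_nulls(payload))  # ['Odometer.Value']
```

      An item with a non-empty result could be dropped or logged as a warning before the request is sent, which would avoid one 400 response and two tracebacks per listing.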
    • warning_logs
      last 10 of 23

      2026-03-13 15:04:13 [manheim.gb] WARNING: Skipping listing 4082389: missing registration date
      2026-03-13 15:04:13 [manheim.gb] WARNING: Skipping listing 4082395: missing registration date
      2026-03-13 15:04:13 [manheim.gb] WARNING: Skipping listing 4082397: missing registration date
      2026-03-13 15:04:14 [manheim.gb] WARNING: Skipping listing 4082400: missing registration date
      2026-03-13 15:05:08 [manheim.gb] WARNING: Skipping listing 4086224: missing registration date
      2026-03-13 15:05:09 [manheim.gb] WARNING: Skipping listing 4086320: missing registration date
      2026-03-13 15:05:39 [scrapy.core.downloader.handlers.http11] WARNING: Got data loss in http://images.manheim.co.uk/images/units/zoom/4087605-9999999-58131621-002.jpg. If you want to process broken responses set the setting DOWNLOAD_FAIL_ON_DATALOSS = False -- This message won't be shown in further requests
      2026-03-13 15:05:39 [scrapy.core.downloader.handlers.http11] WARNING: Got data loss in http://images.manheim.co.uk/images/units/zoom/insp-0001-4087681-4095956-13.jpg. If you want to process broken responses set the setting DOWNLOAD_FAIL_ON_DATALOSS = False -- This message won't be shown in further requests
      2026-03-13 15:05:39 [scrapy.core.downloader.handlers.http11] WARNING: Got data loss in http://images.manheim.co.uk/images/units/zoom/b63179dc-7267-448e-ba6e-72bea2d24593_5.jpg. If you want to process broken responses set the setting DOWNLOAD_FAIL_ON_DATALOSS = False -- This message won't be shown in further requests
      2026-03-13 15:05:39 [scrapy.core.downloader.handlers.http11] WARNING: Got data loss in http://images.manheim.co.uk/images/units/zoom/077bf5da-6f3b-4fe8-b279-2a69ea1b8913_ext_1.jpg. If you want to process broken responses set the setting DOWNLOAD_FAIL_ON_DATALOSS = False -- This message won't be shown in further requests
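
      Note on the data-loss warnings above: they are logged at 15:05:39, one second after "Closing spider (shutdown)" in the Tail section, so they may simply reflect connections cut by the shutdown. If truncated image responses should nevertheless reach the spider, Scrapy offers the DOWNLOAD_FAIL_ON_DATALOSS setting named in the warning and a per-request download_fail_on_dataloss meta key. With either set to False, broken responses are delivered with "dataloss" in response.flags. The sketch below shows both as plain dictionaries. The helper names are hypothetical and nothing here is taken from the project's settings.

```python
from typing import Iterable

# Project-wide option: let truncated responses through instead of failing them.
DATALOSS_SETTINGS = {"DOWNLOAD_FAIL_ON_DATALOSS": False}


def image_request_meta(allow_dataloss: bool) -> dict:
    """Build Request.meta that overrides the setting for a single request."""
    return {"download_fail_on_dataloss": not allow_dataloss}


def is_truncated(response_flags: Iterable[str]) -> bool:
    """Scrapy adds 'dataloss' to response.flags for broken responses it lets through."""
    return "dataloss" in response_flags


print(image_request_meta(allow_dataloss=True))  # {'download_fail_on_dataloss': False}
print(is_truncated(["dataloss"]))               # True
```

      For photo downloads, a truncated file is usually worse than a missing one, so checking is_truncated before uploading to blob storage would be a sensible companion to either option.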

    • scrapy_version

      2.11.2
    • telnet_console

      127.0.0.1:6023
    • telnet_password

      a1e9f4ef42fffa2b
    • latest_duplicate

      2026-03-13 15:03:57 [scrapy.dupefilters] DEBUG: Filtered duplicate request: <POST https://www.manheim.co.uk/search/vehiclebeautyimages/> - no more duplicates will be shown (see DUPEFILTER_DEBUG to show all duplicates)
    • latest_crawl

      2026-03-13 15:05:39 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://images.manheim.co.uk/images/units/zoom/insp-0001-4087655-4095168-13.jpg> (referer: None)
    • latest_scrape

      2026-03-13 15:05:38 [scrapy.core.scraper] DEBUG: Scraped from <200 https://www.manheim.co.uk/search/vehiclebeautyimages/>
    • latest_stat

      2026-03-13 15:04:54 [scrapy.extensions.logstats] INFO: Crawled 1243 pages (at 105 pages/min), scraped 154 items (at 16 items/min)
    • Head

      2026-03-13 14:52:53 [scrapy.utils.log] INFO: Scrapy 2.11.2 started (bot: SourcingV2)
      2026-03-13 14:52:53 [scrapy.utils.log] INFO: Versions: lxml 5.2.2.0, libxml2 2.12.6, cssselect 1.2.0, parsel 1.9.1, w3lib 2.1.2, Twisted 24.3.0, Python 3.11.2 (main, Mar 13 2023, 12:18:29) [GCC 12.2.0], pyOpenSSL 24.1.0 (OpenSSL 3.2.1 30 Jan 2024), cryptography 42.0.7, Platform Linux-5.15.0-1098-azure-x86_64-with-glibc2.36
      2026-03-13 14:52:53 [manheim.gb] INFO: Starting spider manheim.gb
      2026-03-13 14:52:53 [scrapy.addons] INFO: Enabled addons:
      []
      2026-03-13 14:52:53 [asyncio] DEBUG: Using selector: EpollSelector
      2026-03-13 14:52:53 [scrapy.utils.log] DEBUG: Using reactor: twisted.internet.asyncioreactor.AsyncioSelectorReactor
      2026-03-13 14:52:53 [scrapy.utils.log] DEBUG: Using asyncio event loop: asyncio.unix_events._UnixSelectorEventLoop
      2026-03-13 14:52:53 [scrapy.extensions.telnet] INFO: Telnet Password: a1e9f4ef42fffa2b
      2026-03-13 14:52:53 [scrapy.middleware] INFO: Enabled extensions:
      ['scrapy.extensions.corestats.CoreStats',
       'scrapy.extensions.telnet.TelnetConsole',
       'scrapy.extensions.memusage.MemoryUsage',
       'scrapy.extensions.feedexport.FeedExporter',
       'scrapy.extensions.logstats.LogStats',
       'scrapy.extensions.closespider.CloseSpider']
      2026-03-13 14:52:53 [scrapy.crawler] INFO: Overridden settings:
      {'BOT_NAME': 'SourcingV2',
       'CLOSESPIDER_TIMEOUT': 7200,
       'DOWNLOAD_MAXSIZE': 52428800,
       'DOWNLOAD_WARNSIZE': 10485760,
       'FEED_EXPORT_ENCODING': 'utf-8',
       'LOG_FILE': '/var/log/scrapyd/logs/sourcing_v2/manheim.gb/2026-03-13T14_52_48.log',
       'LOG_FORMATTER': 'crawlers.log_formatter.SourcingLogFormatter',
       'MEMUSAGE_LIMIT_MB': 2048,
       'MEMUSAGE_WARNING_MB': 1536,
       'NEWSPIDER_MODULE': 'spiders',
       'REQUEST_FINGERPRINTER_CLASS': 'scrapy_zyte_api.ScrapyZyteAPIRequestFingerprinter',
       'REQUEST_FINGERPRINTER_IMPLEMENTATION': '2.7',
       'SPIDER_MODULES': ['spiders', 'auth_check'],
       'TWISTED_REACTOR': 'twisted.internet.asyncioreactor.AsyncioSelectorReactor',
       'USER_AGENT': ''}
      2026-03-13 14:52:53 [scrapy_zyte_api.handler] INFO: Using a Zyte API key starting with '5857011'
      2026-03-13 14:52:53 [scrapy_zyte_api.handler] INFO: Using a Zyte API key starting with '5857011'
      2026-03-13 14:52:53 [scrapy.middleware] INFO: Enabled downloader middlewares:
      ['scrapy.downloadermiddlewares.offsite.OffsiteMiddleware',
       'scrapy.downloadermiddlewares.httpauth.HttpAuthMiddleware',
       'scrapy.downloadermiddlewares.downloadtimeout.DownloadTimeoutMiddleware',
       'scrapy.downloadermiddlewares.defaultheaders.DefaultHeadersMiddleware',
       'scrapy.downloadermiddlewares.useragent.UserAgentMiddleware',
       'scrapy_zyte_api.ScrapyZyteAPIDownloaderMiddleware',
       'scrapy.downloadermiddlewares.retry.RetryMiddleware',
       'scrapy.downloadermiddlewares.redirect.MetaRefreshMiddleware',
       'scrapy.downloadermiddlewares.httpcompression.HttpCompressionMiddleware',
       'scrapy.downloadermiddlewares.redirect.RedirectMiddleware',
       'scrapy.downloadermiddlewares.cookies.CookiesMiddleware',
       'scrapy.downloadermiddlewares.httpproxy.HttpProxyMiddleware',
       'scrapy.downloadermiddlewares.stats.DownloaderStats']
      2026-03-13 14:52:53 [crawlers.middlewares.id_gen_middleware] INFO: Setting up IdGenerationMiddleware
      2026-03-13 14:52:53 [scrapy.middleware] INFO: Enabled spider middlewares:
      ['scrapy.spidermiddlewares.httperror.HttpErrorMiddleware',
       'scrapy_zyte_api.ScrapyZyteAPISpiderMiddleware',
       'crawlers.middlewares.monitoring_spider_middleware.MonitoringSpiderMiddleware',
       'scrapy.spidermiddlewares.referer.RefererMiddleware',
       'scrapy.spidermiddlewares.urllength.UrlLengthMiddleware',
       'scrapy.spidermiddlewares.depth.DepthMiddleware',
       'crawlers.middlewares.photo_download_middleware.PhotoDownloadMiddleware',
       'crawlers.middlewares.report_download_middleware.ReportDownloadMiddleware',
       'crawlers.middlewares.id_gen_middleware.IdGenMiddleware']
      2026-03-13 14:52:53 [azure.identity._credentials.environment] INFO: Incomplete environment configuration for EnvironmentCredential. These variables are set: AZURE_TENANT_ID, AZURE_CLIENT_ID
      2026-03-13 14:52:53 [urllib3.connectionpool] DEBUG: Starting new HTTPS connection (1): login.microsoftonline.com:443
      2026-03-13 14:52:53 [urllib3.connectionpool] DEBUG: https://login.microsoftonline.com:443 "POST /8ea908c1-4e85-4692-bc3f-3646b9b40891/oauth2/v2.0/token HTTP/1.1" 200 2097
      2026-03-13 14:52:53 [azure.identity._credentials.chained] INFO: DefaultAzureCredential acquired a token from WorkloadIdentityCredential
      2026-03-13 14:52:54 [crawlers.pipelines.translation_pipeline] INFO: Loading translations for language: auto
      2026-03-13 14:52:54 [crawlers.pipelines.item_rules_pipeline] INFO: Setting up ItemRules Pipeline
      2026-03-13 14:52:54 [crawlers.filter_rules.rules_loader] INFO: Loaded rule: invalid_location_for_country.json
      2026-03-13 14:52:54 [crawlers.filter_rules.rules_loader] INFO: Loaded rule: damaged_cars_from_auction_title.json
      2026-03-13 14:52:54 [crawlers.filter_rules.rules_loader] INFO: Loaded rule: puretech_for_country.json
      2026-03-13 14:52:54 [crawlers.filter_rules.rules_loader] INFO: Loaded rule: puretech_for_fr.json
      2026-03-13 14:52:54 [crawlers.filter_rules.rules_loader] INFO: Loaded rule: missing_photos.json
      2026-03-13 14:52:54 [crawlers.filter_rules.rules_loader] INFO: Loaded rule: damaged_from_info.json
      2026-03-13 14:52:54 [crawlers.filter_rules.rules_loader] INFO: Loaded rule: makes_not_allowed.json
      2026-03-13 14:52:54 [crawlers.filter_rules.rules_loader] INFO: Loaded rule: not_operable_from_info.json
      2026-03-13 14:52:54 [crawlers.filter_rules.rules_loader] INFO: Loaded rule: makes_models_not_allowed.json
      2026-03-13 14:52:54 [crawlers.filter_rules.rules_loader] INFO: Loaded rule: keywords_from_title.json
      2026-03-13 14:52:54 [crawlers.filter_rules.rules_loader] INFO: Loaded rule: imported_cars.json
      2026-03-13 14:52:54 [crawlers.filter_rules.rules_loader] INFO: Loaded rule: invalid_currency.json
      2026-03-13 14:52:54 [crawlers.filter_rules.rules_loader] INFO: Loaded rule: missing_mileage.json
      2026-03-13 14:52:54 [crawlers.filter_rules.rules_loader] INFO: Loaded rule: keywords_from_auction_title.json
      2026-03-13 14:52:54 [crawlers.filter_rules.rules_loader] INFO: Loaded rule: invalid_country_of_origin.json
      2026-03-13 14:52:54 [crawlers.filter_rules.rules_loader] INFO: Loaded rule: puretech_for_pt.json
      2026-03-13 14:52:54 [crawlers.filter_rules.rules_loader] INFO: Loaded rule: electric_cars.json
      2026-03-13 14:52:54 [crawlers.filter_rules.rules_loader] INFO: Loaded rule: missing_color.json
      2026-03-13 14:52:54 [crawlers.pipelines.post_to_api] INFO: Setting up PostToApi Pipeline pointing to https://api.alx.dev-cluster.alx.tech/api/command/Alx.Cars.Contracts.Internal.Sourcing.AddListing
      2026-03-13 14:52:54 [scrapy.middleware] INFO: Enabled item pipelines:
      ['crawlers.pipelines.translation_pipeline.TranslationPipeline',
       'crawlers.pipelines.item_rules_pipeline.ItemRulesPipeline',
       'crawlers.pipelines.post_to_api.PostToApiPipeline']
      2026-03-13 14:52:54 [scrapy.core.engine] INFO: Spider opened
      2026-03-13 14:52:54 [scrapy.extensions.logstats] INFO: Crawled 0 pages (at 0 pages/min), scraped 0 items (at 0 items/min)
      2026-03-13 14:52:54 [scrapy.extensions.telnet] INFO: Telnet console listening on 127.0.0.1:6023
      2026-03-13 14:52:54 [scrapy.extensions.memusage] INFO: Peak memory usage is 140MiB
      2026-03-13 14:52:54 [scrapy-playwright] INFO: Starting download handler
      
      2026-03-13 14:52:54 [scrapy-playwright] INFO: Starting download handler
      2026-03-13 14:52:59 [scrapy-playwright] INFO: Launching browser firefox
      2026-03-13 14:52:59 [scrapy-playwright] INFO: Browser firefox launched
      2026-03-13 14:52:59 [scrapy-playwright] DEBUG: Browser context started: 'default' (persistent=False, remote=False)
      2026-03-13 14:53:00 [scrapy-playwright] DEBUG: [Context=default] New page created, page count is 1 (1 for all contexts)
      2026-03-13 14:53:00 [scrapy-playwright] DEBUG: [Context=default] Request: <GET https://www.manheim.co.uk/> (resource type: document)
      2026-03-13 14:53:01 [scrapy-playwright] DEBUG: [Context=default] Response: <407 https://www.manheim.co.uk/>
    • Tail

       'original_photo_count': 4,
       'photos': [{'BlobStorageUrl': 'https://alxdev.blob.core.windows.net/top-deal-listing/listing_410ff630-6543-5547-ae72-a20ded1fd770/410ff630-6543-5547-ae72-a20ded1fd770_image_1.jpg',
                   'Extension': 'jpg',
                   'FileSizeInBytes': 72484,
                   'Height': 1000,
                   'OriginUrl': 'http://images.manheim.co.uk/images/units/zoom/awaiting-image-002.jpg',
                   'Width': 1322},
                  {'BlobStorageUrl': 'https://alxdev.blob.core.windows.net/top-deal-listing/listing_410ff630-6543-5547-ae72-a20ded1fd770/410ff630-6543-5547-ae72-a20ded1fd770_image_1.jpg',
                   'Extension': 'jpg',
                   'FileSizeInBytes': 72484,
                   'Height': 1000,
                   'OriginUrl': 'http://images.manheim.co.uk/images/units/zoom/awaiting-image-002.jpg',
                   'Width': 1322},
                  {'BlobStorageUrl': 'https://alxdev.blob.core.windows.net/top-deal-listing/listing_a5c7f08b-99ec-582f-ac4d-ffcc000f935b/a5c7f08b-99ec-582f-ac4d-ffcc000f935b_image_2.jpg',
                   'Extension': 'jpg',
                   'FileSizeInBytes': 223502,
                   'Height': 1000,
                   'OriginUrl': 'http://images.manheim.co.uk/images/units/zoom/insp-0001-4087514-4095126-3.jpg',
                   'Width': 1333},
                  {'BlobStorageUrl': 'https://alxdev.blob.core.windows.net/top-deal-listing/listing_a5c7f08b-99ec-582f-ac4d-ffcc000f935b/a5c7f08b-99ec-582f-ac4d-ffcc000f935b_image_3.jpg',
                   'Extension': 'jpg',
                   'FileSizeInBytes': 356638,
                   'Height': 1000,
                   'OriginUrl': 'http://images.manheim.co.uk/images/units/zoom/insp-0001-4087514-4095126-13.jpg',
                   'Width': 1333}],
       'price': 0,
       'price_includes_vat': False,
       'registration_date': '2023-02-20',
       'seller_name': 'Santander Consumer',
       'title': 'MAZDA 3 2.0 e-Skyactiv G MHEV Sport Lux 5dr',
       'transmission': 'manual',
       'trim': '2.0 e-Skyactiv G MHEV Sport Lux 5dr',
       'year': 2023}
      2026-03-13 15:05:38 [manheim.gb] INFO: Saving data for 4087514: {'created_time': 1773414338.19649, 'last_price_update_time': 1773414338.1965, 'auction_closing_time': 1773738000.0}
      2026-03-13 15:05:38 [crawlers.pipelines.translation_pipeline] INFO: Spider: manheim.gb, Translating item: c94f46e5-14b7-5e7c-8e09-0539d1ad142c with identifier: 4087345
      2026-03-13 15:05:38 [crawlers.pipelines.item_rules_pipeline] INFO: Spider: manheim.gb, Applying rules to item: c94f46e5-14b7-5e7c-8e09-0539d1ad142c with identifier: 4087345
      2026-03-13 15:05:38 [crawlers.pipelines.post_to_api] INFO: Spider: manheim.gb, Posting item: c94f46e5-14b7-5e7c-8e09-0539d1ad142c with identifier: 4087345 to the API
      2026-03-13 15:05:38 [urllib3.connectionpool] DEBUG: Starting new HTTPS connection (1): api.alx.dev-cluster.alx.tech:443
      2026-03-13 15:05:38 [urllib3.connectionpool] DEBUG: https://api.alx.dev-cluster.alx.tech:443 "POST /api/command/Alx.Cars.Contracts.Internal.Sourcing.AddListing HTTP/1.1" 200 None
      2026-03-13 15:05:38 [scrapy.core.scraper] DEBUG: Scraped from <200 https://www.manheim.co.uk/search/vehiclebeautyimages/>
      {'auction_title': '',
       'c_o_2_emission_value': 110,
       'car_expert_report_source_url': None,
       'color': 'SILVER',
       'currency': 'GBP',
       'engine': '1.6L',
       'expiration_date': '2026-03-25T09:00:00Z',
       'fuel_type': 'Diesel',
       'id': 'c94f46e5-14b7-5e7c-8e09-0539d1ad142c',
       'is_damaged': False,
       'is_operable': True,
       'is_vat_deductible': True,
       'is_vat_included': False,
       'license_plate': 'LT19VRV',
       'link_to_web_offer': 'https://www.manheim.co.uk/vehicle-detail/PEUGEOT/PARTNER?id=4087345',
       'listing': 'auction',
       'location': 'Haydock',
       'make': 'PEUGEOT',
       'model': 'PARTNER',
       'odometer': 20508,
       'odometer_reading_unit': 'mi',
       'origin_country_code': 'GB',
       'original_photo_count': 0,
       'price': 0,
       'price_includes_vat': False,
       'registration_date': '2019-03-31',
       'seller_name': 'Novuna Vehicle Solutions',
       'title': 'PEUGEOT PARTNER 850 SE 1.6 BlueHDi 100 Van ATV',
       'transmission': 'manual',
       'trim': '850 SE 1.6 BlueHDi 100 Van ATV',
       'year': 2019}
      2026-03-13 15:05:38 [manheim.gb] INFO: Saving data for 4087345: {'created_time': 1773414338.670039, 'last_price_update_time': 1773414338.670048, 'auction_closing_time': 1774429200.0}
      2026-03-13 15:05:38 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://images.manheim.co.uk/images/units/zoom/insp-0001-4087621-4118128-2.jpg> (referer: None)
      2026-03-13 15:05:38 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://images.manheim.co.uk/images/units/zoom/b63179dc-7267-448e-ba6e-72bea2d24593_ext_7.jpg> (referer: None)
      2026-03-13 15:05:38 [scrapy.core.engine] INFO: Closing spider (shutdown)
      2026-03-13 15:05:38 [manheim.gb] INFO: Found listing with ID: 4087749
      2026-03-13 15:05:38 [manheim.gb] INFO: Scrape type for 4087749: 1
      2026-03-13 15:05:38 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/4da62462-545c-4337-96bb-81e96c7694bd_ext_2.jpg not downloaded yet
      2026-03-13 15:05:38 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/ca14fcc8-f164-43ef-aef6-db1c4eb7dfe8_ext_3.jpg not downloaded yet
      2026-03-13 15:05:38 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/4087605-9999999-58131621-004.jpg not downloaded yet
      2026-03-13 15:05:38 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/09afc6b8-7e28-46a0-a13c-a7cd3996ca96_ext_7.jpg not downloaded yet
      2026-03-13 15:05:38 [common.azure_blobstore] INFO: Creating container client for top-deal-listing
      2026-03-13 15:05:38 [common.azure_blobstore] INFO: Container_client: <azure.storage.blob._container_client.ContainerClient object at 0x7f74415f8bd0>
      2026-03-13 15:05:38 [common.azure_blobstore] INFO: Creating container if not exists
      2026-03-13 15:05:38 [urllib3.connectionpool] DEBUG: https://alxdev.blob.core.windows.net:443 "PUT /top-deal-listing?restype=container HTTP/1.1" 409 230
      2026-03-13 15:05:38 [common.azure_blobstore] INFO: Container already exists, skipping creation
      2026-03-13 15:05:38 [urllib3.connectionpool] DEBUG: https://alxdev.blob.core.windows.net:443 "PUT /top-deal-listing/listing_8f791898-e176-5711-9027-39cfd8ba1761/8f791898-e176-5711-9027-39cfd8ba1761_image_8.jpg HTTP/1.1" 201 0
      2026-03-13 15:05:38 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/98490054-dcf5-4716-a473-97b46a41063c_4.jpg saved to cache
      2026-03-13 15:05:38 [crawlers.middlewares.photo_download_middleware] INFO: Downloaded image 8 for item 8f791898-e176-5711-9027-39cfd8ba1761
      2026-03-13 15:05:38 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/b63179dc-7267-448e-ba6e-72bea2d24593_9.jpg not downloaded yet
      2026-03-13 15:05:38 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/077bf5da-6f3b-4fe8-b279-2a69ea1b8913_10.jpg not downloaded yet
      2026-03-13 15:05:38 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/88da649c-5719-43d9-8ac2-a2804e6d0379_ext_5.jpg not downloaded yet
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/cd72712c-bb65-4d10-b9a7-51a4d2a953ee_ext_1.jpg not downloaded yet
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/44d63232-f268-4135-85f3-c8aa843785dd_ext_1.jpg not downloaded yet
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Creating container client for top-deal-listing
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Container_client: <azure.storage.blob._container_client.ContainerClient object at 0x7f744120d010>
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Creating container if not exists
      2026-03-13 15:05:39 [urllib3.connectionpool] DEBUG: https://alxdev.blob.core.windows.net:443 "PUT /top-deal-listing?restype=container HTTP/1.1" 409 230
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Container already exists, skipping creation
      2026-03-13 15:05:39 [urllib3.connectionpool] DEBUG: https://alxdev.blob.core.windows.net:443 "PUT /top-deal-listing/listing_2960b3f3-ca0f-5ee6-914f-ad4cb0587671/2960b3f3-ca0f-5ee6-914f-ad4cb0587671_image_0.jpg HTTP/1.1" 201 0
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/077bf5da-6f3b-4fe8-b279-2a69ea1b8913_ext_7.jpg saved to cache
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Downloaded image 0 for item 2960b3f3-ca0f-5ee6-914f-ad4cb0587671
      2026-03-13 15:05:39 [crawlers.middlewares.id_gen_middleware] INFO: Generated ID for item: 48c4d3a0-3537-5285-a31f-c75e0d16371c with identifier: 4087689
      2026-03-13 15:05:39 [crawlers.middlewares.report_download_middleware] INFO: Skipping car_expert_report download for item 48c4d3a0-3537-5285-a31f-c75e0d16371c with scrape type 1
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Downloading 10 photos for item 48c4d3a0-3537-5285-a31f-c75e0d16371c
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/insp-0001-4087689-4103645-13.jpg not downloaded yet
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Creating container client for top-deal-listing
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Container_client: <azure.storage.blob._container_client.ContainerClient object at 0x7f7441789650>
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Creating container if not exists
      2026-03-13 15:05:39 [urllib3.connectionpool] DEBUG: https://alxdev.blob.core.windows.net:443 "PUT /top-deal-listing?restype=container HTTP/1.1" 409 230
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Container already exists, skipping creation
      2026-03-13 15:05:39 [urllib3.connectionpool] DEBUG: https://alxdev.blob.core.windows.net:443 "PUT /top-deal-listing/listing_02500193-7504-50cf-8a27-9573a82db584/02500193-7504-50cf-8a27-9573a82db584_image_0.jpg HTTP/1.1" 201 0
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/b63179dc-7267-448e-ba6e-72bea2d24593_ext_7.jpg saved to cache
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Downloaded image 0 for item 02500193-7504-50cf-8a27-9573a82db584
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Creating container client for top-deal-listing
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Container_client: <azure.storage.blob._container_client.ContainerClient object at 0x7f7442c34890>
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Creating container if not exists
      2026-03-13 15:05:39 [urllib3.connectionpool] DEBUG: https://alxdev.blob.core.windows.net:443 "PUT /top-deal-listing?restype=container HTTP/1.1" 409 230
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Container already exists, skipping creation
      2026-03-13 15:05:39 [urllib3.connectionpool] DEBUG: https://alxdev.blob.core.windows.net:443 "PUT /top-deal-listing/listing_38e93817-abc1-5793-9213-4ed08d82eb0c/38e93817-abc1-5793-9213-4ed08d82eb0c_image_8.jpg HTTP/1.1" 201 0
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/62bc7c4c-6e06-4d49-8e5a-09a70f8c8692_ext_4.jpg saved to cache
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Downloaded image 8 for item 38e93817-abc1-5793-9213-4ed08d82eb0c
      2026-03-13 15:05:39 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://images.manheim.co.uk/images/units/zoom/insp-0001-4087621-4118128-3.jpg> (referer: None)
      2026-03-13 15:05:39 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://images.manheim.co.uk/images/units/zoom/077bf5da-6f3b-4fe8-b279-2a69ea1b8913_ext_7.jpg> (referer: None)
      2026-03-13 15:05:39 [scrapy.core.engine] DEBUG: Crawled (200) <POST https://www.manheim.co.uk/search/vehiclebeautyimages/> (referer: https://www.manheim.co.uk/search)
      2026-03-13 15:05:39 [manheim.gb] INFO: Found listing with ID: 4087753
      2026-03-13 15:05:39 [manheim.gb] INFO: Scrape type for 4087753: 1
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/4da62462-545c-4337-96bb-81e96c7694bd_9.jpg not downloaded yet
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/ca14fcc8-f164-43ef-aef6-db1c4eb7dfe8_ext_5.jpg not downloaded yet
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/insp-0001-4087605-4098629-13.jpg not downloaded yet
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/09afc6b8-7e28-46a0-a13c-a7cd3996ca96_ext_6.jpg not downloaded yet
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Creating container client for top-deal-listing
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Container_client: <azure.storage.blob._container_client.ContainerClient object at 0x7f7441780d10>
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Creating container if not exists
      2026-03-13 15:05:39 [urllib3.connectionpool] DEBUG: https://alxdev.blob.core.windows.net:443 "PUT /top-deal-listing?restype=container HTTP/1.1" 409 230
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Container already exists, skipping creation
      2026-03-13 15:05:39 [urllib3.connectionpool] DEBUG: https://alxdev.blob.core.windows.net:443 "PUT /top-deal-listing/listing_9e4f739e-a548-5f87-9f4b-61bc1564d86f/9e4f739e-a548-5f87-9f4b-61bc1564d86f_image_0.jpg HTTP/1.1" 201 0
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/insp-0001-4087621-4118128-1.jpg saved to cache
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Downloaded image 0 for item 9e4f739e-a548-5f87-9f4b-61bc1564d86f
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/b63179dc-7267-448e-ba6e-72bea2d24593_ext_2.jpg not downloaded yet
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/077bf5da-6f3b-4fe8-b279-2a69ea1b8913_9.jpg not downloaded yet
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/88da649c-5719-43d9-8ac2-a2804e6d0379_ext_2.jpg not downloaded yet
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/cd72712c-bb65-4d10-b9a7-51a4d2a953ee_5.jpg not downloaded yet
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/44d63232-f268-4135-85f3-c8aa843785dd_5.jpg not downloaded yet
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/ebecebd4-d247-465f-bbe6-892ae5305171_ext_7.jpg not downloaded yet
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Creating container client for top-deal-listing
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Container_client: <azure.storage.blob._container_client.ContainerClient object at 0x7f744200ae10>
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Creating container if not exists
      2026-03-13 15:05:39 [urllib3.connectionpool] DEBUG: https://alxdev.blob.core.windows.net:443 "PUT /top-deal-listing?restype=container HTTP/1.1" 409 230
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Container already exists, skipping creation
      2026-03-13 15:05:39 [urllib3.connectionpool] DEBUG: https://alxdev.blob.core.windows.net:443 "PUT /top-deal-listing/listing_9e4f739e-a548-5f87-9f4b-61bc1564d86f/9e4f739e-a548-5f87-9f4b-61bc1564d86f_image_1.jpg HTTP/1.1" 201 0
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/insp-0001-4087621-4118128-2.jpg saved to cache
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Downloaded image 1 for item 9e4f739e-a548-5f87-9f4b-61bc1564d86f
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Creating container client for top-deal-listing
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Container_client: <azure.storage.blob._container_client.ContainerClient object at 0x7f7441111710>
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Creating container if not exists
      2026-03-13 15:05:39 [urllib3.connectionpool] DEBUG: https://alxdev.blob.core.windows.net:443 "PUT /top-deal-listing?restype=container HTTP/1.1" 409 230
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Container already exists, skipping creation
      2026-03-13 15:05:39 [urllib3.connectionpool] DEBUG: https://alxdev.blob.core.windows.net:443 "PUT /top-deal-listing/listing_02500193-7504-50cf-8a27-9573a82db584/02500193-7504-50cf-8a27-9573a82db584_image_1.jpg HTTP/1.1" 201 0
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/b63179dc-7267-448e-ba6e-72bea2d24593_ext_7.jpg saved to cache
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Downloaded image 1 for item 02500193-7504-50cf-8a27-9573a82db584
      2026-03-13 15:05:39 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://images.manheim.co.uk/images/units/zoom/insp-0001-4087674-4114088-46.jpg> (referer: None)
      2026-03-13 15:05:39 [manheim.gb] INFO: Found listing with ID: 4087765
      2026-03-13 15:05:39 [manheim.gb] INFO: Scrape type for 4087765: 1
      2026-03-13 15:05:39 [scrapy.crawler] INFO: Received SIGTERM twice, forcing unclean shutdown
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/4da62462-545c-4337-96bb-81e96c7694bd_10.jpg not downloaded yet
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/ca14fcc8-f164-43ef-aef6-db1c4eb7dfe8_ext_2.jpg not downloaded yet
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/4087605-9999999-58131621-012.jpg not downloaded yet
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/09afc6b8-7e28-46a0-a13c-a7cd3996ca96_ext_1.jpg not downloaded yet
      2026-03-13 15:05:39 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://images.manheim.co.uk/images/units/zoom/077bf5da-6f3b-4fe8-b279-2a69ea1b8913_ext_6.jpg> (referer: None)
      2026-03-13 15:05:39 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://images.manheim.co.uk/images/units/zoom/insp-0001-4087655-4095168-13.jpg> (referer: None)
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/077bf5da-6f3b-4fe8-b279-2a69ea1b8913_ext_2.jpg not downloaded yet
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/88da649c-5719-43d9-8ac2-a2804e6d0379_9.jpg not downloaded yet
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/cd72712c-bb65-4d10-b9a7-51a4d2a953ee_4.jpg not downloaded yet
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/44d63232-f268-4135-85f3-c8aa843785dd_4.jpg not downloaded yet
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/ebecebd4-d247-465f-bbe6-892ae5305171_ext_4.jpg not downloaded yet
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Creating container client for top-deal-listing
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Container_client: <azure.storage.blob._container_client.ContainerClient object at 0x7f7441f9bfd0>
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Creating container if not exists
      2026-03-13 15:05:39 [urllib3.connectionpool] DEBUG: https://alxdev.blob.core.windows.net:443 "PUT /top-deal-listing?restype=container HTTP/1.1" 409 230
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Container already exists, skipping creation
      2026-03-13 15:05:39 [urllib3.connectionpool] DEBUG: https://alxdev.blob.core.windows.net:443 "PUT /top-deal-listing/listing_9e4f739e-a548-5f87-9f4b-61bc1564d86f/9e4f739e-a548-5f87-9f4b-61bc1564d86f_image_2.jpg HTTP/1.1" 201 0
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/insp-0001-4087621-4118128-3.jpg saved to cache
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Downloaded image 2 for item 9e4f739e-a548-5f87-9f4b-61bc1564d86f
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Creating container client for top-deal-listing
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Container_client: <azure.storage.blob._container_client.ContainerClient object at 0x7f7442ccf2d0>
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Creating container if not exists
      2026-03-13 15:05:39 [urllib3.connectionpool] DEBUG: https://alxdev.blob.core.windows.net:443 "PUT /top-deal-listing?restype=container HTTP/1.1" 409 230
      2026-03-13 15:05:39 [common.azure_blobstore] INFO: Container already exists, skipping creation
      2026-03-13 15:05:39 [urllib3.connectionpool] DEBUG: https://alxdev.blob.core.windows.net:443 "PUT /top-deal-listing/listing_2960b3f3-ca0f-5ee6-914f-ad4cb0587671/2960b3f3-ca0f-5ee6-914f-ad4cb0587671_image_1.jpg HTTP/1.1" 201 0
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/077bf5da-6f3b-4fe8-b279-2a69ea1b8913_ext_7.jpg saved to cache
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Downloaded image 1 for item 2960b3f3-ca0f-5ee6-914f-ad4cb0587671
      2026-03-13 15:05:39 [crawlers.middlewares.id_gen_middleware] INFO: Generated ID for item: 083f6fd4-7fc9-5b97-b78b-fcdd31f48b68 with identifier: 4087736
      2026-03-13 15:05:39 [crawlers.middlewares.report_download_middleware] INFO: Skipping car_expert_report download for item 083f6fd4-7fc9-5b97-b78b-fcdd31f48b68 with scrape type 1
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Downloading 10 photos for item 083f6fd4-7fc9-5b97-b78b-fcdd31f48b68
      2026-03-13 15:05:39 [crawlers.middlewares.photo_download_middleware] INFO: Image http://images.manheim.co.uk/images/units/zoom/5c3c315f-3565-4e92-80e6-7e6011e3e0b2_ext_7.jpg not downloaded yet
      2026-03-13 15:05:39 [scrapy.core.downloader.handlers.http11] WARNING: Got data loss in http://images.manheim.co.uk/images/units/zoom/4087605-9999999-58131621-002.jpg. If you want to process broken responses set the setting DOWNLOAD_FAIL_ON_DATALOSS = False -- This message won't be shown in further requests
      2026-03-13 15:05:39 [scrapy.core.downloader.handlers.http11] WARNING: Got data loss in http://images.manheim.co.uk/images/units/zoom/insp-0001-4087681-4095956-13.jpg. If you want to process broken responses set the setting DOWNLOAD_FAIL_ON_DATALOSS = False -- This message won't be shown in further requests
      2026-03-13 15:05:39 [scrapy.core.downloader.handlers.http11] WARNING: Got data loss in http://images.manheim.co.uk/images/units/zoom/b63179dc-7267-448e-ba6e-72bea2d24593_5.jpg. If you want to process broken responses set the setting DOWNLOAD_FAIL_ON_DATALOSS = False -- This message won't be shown in further requests
      2026-03-13 15:05:39 [scrapy.core.downloader.handlers.http11] WARNING: Got data loss in http://images.manheim.co.uk/images/units/zoom/077bf5da-6f3b-4fe8-b279-2a69ea1b8913_ext_1.jpg. If you want to process broken responses set the setting DOWNLOAD_FAIL_ON_DATALOSS = False -- This message won't be shown in further requests
    • Log

      /1/log/utf8/sourcing_v2/manheim.gb/2026-03-13T14_52_48/?job_finished=True

    • Source

      http://scrapyd-0:6800/logs/sourcing_v2/manheim.gb/2026-03-13T14_52_48.log