wget not downloading entire file

5,698

Should that:

wget -P <path> <URL>

Your case:

wget -P /dbfs/FileStore/tables http://prod.publicdata.landregistry.gov.uk.s3-website-eu-west-1.amazonaws.com/pp-complete.csv

More detail: wget


Update:

wget -P /dbfs/FileStore/tables  --no-check-certificate --no-proxy http://prod.publicdata.landregistry.gov.uk.s3-website-eu-west-1.amazonaws.com/pp-complete.csv
Share:
5,698

Related videos on Youtube

frank
Author by

frank

Bioinformatician at Kings College London, UK program in perl, R, and C++ mainly

Updated on September 18, 2022

Comments

  • frank
    frank 4 months

    I got historic house prices in UK from: https://www.gov.uk/government/statistical-data-sets/price-paid-data-downloads

    I ran the link for all historic house prices:

    wget http://prod.publicdata.landregistry.gov.uk.s3-website-eu-west-1.amazonaws.com/pp-complete.csv -P /dbfs/FileStore/tables
    

    to download the file.

    I tried multiple times, and get a small file, 7MB:

    enter image description here

    The webpage says it is 4GB:

    enter image description here

    Any suggestions what I can be doing wrong?

    • Melebius
      Melebius over 3 years
      I have tried the same command and have got 473MiB of the file so far. Please check your network connection.
  • frank
    frank over 3 years
    Actually, same issue:2019-09-25 09:29:57 (9.30 MB/s) - ‘/dbfs/FileStore/tables/pp-complete.csv.4’ saved [4301888603/4301888603] was what wget returned, but when I ran the ls -l, -rw-r--r-- 1 root root 6921307 Sep 25 09:30 pp-complete.csv.4
  • N. Bien over 3 years
    Do you have enough free space in your directory? Try to continue downloading : wget -c prod.publicdata.landregistry.gov.uk.s3-website-eu-west-1.ama‌​zon… in the same directory you were during the original download.
  • frank
    frank over 3 years
    as downloading, I got this error: *** WARNING: skipped 2928116 bytes of output *** . The number corresponds to the missing data. Very odd
  • Melebius
    Melebius over 3 years
    @alex Please edit your question to reflect all the details you posted in comments. It helps to keep our Q&A style tidy and could also bring more attention since an edited question gets bounced on the homepage. See also How do comments work?
  • N. Bien over 3 years
    I downloaded this file in Chrome browser and get complete file: -rw-r--r-- 1 biennt biennt 4301888603 Thg 9 26 16:46 pp-complete.csv . So weird!Try clone this repo: URL and run: ./gdown.pl https://drive.google.com/file/d/1Zmp3tf3g1kaMpG949iIFQjZqvLO‌​FJlrr/edit pp-complete.csv
  • frank
    frank over 3 years
    It turned out the system I was using had that webpage access black listed. discussed with security and gained access.