wget
General Usage
wget -Flags URL
Full site mirror
wget -mkprE -np -nc -l inf -e robots=off --restrict-file-names=windows -D example.com --no-check-certificate http://example.com
Flags
--mirror / -mTurns on all options to make a mirror copy of a site (`--recursive`, `--timestamping`, `--level=inf`, `--no-remove-listing`)--convert-links / -kConverts the links in the downloaded documents to make them suitable for offline viewing. This means that all the links will point to the local files instead of the original URLs.adjust-extension / -EAdjusts the file extension based on the MIME type. For example, if a file is served as HTML but doesn't have an .html extension, this flag will rename the file to include the .html extension--page-requisites / -pDownloads all the resources that a page requires to display properly, such as images, CSS, and JavaScript files.--restrict-file-names=windowsModifies filenames so that they are compatible with Windows. This avoids issues with characters that are not allowed in Windows filenames (e.g : or * ). `--domains example.com`: - Restricts the download to the specified domain (example.com in this case). wget will only download files from this domain and no other domains.--no-clobber / -ncPrevents wget from overwriting existing files. If a file already exists, wget will skip the download of that file.--no-check-certificateDisables SSL/TLS certificate validation. This can be useful if the server's certificate is self-signed or otherwise not trusted by default.-e robots=offIgnores the robots.txt file on the server, allowing wget to download pages and resources that might otherwise be disallowed by the server's robots.txt file. Use this option responsibly.--recursive / -rEnables recursive downloading. This means wget will follow links found in the downloaded files and download those files as well. This is necessary for downloading entire websites or sections of websites.--level=infSpecifies the maximum recursion depth. Setting it to inf (infinity) means wget will follow links at any depth, effectively downloading the entire site as long as links are found.--no-parent / -npPrevents wget from following links outside the specified directory. This means it won't ascend to parent directories but will stay within the directory structure of the specified URL.