tar gzip: stdout: File too large - Error Size Limit on Large file > 17GB

The folder I was trying to archive is about 72GB, but much like rsync at about 17GB it chokes because of the filesize.  What's with so many common and essential Linux tools having such limitations?  I guess it is likely that the authors never wrote their code with the idea that files would be so large but it's still very annoying.  It's important to stay on top of these limitations on production servers because I didn't realize what happened until I checked the file with "gzip -t" and the archive was too short.  I then ran the same script again on that directory and recorded the output as shown below from gzip.

time tar -czpf somefile.tar.gz archive-directory

gzip: stdout: File too large

real    33m15.478s
user    0m2.440s
sys     0m34.498s

 

gzip -V
gzip 1.3.5
(2002-09-30)
Copyright 2002 Free Software Foundation
Copyright 1992-1993 Jean-loup Gailly
This program comes with ABSOLUTELY NO WARRANTY.
You may redistribute copies of this program
under the terms of the GNU General Public License.
For more information about these matters, see the file named COPYING.
Compilation options:
DIRENT UTIME STDC_HEADERS HAVE_UNISTD_H HAVE_MEMORY_H HAVE_STRING_H HAVE_LSTAT ASMV
Written by Jean-loup Gailly.

 

Solutions/Alternatives to gzip?

I don't know the specific limitations of other formats but I'm currently trying to tar and bzip2 the same directory and will report back.  I wonder if rar is a viable option and what the maximum size is.  At the very least, rar may make the most sense because you can easily split the archives into 4GB files and this also makes for easy DVD-R backups as well.

bzip2 had the same error at the 17GB mark (and this also shows how much SLOWER bzip2 is with hardly much better compression):

bzip2: I/O or other error, bailing out.  Possible reason follows.
bzip2: File too large
        Input file = (stdin), output file = (stdout)

real    199m7.113s
user    0m2.200s
sys     0m29.714s


Tags:

tar, gzip, stdout, gbthe, folder, archive, gb, rsync, chokes, filesize, essential, linux, limitations, authors, servers, didn, quot, directory, recorded, output, czpf, somefile, gz, user, sys, copyright, software, loup, gailly, warranty, redistribute, copies, gnu, copying, compilation, dirent, utime, stdc_headers, have_unistd_h, have_memory_h, have_string_h, have_lstat, asmv, solutions, alternatives, formats, currently, bzip, rar, viable, maximum, archives, dvd, backups, slower, compression, bailing, follows, input, stdin,

Latest Articles

  • Linux Ubuntu Cannot Print Large Images
  • Cannot Print PDF Solution and Howto Resize
  • Linux Console Login Screen TTY Change Message
  • Apache Cannot Start Listening Already on 0.0.0.0
  • MySQL Bash Query to pipe input directly without using heredoc trick
  • CentOS 6 and 7 / RHEL Persistent DHCP Solution
  • Debian Ubuntu Mint rc-local service startup error solution rc-local.service: Failed at step EXEC spawning /etc/rc.local: Exec format error
  • MySQL Cheatsheet Guide and Tutorial
  • bash script kill whois or other command that is running for too long
  • Linux tftp listens on all interfaces and IPs by DEFAULT Security Risk Hole Solution
  • python import docx error
  • Cisco Unified Communications Manager Express Cheatsheet CUCME CME
  • Linux Ubuntu Debian Missing privilege separation directory: /var/run/sshd
  • bash how to count the number of columns or words in a line
  • bash if statement how to test program output without assigning to variable
  • RTNETLINK answers: Network is unreachable
  • Centos 7 how to save iptables rules like Centos 6
  • nfs tuning maximum amount of connections
  • qemu-kvm error "Could not initialize SDL(No available video device) - exiting"
  • Centos 7 tftpd will not work with selinux enabled