Recent Journal Comments
Below are the most recent comments to the journal entries on tummy.com's website. This page is also available as an RSS feed, simply feed this URL to your RSS new reader.
Date: Friday March 05, at 06:58
In Reply To: mkpkg: Helper to create setup.py for your projects. by Sean Reifschneider
Subject: PasteScript Difference
Author:
Michael Mulich (pumazi)
Do you feel this is any different from using PasteScript?
I've used 'paster create -t basic_package ' for the past year to do exactly what you are doing here. The only difference I can see off the top of my head is that your implementation doesn't have the Paste dependency.
Some could argue that PasteScript is over engineered, but that's what makes it extendable. For instance, the basic_namespace template (e.g. `paster create -t basic_namespace my.package') is built from the basic_package template. And there are other packaging templates built from that for more specific cases.
(go to entry which this is a comment to)
(go to entry which this is a comment to)
Date: Wednesday March 03, at 04:27
In Reply To: mkpkg: Helper to create setup.py for your projects. by Sean Reifschneider
Subject: easy_install mkpkg?
Author:
Marius Gedminas
/me likes
It would be nice if we could easy_install wcfqfu without having to remember long FTP URLs and without having to wait for distutils2 to appear in some future release of Python.
(go to entry which this is a comment to)
(go to entry which this is a comment to)
Date: Thursday February 25, at 07:56
In Reply To: Python developers are the best! by Sean Reifschneider
Subject: Ultra-Sean! henshin!
Author:
David Goodger
Date: Tuesday February 23, at 13:33
In Reply To: ZFS-FUSE Status: Happy stable release after patch, and in Debian. by Sean Reifschneider
Subject: Drive sizes.
Author:
Sean Reifschneider
I'm using a variety of drive sizes for ZFS, mostly 250GB and 500GB, mostly because these are what I already have available. If I were buying drives for a new storage box today I'd probably use 5 2TB 5400RPM drives to give me 6TB of storage with RAID Z2, slower but also using less power.
Of course, RAID Z2 makes smaller drives more attractive, because of less overhead (10 1TB drives would have 8TB data 2TB parity, 5 2TB drives gives 6TB data 4TB parity), so it would mostly be a matter of the math.
10 drives would fit internally on my current layout, so anything under 11 would be my best bet.
Sean
(go to entry which this is a comment to)
(go to entry which this is a comment to)
Date: Sunday February 21, at 14:37
In Reply To: ZFS-FUSE Status: Happy stable release after patch, and in Debian. by Sean Reifschneider
Subject: ZFS
Author:
someone
Date: Wednesday January 27, at 12:11
In Reply To: Rsync and rdiff-backup: Two great tastes that go great together. by Sean Reifschneider
Subject: Yes, there are several rsync copies.
Author:
Sean Reifschneider
Yes, I keep several rsync copies: one for the daily rdiff incremental source and one for the monthly rsync source. I don't have the bandwidth to the backup server to do a full rsync every day -- it's limited by the 768kbps on my home network, and I have 4 or 5 machines to do every day.
So far it has been working extremely well. Once I got the initial setup issues ironed out, it hasn't had any problems.
(go to entry which this is a comment to)
(go to entry which this is a comment to)
Date: Tuesday January 26, at 13:09
In Reply To: Rsync and rdiff-backup: Two great tastes that go great together. by Sean Reifschneider
Subject: Keep Rsyncs?
Author:
Scott
This looks intriguing to me, but I'm curious if you are keeping the rsyncs around (so you only need to xmit the rsync diffs before taking rdiffs) and incurring a ~100% storage requirement increase, or are you removing the rsync copy every time?
Thanks
(go to entry which this is a comment to)
(go to entry which this is a comment to)
Date: Tuesday January 26, at 08:41
In Reply To: Using FreeDOS CD for BIOS updates. by Sean Reifschneider
Subject: cdrom
Author:
Johnny
The recipe looks good. except I could never get the cdrom drive to work off the bat. Would have been nice to see how you did that in more detail
(go to entry which this is a comment to)
(go to entry which this is a comment to)
Date: Friday January 22, at 09:42
In Reply To: Rsync and rdiff-backup: Two great tastes that go great together. by Sean Reifschneider
Subject: rdiff-backup
Author:
Paul Mack
I took a look at rdiff-backup and made the Move to it. This makes a lot more sense then rsync with Hardlinks.
Thanx for turning me on to rdiff-backup.
(go to entry which this is a comment to)
(go to entry which this is a comment to)
Date: Saturday January 16, at 21:43
In Reply To: Growing a software RAID-5 array. by Sean Reifschneider
Subject: Thanks
Author:
David Maas
Date: Friday January 08, at 07:28
In Reply To: 18 months of ZFS-FUSE by Sean Reifschneider
Subject: btrfs is a ways off...
Author:
Sean Reifschneider
That sort of article is still a ways off. I had attempted to do some serious use and testing of btrfs a few months ago, but at that time the ability to delete snapshots was not implemented.
They've just implemented this and it'll be in 2.6.33 (Probably Fedora 13 and Ubuntu 10.10, I've been told not in the upcoming 10.04).
At this point I'd say the comparison I'm using is that btrfs has just caught up with ZFS circa 2005. It's got very cool features and sounds very promising, but it's very young. The latter is not really something you want in a file-system. ;-)
File-systems are just hard things to get right. I'm reminded of ext3, which was just a set of changes to ext2, and that took around 4 years to get out and another couple to stabilize.
Sean
(go to entry which this is a comment to)
(go to entry which this is a comment to)
Date: Thursday January 07, at 08:49
In Reply To: iptstate -- "top" for network connections by Sean Reifschneider
Subject: x it out
Author:
alex dekker
The 'x' feature is handy as well. I used this to delete a NAT table entry for a machine that had changed IP address. I couldn't reconnect with a VPN through my linux router until the NAT entry had expired. Instead of waiting for the TTL to expire I was able to erase the connection entry with iptstate.
(go to entry which this is a comment to)
(go to entry which this is a comment to)
Date: Tuesday January 05, at 04:58
In Reply To: 18 months of ZFS-FUSE by Sean Reifschneider
Subject: btrfs
Author:
anon
Date: Saturday January 02, at 18:14
In Reply To: Rsync and rdiff-backup: Two great tastes that go great together. by Sean Reifschneider
Subject: rdiff-backup
Author:
Chris G.
Sean, thanks for your replies. Well, I have basically 2 other problems with rdiff-backup (besides the ones you already noted); it doesn't compress and it can't currently back up ACLs and EAs. And on several setups we need to use them. OTOH I'm somewhat more familiar with tools that create monolithic archives + increments (yes, they're more vulnerable to disk errors), like dar which I started experimenting with lately. And, if possible I'm for a free, open source solution. Anyway, your insight and experience discussed here shed some more light to some other aspects too, thanks again.
(go to entry which this is a comment to)
(go to entry which this is a comment to)
Date: Saturday January 02, at 16:04
In Reply To: Rsync and rdiff-backup: Two great tastes that go great together. by Sean Reifschneider
Subject: BackupPC is good for some things
Author:
Sean Reifschneider
BackupPC is a very nice tool for some situations. I was planning on deploying it for our laptop backups, but the workflow I developed for our normal backups is for Ubuntu and Ubuntu was having issues on the system I deployed for our laptop backups. CentOS 5, which I ended up putting on that host, uses packages that required quite a lot more work above what the existing workflow was.
So, that was the motivation that got me to spend some time trying this other backup mechanism. rdiff-backup has some very nice features.
As you say, some database files do need to be dumped or flushed to get a consistent backup. I was speaking primarily of database files that are append-only, or similar files like log files.
Yes, we have thought of using tapes, but they do not really fit our needs. We have also considered agent-based backup systems, but have dismissed them for similar reasons.
For backing up virtual systems we basically just treat them as if they were normal machines. That has served us well.
Sean
(go to entry which this is a comment to)
(go to entry which this is a comment to)
Date: Saturday January 02, at 07:43
In Reply To: Rsync and rdiff-backup: Two great tastes that go great together. by Sean Reifschneider
Subject: Various thoughts
Author:
Chris G.
Hi. I'm reading your blogs sometimes to always find something interesting. With my startup, I'm planning out backup strategies for several boxes we hope to have more of in the future. One important point is what backup strategy/medium is to use. I think I gather my thought in the form of some questions I hope you find some time to answer in short.
- I've read earlier you are using BackupPC. For a few hosts it's working well for us. Did you not find this tool sufficient, having to look for alternatives (besides the ability of partial updates with other mentioned BU tools)?
- Have you ever thought of using tapes? The equipment is relatively expensive, but tapes are relatively cheap. And no worries of many files and hardlinks. There are worries of other issues, of course. But with that many hosts you're dealing with daily, I'd consider using them...
- Have you thought of using agent-based backups? Many of them can solve the ever-increasing monolithic file backup problems too (well, some of it, most particular being the backup of various database files on MS hosts and Outlook PST files, etc). There are free tools to choose from, too, like A(Z?)manda, Bacula, etc.
- I'm also wondering why you like to backup database files as-is (being one of your primary reasons looking for diffing backup tools). A database datafile or tablespace file might not be in sync with disk contents (TBH on even a moderately intensive workload, it never is), so it's practically impossible to create a consistent, working copy. A mechanism should be used that either syncs/locks database files during backup (which in turn causes a stall in DB access), or other methods should be considered, IMO, like dumping consistent backups of the DB in question, and backing up that dump. Or use facilities that are meant for consistent offline backups, like the binary logs of MySQL, etc.
- I've noticed you have changed your VPS infrastructure to VMWare some time ago (what were your company using before that btw?), do you have any recommendations to look for when doing incremental backups of VMs? We too plan on providing some VPS solution, the most probable solution we'll provide is OpenVZ and/or KVM-based virtualization. The latter uses images similar to VMWare.
- chris
(go to entry which this is a comment to)
(go to entry which this is a comment to)
Date: Thursday December 31, 2009 at 00:16
In Reply To: Auto-Connecting with Cricket Wireless A600 Modem by Sean Reifschneider
Subject: Seems to be working fine.
Author:
Sean Reifschneider
I've been pretty happy with it. The service seems to be the same as Sprint, we haven't seemed to run into the 5GB/month throttling. The card is a little weird with the mode switch under F12, the original Sprint modem that didn't require that was a bit more reliable, sometimes I have issues getting it to switch mode. Under Karmic this seems to be less of an issue.
Sean
(go to entry which this is a comment to)
(go to entry which this is a comment to)
Date: Wednesday December 30, 2009 at 17:32
In Reply To: Auto-Connecting with Cricket Wireless A600 Modem by Sean Reifschneider
Subject: Cricket Wireless
Author:
nrg
What's your opinion of Cricket Wireless' service and their data plans after ~4 months of use?
(go to entry which this is a comment to)
(go to entry which this is a comment to)
Date: Sunday December 27, 2009 at 05:54
In Reply To: 18 months of ZFS-FUSE by Sean Reifschneider
Subject: I wish them success, but...
Author:
Sean Reifschneider
I really hope they succeed at this, but so far it looks to only be a wordpress blog with a post in October and a few replies to comments over the next week or two, and then complete silence. "Check out our twitter feed for the latest developments" -- twitter seems to only have the initial announcement as well.
So, I'm not jumping for joy about this yet. ;-/
(go to entry which this is a comment to)
(go to entry which this is a comment to)
Date: Saturday December 26, 2009 at 13:47
In Reply To: 18 months of ZFS-FUSE by Sean Reifschneider
Subject: ZFS in kernel
Author:
Phobos
"I wish we could have ZFS in the kernel, but the powers that be have decided that will never happen."
Maybe you don't have to wish too hard, as that might come true soon enough: http://kqinfotech.wordpress.com/
(go to entry which this is a comment to)
(go to entry which this is a comment to)
Date: Thursday December 24, 2009 at 06:48
In Reply To: 18 months of ZFS-FUSE by Sean Reifschneider
Subject: zfs-fuse 0.6.0
Author:
Seth
check it out! Performance has increased, it supports more current features etc
(go to entry which this is a comment to)
(go to entry which this is a comment to)
Date: Thursday December 24, 2009 at 01:56
In Reply To: xterm Regex Matching for Cut Selection. by Sean Reifschneider
Subject: xterm config
Author:
me and just me
Ok, I'm even more years late... *G*
I liked your settings, but the quad-click also selects spaces in front of the line content like this:
(go to entry which this is a comment to)
> This is a test \n ^^^^^^^^^^^^^^^^^^^ <- would be markedusing: xterm*on4Clicks: regex [^ \t][^#$]+
> This is a test \n
^^^^^^^^^^^^^^ <- would be marked(go to entry which this is a comment to)
Date: Sunday December 20, 2009 at 17:47
In Reply To: Musings on eBooks by Kevin Fenzi
Subject: If you like Science Fiction ...
Author:
simo
If you like Science Fiction, check add this website to the list of sellers that offer non DRMed books in most formats:
http://www.webscription.net/
There are some available for free and others usually cost around 5$ each.
There are some good books and series there.
(go to entry which this is a comment to)
(go to entry which this is a comment to)
Date: Monday December 14, 2009 at 22:07
In Reply To: Rsync and rdiff-backup: Two great tastes that go great together. by Sean Reifschneider
Subject: Nope, hard-links are what I'm trying to avoid.
Author:
Sean Reifschneider
--link-dest is exactly the sort of "linking trick" that I reference in my original post. It works fine for fairly small sets of data, but it's quite expensive to remove these archived backups (requiring hundreds of thousands or millions or more file delete operations, directory traversal, etc).
The biggest issue is, say you have a 20GB file that every night has 100KB appended to it. Or 100KB within it updated. You know, a fairly typical database file. And you are keeping 30 incremental copies...
With the hard links, this requires 600GB of storage.
Using rdiff-backup or ZFS snapshots, you end up using more like 20.03GB for the same dataset. Or a space savings of 97%.
Disc space is cheap, but it's not that cheap when you're talking about 5 TB versus 150TB...
As far as LVM snapshots go, you have to know beforehand how much space one of these snapshots is going to require, or you have to overcommit and make the snapshot volume larger than you ever expect it to reach, or you have to set up the snapshots to automatically extend when they run short (but not out) of space.
As far as overallocating, I'm looking at having a thousand backup copies on one of our larger systems. Over-allocating by even just 1GB results in a wasted terabyte right there. And I probably can't guess that close to right. Or I have to snapshot the whole backup file-system, and count on each snapshot being rather large, but also that if I move a backup from one host to another, it's still going to have all of that data reserved (probably through the old snapshot copies) for the next year.
Oh, and if you have 30 snapshots of a piece of data and it changes, you have to write and keep 30 copies of that changed volume (one for each snapshot), or you have to have rolling snapshots (one snapshot snapshotting another snapshot, can you do that) and then only be able to trim off the ends, so no trimming from the middle -- you need rolling snapshots for each backup type).
If you have had good luck with setting up and managing 1,000 LVM based snapshots, I'd love to hear about it. However, it seems like it would be a maintenance and performance nightmare. I've toyed with trying it out, since I think LVM snapshots are more robust than ZFS or btrfs are right now, but I just haven't gotten up the urge to try it.
So, the mechanism I wrote about is similar in ideas to many of these, but I believe it's dramatically simpler than LVM snapshots while saving more space than hard links.
The target I'm shooting for is like ZFS snapshots. They automatically manage their space, so you don't have to guess at how much space is going to be used by a snapshot, and it gets allocated out of the main file-system. This is because as blocks are written, they are copied elsewhere (copy on write). And you can create many "light weight" sub-file-systems within the ZFS file-system.
So, I'd create a "backups" ZFS, and then within that make one file-system for each system. For each of those, every night I would take a snapshot. I would delete the snapshots as time went on such that I ended up keeping monthly interval snapshots beyond 6 weeks, and weekly beyond 14 days.
ZFS managed all the complexity behind the scenes.
Sean
(go to entry which this is a comment to)
(go to entry which this is a comment to)
Date: Friday December 11, 2009 at 07:47
In Reply To: Rsync and rdiff-backup: Two great tastes that go great together. by Sean Reifschneider
Subject: rsync with --link-dest
Author:
Paul Mack
You could you rsync with --link-dest "link destination"
I have rsync setup with lvm snapshots and it works very well. I have been running this for about 6 months without any issue.
check out the following URL for more info.
http://www.mikerubel.org/computers/rsync_snapshots/
Paul...
(go to entry which this is a comment to)
(go to entry which this is a comment to)