1

Storage Management :: Global File Systems and ZFS

One enterprise architecture feature that every admin dreams of is simple storage management. Having one storage network and a global filesystem that all nodes can access, and being able to easily have space “appear” on the file system by adding LUNs to the global file system is a huge time saver.

Redhat Linux

GFS(2)
Redhat has done well with GFS(2) from a systems management perspective (I have read that it doesn’t scale as well as lustre). The cluster file system technology is pretty simple comparatively speaking even though there are several layers involved. You don’t need metadata controllers or independent systems acting as lock managers. You do need to either have fencing hardware, or a manual fencing process in place to account for failed nodes.

Adding storage involves using LVM to add the LUN to the volume, expand the volume, then using gfs2_grow to expand the filesystem.

At its simplest a configuration could look like the following:

Sun Solaris/OpenSolaris

SAM-QFS
Sun has had SAM-QFS for quite some time. It is more complex than GFS(2). To be fair it also has more features. It requires dedicated metadata controllers. This solution doesn’t interest me much due to the fact that I use Solaris/OpenSolaris for ZFS to simplify storage management.

Adding storage to the shared QFS file system appears to involve a similar process to GFS(2).

At its simplest a configuration could look like the following:

Lustre, BTRFS/CRFS, ZFS and the Future

Lustre
Lustre seems to be more complex than both GFS(2) and SAM-QFS. It also seems to be more scalable. It requires 2 types of controllers: MDS (metadata) and OSS (storage). So from a management perspective it requires quite a bit more overhead.

It looks like ZFS will be added as a backend storage format in the future versions 2.x+ this might simplify LUN management a little for the solution.

BTRFS/CRFS
BTRFS is a Linux ZFS workalike. It is still in development but the final version is shaping up to have ZFS feature parity (minus easy device/raid management, zvols, and other sweet things like zfs send recv). What interests me however is CRFS.

CRFS also in development, appears to be a global network filesystem that basically exports BTRFS file systems.

ZFS
What I would like to see is ZFS to become cluster aware. Being able to use OpenSolaris as a storage host and export either a zfs or a zpool to nodes, and perhaps run a lockmanager process similar to GFS(2) would be pretty slick.

For now with ZFS

Creating ZFS file systems on disk boxes connected to a SAN fabric really isn’t that difficult. It is however a bit time consuming, and you really should configure LUN masking for each LUN and node. Other wise you run the risk of accidentally corrupting data by using the same LUN at the same time on different nodes.

Exporting zvols with Comstar/iSCSI also doesn’t seem like the best idea, because again you should configure target portal groups and host access, and I don’t think that running ZFS on top of a ZFS based virtual block device (zvol) would be great for performance. Jeff Bonwick had stated something about improving the zvol performance (block pass through?) in his last ZFS talk. I am not sure wether it has been done yet or not. There also are no best practice notes about ZFS on iSCSI zvols.

I would think using UFS on a zvol would be better performance wise, but the whole purpose of the exercise for me is to provide snapshot functionality to the host consuming the filesystem, and filesystem delegation to the zones on the host.

I would really like to be able to use zvols as opposed to SAN fabric LUNs because then the zones could use snapshots, I could mirror the zvols from a Comstar storage host to a remote system for disaster recovery, and only have to deal with one zpool.

So how about it? Is anyone else using Comstar, ZFS, and OpenSolaris in the fashion I have described What is your strategy for consolidating storage management with OpenSolaris?

Read More

1

Sun Xvm Server :: First Impressions

Pros

  1. Installer is great, very quick very easy
  2. Web Interface is great
  3. Overall functionality is on par with VMware ESXi

Cons

  1. No support for VLANs on virtual interfaces (yet)
  2. dom0 uses a boatload of RAM, 2GB. My test system only has 2GB at present so I was unable to test guest creation, and as a result the system was a bit sluggish. For enterprise deployments this is a non-issue, however I am trying to put together a solution for SMB that would utilize Xvm server and I think the high RAM requirement might make this cost prohibitive.

Questions

  1. Will dom0 require 2GB of ram in the production release?
  2. Since I am unable to create a guest until I get more RAM, will I be able send VM snapshots to a remote system easily? Say like over the internet as I do with ZFS send, recv?

Read More

0

Why Mirroring is not a backup :: WOW

http://hardware.slashdot.org/article.pl?sid=09/01/02/1546214

All I can say is wow. Data accidents happen. Claiming that a bug in the software could not have overwritten the data, is not a valid excuse for only maintaining a mirror. That is why ZFS makes so many administrators happy. An on system snapshot could have saved the data.

Currently I maintain a mirror of our main data store with 10 days of snapshots. The mirror’s snapshots are also replicated to a remote system using ZFS send, recv. This has saved numerous hours when clients or staff make errors involving accidental file modification or removal. No restoring from tape, no digging through tar files. Just rollback to a snap shot or cd into the snap shot directory and copy out the lost data.

Read More

0

ZFS :: File System Awesomeness

I have been building NAS/SAN systems lately for our company’s remote backup product. We started out using Nexenta’s storage appliance software which utilizes ZFS. After using it for quite some time I think I am going to migrate to OpenSolaris as soon as the November release is available. It turns out that after learning about and using ZFS I really don’t need the web interface to accomplish my objectives.

Some of the great things about ZFS:

  1. Not having to have RAID cards to use RAID
  2. Being able to add storage to a storage pool with out screwing around with a volume manager
  3. The ability to import existing storage pools
  4. Infinite snapshots that occupy minimal amounts of storage space
  5. The ability to use ZFS send and recv to incrementally replicate file systems over SSH links
  6. Online error correction, better than RAID data protection
  7. Simple pool and file system management
  8. Online file system compression and encryption
  9. Virtual block device support

Couple this with OpenSolaris’s native in kernel CIFS and iSCSI server and you have the most inexspensive enterprise storage solution ever.

Truly an admins dream. Unfortunately Linux folks most likely will never see ZFS integrated with the Linux kernel. Hope is not lost however, in 3-4 years Oracle’s BTRFS will be complete and seems to have a similar feature set.

In the BSD camp, it seems the FreeBSD community has done a pretty decent job integrating ZFS. There are only a couple of features missing at present and it appears to be stabilizing.

Dragonfly BSD of course has Matt Dillon’s HAMMER file system which I have been following quite closely. I will probably play with it some more in a couple of releases.

Of course the reason I do not typically use the BSD OS’s in production environments is due to the lack of a binary update system for the base distribution. I like to be able to update quickly and update often, and patching and compiling updates from source is far too time consuming. (OpenBSD being the exception).

With Snow Leopard we should see ZFS make its official debut on the Mac platform, I guess time will tell.

While I would like to see this file system integrated with more open source OS’s I am quite content to stick with OpenSolaris for my storage systems.

UPDATE

Oh and Sun, great work on the 7000 series storage appliances, the implementation is completely sublime. I just wish that I could afford one :(

Read More