OpenZFS is an open-source implementation of the ZFS file system and volume manager initially developed by Sun Microsystems for the Solaris operating system and now maintained by the OpenZFS Project. It supports features like data compression, data deduplication, copy-on-write clones, snapshots, and RAID-Z. It also supports the creation of virtual devices, which allows for the creation of file systems that span multiple disks.

OpenZFS
Initial releasePorted to various systems between 2006 and 2010. Forked from OpenSolaris August 2010; 13 years ago (2010-08)
Stable release
2.2.4[1] Edit this on Wikidata / 2 May 2024; 2 days ago (2 May 2024)
Preview release
2.2.0-rc5[2] / October 8, 2023; 6 months ago (2023-10-08)[2]
Repositorygithub.com/openzfs/zfs
Written inC
Operating systemOpenSolaris, illumos distributions, OpenIndiana, FreeBSD, Mac OS X Server 10.5 (only read-only support), NetBSD, Linux via third-party kernel module ("ZFS on Linux")[3] or ZFS-FUSE, OSv
Licenseopen source CDDL
Websiteopenzfs.org
OpenZFS Project
Formation2013; 11 years ago (2013)
ProductsOpenZFS filesystem
Parent organization
Software in the Public Interest
Websiteopenzfs.org

One of the capabilities of OpenZFS is self-healing. The file system can detect and correct errors while in use, without the need for a dedicated file system checker. This means that it can be used in mission-critical applications that require high availability.

OpenZFS is widely used in enterprise and data center environments, as well as in consumer devices like network-attached storage (NAS) devices. It is available for many operating systems, including Linux, FreeBSD, macOS, and Windows (through third-party solutions). OpenZFS is licensed under the CDDL (Common Development and Distribution License), which allows for both open-source and commercial use.

Founding members of OpenZFS include Matt Ahrens, one of the main architects of ZFS.[4] In 2020, the code-bases of OpenZFS and ZFS on Linux were merged and released as OpenZFS 2.0. This release brought features and performance enhancements from the fast-developing ZFS on Linux, to other platforms such as FreeBSD.[5][6][7]

History

The ZFS file system was originally developed by Sun Microsystems for the Solaris operating system. The ZFS source code was released in 2005 under the Common Development and Distribution License (CDDL) as part of the OpenSolaris operating system, and it was later ported to other operating systems and environments.[8][9]

The following is a list of key events to the development of ZFS and its various implementations, leading to the creation of OpenZFS as an umbrella project:[8][10]

  • 2001: Closed-source development of ZFS started with two engineers at Sun Microsystems.
  • 2005: ZFS source code was released as part of OpenSolaris.
  • 2006: Development of a FUSE ZFS port for Linux started.
  • 2007: Apple started porting of ZFS to Mac OS X.
  • 2008: A port to FreeBSD was released as part of FreeBSD 7.0.
  • 2008: Development of a native ZFS Linux port started, known as ZFS on Linux.
  • 2009: Apple's ZFS project closed, and the MacZFS project continued to develop the code. (It has since also been discontinued; it was current through MacOS 10.9).
  • 2010: OpenSolaris was discontinued, resulting in the further development of ZFS on Solaris being no longer open-source.
  • 2010: illumos was forked from OpenSolaris as its open-source successor,[11][12] and continued to develop ZFS in the open. Ports of ZFS to other platforms continued pulling in upstream changes from illumos.
  • 2012: Feature flags were introduced to replace legacy on-disk version numbers, enabling easier distributed evolution of the ZFS on-disk format to support new features.
  • 2013: Coexisting with the stable version of MacZFS, its prototype generation (known as OpenZFS on OS X or O3X) uses ZFS on Linux as the new upstream codebase.[13][14]
  • 2013: The first stable release of ZFS on Linux.[15]
  • 2013: Official announcement of the OpenZFS as an umbrella project.[16][17] New features and fixes are regularly pulled into OpenZFS from illumos and pushed into all ports to other platforms, and vice versa.[8]
  • 2016: Ubuntu 16.04 includes the open-source ZFS file system variant by default.
  • 2020: ZFS on Linux was renamed to OpenZFS and added FreeBSD support, unifying the codebase for both platforms.[18]

Ported versions (2005–2010)

As the FSF (Free Software Foundation) claimed a CDDL and GPL legal incompatibility in 2005, Sun's implementation of the ZFS file system wasn't used as a basis for the development of a Linux kernel module, it wasn't merged into the Linux kernel mainline, and Linux distributions did not include it as a precompiled kernel module.[19][20] As a workaround, FUSE, a framework that allows file systems to run in user-space, was used on Linux as a separation layer for which the licensing issues are not in effect, although with a set of its own issues that include performance penalty.[9][21] However, the April 2016 release of Ubuntu 16.04 LTS includes CDDL-licensed ZFS on Linux[22] as a kernel module that is maintained as a separate project, outside the Linux kernel mainline, claiming license compatibility.[23][24][25]

Apple/macOS

In the release version of Mac OS X 10.5, ZFS was available in read-only mode from the command line, which lacks the possibility to create z-pools or write to them.[26] Before the 10.5 release, Apple released the "ZFS Beta Seed v1.1", which allowed read-write access and the creation of z-pools;[27] however, the installer for the "ZFS Beta Seed v1.1" has been reported to only work on version 10.5.0, and has not been updated for version 10.5.1 and above.[28] In August 2007, Apple opened a ZFS project on their Mac OS Forge web site. On that site, Apple provided the source code and binaries of their port of ZFS which includes read-write access, but there was no installer available[29] until a third-party developer created one.[30] In October 2009, Apple announced a shutdown of the ZFS project on Mac OS Forge. That is to say that their own hosting and involvement in ZFS was summarily discontinued. No explanation was given. Apple eventually released the legally required, CDDL-derived, portion of the source code of their final public beta of ZFS, code named "10a286". Complete ZFS support was once advertised as a feature of Snow Leopard Server (Mac OS X Server 10.6).[31] However, by the time the operating system was released, all references to this feature had been removed from its features page.[32] Apple has not commented regarding the omission.

Apple's "10a286" source code release, and versions of the previously released source and binaries, have been preserved and new development has been adopted by the Mac ZFS project[33][34][35] to resume its development elsewhere. As of July 2012, Mac ZFS implements z-pool version 8 and ZFS version 2, from the October 2008 release of Solaris. Additional historical information and commentary can be found on the Mac ZFS web site and FAQ.[36]

Implementations

OpenSolaris

  • OpenSolaris 2008.05, 2008.11 and 2009.06 use ZFS as their default filesystem. There are over a dozen 3rd-party distributions.

OpenIndiana

  • OpenIndiana uses Open ZFS with feature flags as implemented in Illumos. ZFS version 28 used up to version 151a3.[37]
  • By upgrading from Open Solaris snv_134 to both Open Indiana and Solaris 11 Express, one also has the ability to upgrade and separately boot Solaris 11 Express on the same ZFS pool.[38]

BSD

macOS

  • Open ZFS on OSX (abbreviated to O3X) is an implementation of ZFS for macOS.[39] O3X is under active development, with close relation to ZFS on Linux and illumos' ZFS implementation, while maintaining feature flag compatibility with ZFS on Linux. O3X implements z-pool version 5000, and includes the Solaris Porting Layer (SPL) originally written for Mac ZFS, which has been further enhanced to include a memory management layer based on the illumos kmem and vmem allocators. O3X is fully featured, supporting LZ4 compression, deduplication, ARC, L2ARC, and SLOG.[citation needed]
  • MacZFS is free software providing support for ZFS on macOS. The stable legacy branch provides up to ZFS pool version 8 and ZFS filesystem version 2. The development branch, based on ZFS on Linux and OpenZFS, provides updated ZFS functionality, such as up to ZFS zpool version 5000 and feature flags.[40][41]
  • A proprietary implementation of ZFS (Zevo) was available at no cost from GreenBytes, Inc., implementing up to ZFS file system version 5 and ZFS pool version 28.[42] Zevo offered a limited ZFS feature set, pending further commercial development; it was sold to Oracle in 2014, with unknown future plans.[citation needed]

DragonFlyBSD

NetBSD

FreeBSD

  • Paweł Jakub Dawidek ported ZFS to FreeBSD, and it has been part of FreeBSD since version 7.0.[45] This includes zfsboot, which allows booting FreeBSD directly from a ZFS dataset.[46][47]
  • FreeBSD's ZFS implementation is fully functional; the only missing features are kernel CIFS server and iSCSI, but the latter can be added using externally available packages.[48] Samba can be used to provide a userspace CIFS server.
  • FreeBSD 13.0-RELEASE switches ZFS implementation from illumos-based code base to the unified OpenZFS 2 code base.[49] This change allows FreeBSD to receive OpenZFS improvements much quicker.[50]

MidnightBSD

  • MidnightBSD, a desktop operating system derived from FreeBSD, supports ZFS storage pool version 6 as of 0.3-RELEASE. This was derived from code included in FreeBSD 7.0-RELEASE. An update to storage pool 28 is in progress in 0.4-CURRENT and based on 9-STABLE sources around FreeBSD 9.1-RELEASE code.[citation needed]

TrueOS (formerly PC-BSD)

  • TrueOS (formerly known as PC-BSD, now defunct[51]) was a desktop-oriented distribution of FreeBSD, which inherited its ZFS support.[citation needed]

TrueNAS Core, (formerly FreeNAS)

pfSense

  • pfSense, an open source BSD based router, supports ZFS, including installation and booting to ZFS pools, as of version 2.4.

OPNsense

XigmaNAS

  • XigmaNAS (formerly NAS4Free), an embedded open source network-attached storage (NAS) distribution based on FreeBSD, has the same ZFS support as FreeBSD, ZFS storage pool version 5000. This project is a continuation of FreeNAS 7 series project.[53]

Debian GNU/kFreeBSD

  • Being based on the FreeBSD kernel, Debian GNU/kFreeBSD has ZFS support from the kernel. However, additional userland tools are required,[54] while it is possible to have ZFS as root or /boot file system[55] in which case required GRUB configuration is performed by the Debian installer since the Wheezy release.[56]
  • As of January 31, 2013, the ZPool version available is 14 for the Squeeze release, and 28 for the Wheezy-9 release.[57]

Linux

Although the ZFS filesystem supports Linux-based operating systems, difficulties arise for Linux distribution maintainers wishing to provide native support for ZFS in their products due to potential legal incompatibilities between the CDDL license used by the ZFS code, and the GPL license used by the Linux kernel. To enable ZFS support within Linux, a loadable kernel module containing the CDDL-licensed ZFS code must be compiled and loaded into the kernel. According to the Free Software Foundation, the wording of the GPL license legally prohibits redistribution of the resulting product as a derivative work,[58][59] though this viewpoint has caused some controversy.[60][61]

ZFS on FUSE

One potential workaround to licensing incompatibility was trialed in 2006, with an experimental port of the ZFS code to Linux's FUSE system. The filesystem ran entirely in userspace instead of being integrated into the Linux kernel, and was therefore not considered a derivative work of the kernel. This approach was functional, but suffered from significant performance penalties when compared with integrating the filesystem as a native kernel module running in kernel space.[62] As of 2016, the ZFS on FUSE project appears to be defunct.

Native ZFS on Linux

A native port of ZFS for Linux produced by the Lawrence Livermore National Laboratory (LLNL) was released in March 2013,[63][64] following these key events:[65]

  • 2008: prototype to determine viability
  • 2009: initial ZVOL and Lustre support
  • 2010: development moved to GitHub
  • 2011: POSIX layer added
  • 2011: community of early adopters
  • 2012: production usage of ZFS
  • 2013: stable GA release

As of August 2014, ZFS on Linux uses the OpenZFS pool version number 5000, which indicates that the features it supports are defined via feature flags. This pool version is an unchanging number that is expected to never conflict with version numbers given by Oracle.[66]

KQ InfoTech

Another native port for Linux was developed by KQ InfoTech in 2010.[67][68] This port used the zvol implementation from the Lawrence Livermore National Laboratory as a starting point. A release supporting zpool v28 was announced in January 2011.[69] In April 2011, KQ Infotech was acquired by sTec, Inc., and their work on ZFS ceased.[70] Source code of this port can be found on GitHub.[71]

The work of KQ InfoTech was ultimately integrated into the LLNL's native port of ZFS for Linux.[70]

Source code distribution

While the license incompatibility may arise with the distribution of compiled binaries containing ZFS code, it is generally agreed that distribution of the source code itself is not affected by this. In Gentoo, configuring a ZFS root filesystem is well documented and the required packages can be installed from its package repository.[72]Slackware also provides documentation on supporting ZFS, both as a kernel module[73] and when built into the kernel.[74]

Ubuntu integration

The question of the CDDL license's compatibility with the GPL license resurfaced in 2015, when the Linux distribution Ubuntu announced that it intended to make precompiled OpenZFS binary kernel modules available to end-users directly from the distribution's official package repositories.[75] In 2016, Ubuntu announced that a legal review resulted in the conclusion that providing support for ZFS via a binary kernel module was not in violation of the provisions of the GPL license.[76] Others,[77] such as the Software Freedom Law Center[78] followed Ubuntu's conclusion, while the FSF and SFC reiterated their opposing view.[79][80]

Ubuntu 16.04 LTS ("Xenial Xerus"), released on April 21, 2016, allows the user to install the OpenZFS binary packages directly from the Ubuntu software repositories.[81][82][83][84] As of 2023, no legal challenge has been brought against Canonical regarding the distribution of these packages.

As of 2019, Ubuntu supports experimental installation of ZFS as a root filesystem, starting with the 19.10 release ("Eoan Ermine"), to support coexistence of a nearly pure ZFS OS with GRUB and other operating systems on the same disk.[85][86]

TrueNAS Scale

A version of TrueNAS by iXsystems, based on Debian Linux. As with TrueNAS Core (based on FreeBSD), it uses OpenZFS for storage and adds a variety of additional features. These include expanded device driver support, KVM virtual machines, PCIe passthrough and container support via Kubernetes and Docker. Furthermore, it allows clustered Docker and ZFS via gluster. Information about the current release can be found at the iXsystems Software Status page.[87]

Microsoft Windows

A port of open source ZFS was attempted in 2010 but after a hiatus of over one year development ceased in 2012.[88] In October 2017, a new port of OpenZFS was announced by Jörgen Lundman at OpenZFS Developer Summit.[89][90]

Releases and feature histories

Within illumos and (from 2013) OpenZFS

  • 2010: illumos founded as an open-source successor,[91] and continued to develop ZFS in the open. Ports of ZFS to other platforms continued porting upstream changes from illumos.[citation needed]
  • 2013: OpenZFS project founded, aiming at coordinated open-source development of ZFS. The OpenZFS project provides a common foundation for any interested groups and organizations to contribute and collaborate towards a common open source ZFS core, and in addition, to also maintain any specific code and validation processes needed for core ZFS code to work with their own individual systems.

Other ports and forks

The following is a list of open-source ZFS ports and forks:[65][92]

  • 2006: Development of a FUSE port for Linux started.
  • 2007: Apple started porting ZFS to Mac OS X.
  • 2008: A port to FreeBSD was released as part of FreeBSD 7.0.
  • 2008: Development of a native Linux port started.
  • 2009: Apple's ZFS project closed. The MacZFS project continued to develop the code.

Detailed release histories

With ZFS in Oracle Solaris: as new features are introduced, the version numbers of the pool and file system are incremented to designate the format and features available. Features that are available in specific file system versions require a specific pool version.[93][94]

Distributed development of OpenZFS involves feature flags[40] and pool version 5000, an unchanging number that is expected to never conflict with version numbers given by Oracle. Legacy version numbers still exist for pool versions 1–28, implied by the version 5000.[95] Illumos uses pool version 5000 for this purpose.[96][97] Future on-disk format changes are enabled / disabled independently viafeature flags.

Version history

Legend:
Latest FOSS stable release
ZFS Pool Version NumberRelease dateSignificant changes
5000OpenZFSUnchanging pool version to signify that the pool indicates new features after pool version 28 using ZFS feature flags rather than by incrementing the pool version

Pool versions and feature flags

Originally, version numbers of the pool and file system were incremented as new features were introduced, in order to designate the on-disk file system format and available features. This worked well when a single entity controlled the development of ZFS, and this versioning scheme is still in use with the ZFS in Oracle Solaris.[98][99]

In a more distributed development model, having a single version number is far from ideal as all implementations of OpenZFS would need to agree on all changes to the on-disk file system format. The solution selected by OpenZFS was to introduce feature flags as a new versioning system that tags on-disk format changes with unique names, and supports both completely independent format changes and format changes that depend on each other. A pool can be moved and used between OpenZFS implementations as long as all feature flags in use by the pool are supported by both implementations.[10]: 20, 26–27 [100]: 2–3 [101]

In OpenZFS, the pool version is permanently set to 5000, signifying that the pool indicates new features by setting or unsetting ZFS feature flags rather than by incrementing the pool version.[40] The number 5000 was chosen because it is expected to never conflict with version numbers given by Oracle. Legacy version numbers still exist for pool versions 1–28.[102][103][104] Future on-disk format changes are enabled / disabled independently via these feature flags.

Legacy version numbers still exist for pool versions 1–28, and are implied by the pool version 5000;[105] the initial proposal was to use 1000 as the pool version.[100]: 4  Future on-disk format changes are enabled and disabled independently via feature flags.

Feature flags are exposed as pool properties, following these naming scheme rules:[100]: 4 

  • Format of the property name is feature@<org-name>:<feature-name>
  • <org-name> is the reverse DNS name of the organization that developed the feature, ensuring unique property names.
  • Property names can be shortened to feature@<feature-name> when they remain unambiguous.

For example, feature@com.foocompany:async_destroy is a valid property name, and it could be shortened to feature@async_destroy.[100]: 4 

Each pool feature can be in either disabled, enabled, or active state. Disabled features are those that will not be used, and no on-disk format changes will be made; as a result, such features are backward-compatible. Enabled features are those that will be used, no on-disk format changes have been made yet, but the software may make the changes at any time; such features are still backward-compatible. Active features are those that have made backward-incompatible on-disk format changes to the pool.[100]: 5 

When any pool feature is enabled, legacy version of the pool is automatically upgraded to 5000 and any other prerequisite features are also enabled. By default, new pools are created with all supported features enabled. In general, state of a feature can be changed from active back to enabled, undoing that way performed on-disk format changes and making the pool compatible again with an older OpenZFS implementation; however, for some features that might not be possible.[100]: 5, 9 [105]

On-disk format changes can be associated with either features for write or features for read. The former are the features that an OpenZFS implementation must support to be capable of writing to the pool, while supporting such features is not mandatory for opening the pool in read-only mode. The latter are the features that an OpenZFS implementation must support to be able to read from the pool or to just open it, because opening a pool is not possible without actually reading from it.[100]: 7 

For example, async_destroy feature adds a new on-disk data structure to keep track of freed datasets, but an OpenZFS implementation does not need to know about this data structure to access the pool in read-only mode. Additionally, writing to a pool that has some features in active state is not possible by an OpenZFS implementation that does not support the same features.[100]: 7–8 

A list of feature flags and which operating systems support them is available from the OpenZFS documentation Web site[106] (here the old Open-ZFS.org Web site[107])

OpenZFS 2.0

Historically, OpenZFS has been implemented as a core ZFS code, with each operating system's team adapting it to fit into their projects. This led in some cases to feature stagnation and divergence of features and command lines, as different operating systems developed divergent features and bug fixes, often for a single platform rather than across all platforms. Over time, new feature development shifted from Illumos to Linux.[108] These new features and fixes then had to be backported to Illumos before they could be re-ported for FreeBSD.[108] But this was difficult because the Linux version also included many smaller changes, which were hard to disentangle.[108]

In 2018, it was agreed that OpenZFS development would be overhauled to remedy these issues.[108] Rather than try to import all the Linux changes to other platforms piecemeal, the entire Linux ZFS code would be 'pivoted' as a whole, with other platforms being based on the more actively developed Linux version.[108] A wide range of ported and new features, including many long-desired enhancements, would also be rolled out or ported across platforms, and future changes would be discussed across platforms before being implemented.[108] The plans included appropriate porting layers to prevent Linux, GPL or Linux-KPI shim code from being introduced to other platform kernels.[108]

The features in progress or ported for OpenZFS 2.0 is lengthy, and includes:

  • Faster rollout of enhancements and new features across platforms[108]
  • Command line standardisation[108]
  • Improved pool portability (ZFS pools created on one system can be equally used by another)[7]
  • Wider cross-platform feature parity and platform independence[7]
  • Overlay (union) mounts accepted by default[7]
  • Bug fixes and enhancements[7]
  • ZTS and various other features working on FreeBSD[7]
  • TRIM and ACLMODE enhancements[7][108]
  • ZFS holds (from FreeBSD)[7]
  • Enhanced native NFSv4 ACLs (FreeBSD)[7]
  • Enhanced AES-GCM performance for encrypted pools[7]
  • Redacted send/receive[7]
  • Log spacemap and other metaslab management enhancements - a project to re-implement ZFS' management of free space and "metaslabs" for much greater efficiency[7]
  • Fast clone deletion[7]
  • Zstd data compression as a new option[7]
  • Channel program property inheritance[7]
  • AltiVec RAID-Z[7]
  • Bookmark support and copying[7]
  • Direct IO support[7]
  • Persistent L2ARC (L2ARC retained across reboots)[7]
  • Sequential (high speed) scrub and resilver[108]
  • Scrub pause/resume[108]
  • Resilver restart[108]
  • Device (VDEV) removal[108]
  • Zpool initialize and checkpoint[108]
  • Channel programs[108]
  • Large Dnode[108]
  • Allocation classes (allowing specific high speed storage to be designated for metadata and deduplication tables)[108]
  • Parallel pool mounting[108]
  • Per-vdev properties[108]
  • Deduplication enhancements – dedup-log (high speed deduplication), dedup table size limits, and deduplication table preloading (loaded fully at one time rather than piecemeal as needed), listed as "nice to have" in 2018, were all stated in April 2020 to be "coming along nicely" or largely complete[109]

See also

References

External links