Linux FailSafe 1.0
http://oss.sgi.com/projects/failsafe/
====================================
Short term tasks (6-9 months)
-----------------------------
Scalability
Increase scalability to 8 nodes.
Testing with a large number of resources and resource groups.
Infrastructure enhancements
Support for volume managers (such as LVM)
Support for network mirroring (such as DRBD)
Testing shared SCSI solutions
Testing storage area network (SAN) solutions
Support for reset devices using the STONITH API
API
Develop application (FailSafe resource type) API
Develop cluster API (Group communication services layer interface)
Integration with other products
Integration with IPVS to provide a load-balancing solution
Integration with monitoring tools such as "mon"
Usability enhancements (GUI and command line interfaces)
Provide interim status of long-running tasks (such as creating a
cluster or stopping HA services in the cluster)
Improve error messages and logging mechanisms
Generate cluster_mgr CLI scripts for a configuration database
Generate cluster_mgr CLI scripts for a set of GUI tasks
Execute cluster_mgr CLI scripts from the GUI
Develop GUI plugin model for resource types
Develop GUI plugin model for reset mechanisms
FailSafe applications
Oracle, DB2 database agents
Documentation
Improve FailSafe administration manual
Improve FailSafe programmer's manual
Develop FailSafe developer's manual (document FailSafe internals)
Long term roadmap (more than 6-9 months)
---------------------------------------
Scalability
Scalability beyond 8 nodes. Improve protocol scalability.
Infrastructure enhancements
Configuration database based on MySQL or some other database
Common heartbeat service that can be used by configuration
database daemon (cdbd) and cluster membership (ha_cmsd)
Better I/O fencing mechanism (GRITS API from Oracle)
Develop a 64-bit version of FailSafe for 64-bit platforms
Integration with other products
Integration with DLM using the cluster API
Integration with cluster filesystems (GFS, CXFS)
Support for Oracle Parallel Server using the cluster API
Testing FailSafe on non-i386 platforms (Alpha, UltraSPARC, IA-64)
Security
Encryption and authentication of cluster messages
Disaster recovery
Geographic clustering and failover