Transparent fault-tolerant network services using off-the-shelf components

Transparent fault-tolerant network services using off-the-shelf components(2005)

引用 23|浏览5
暂无评分
摘要
The growth of the Internet has led to the development of critical network services where erroneous processing or outages are unacceptable. The availability and reliability of services such as online banking, stock trading, reservation processing, and online shopping, have become increasingly important as their popularity grows. Downtime and failures lead to unsatisfied customers and translate directly into lost revenue for the service providers. Fault-tolerance techniques use redundant components and/or redundant processing to ensure continued correct operation despite component failures. Most existing fault-tolerance solutions for network services do not provide fault-tolerance for active connections at failure time, expect servers to be deterministic, or require changes to the clients. These limitations are unacceptable for many current and future network service applications. We propose a methodology for providing fault-tolerance without the limitations mentioned above. Our solution, based on a standby backup approach, is transparent to the clients and requires minimal changes to the server OS and application. We have used our methodology to add fault-tolerance features to two popular types of network services---web service and video conferencing. Off-the-shelf hardware and software components were used as the basis for both implementations. Modifications to the OS network stack using Linux kernel modules provide fault-tolerance at the connection level. At the application level, modifications to the web server and multi-conferencing unit, respectively, provide application-level synchronization and allow handling of non-deterministic server behavior. The associated issues, challenges, and tradeoffs of our methodology are presented in this work. The evaluation of our prototype implementations shows that client-transparent fault-tolerance can be achieved with relatively low overheads.
更多
查看译文
关键词
OS network,transparent fault-tolerant network service,off-the-shelf component,existing fault-tolerance solution,fault-tolerance feature,non-deterministic server behavior,client-transparent fault-tolerance,network service,future network service application,critical network service,erroneous processing,redundant processing
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要