Announcement of ‘measures to prevent a recurrence of the October outage’
Plan to upgrade to three linked data centers
Review of a remote DR center dedicated to KakaoTalk message transmission
High-intensity training in preparation for large-scale failures
Redundant infrastructure under construction at Ansan center
Namgoong Hoon: “Service stabilization is the top priority”
Kakao, which has analyzed the cause of the service failure triggered by the SK C&C Pangyo data center fire on October 15th, announced on the 7th that it would go beyond simple redundancy to triplexing, in which three data centers are linked, to address inadequate system redundancy, the core of the problem. It also put forward a large-scale overhaul plan that includes investing three times as much as before over the next five years and establishing a dedicated organization.
Kakao plans to design and build multiplexing across the entire system, and to designate and manage restoration priorities according to the importance of each service. Koh Woo-chan (Vice President of Kakao Enterprise), co-chairman of the Recurrence Prevention Committee of the Emergency Response Committee, said, “Over the next five years, we will invest more than three times the amount invested over the past five years in securing talent for service stabilization, developing technology, and implementing disaster recovery (DR) beyond triple redundancy.”
Kakao explained that once the DR system is upgraded to triple redundancy or more, redundancy will be maintained even if one data center is incapacitated. It is also reviewing a plan to build a remote DR data center dedicated to the KakaoTalk message transmission function.
Kakao also decided to recruit top information technology (IT) engineering experts in Korea and set up an IT engineering organization reporting directly to the CEO. In addition, it announced that it would create a disaster recovery committee to strengthen its ability to respond quickly to large-scale failures and would conduct intensive preparedness training.
Kakao explained that the data center in Ansan, Gyeonggi Province, which is being built with the goal of completion in 2024, will have redundant infrastructure for ‘24-hour uninterrupted operation’ in three areas: power, cooling, and communication. The battery room and the uninterruptible power supply (UPS) are separated by a fire barrier, and if a fire breaks out in the battery room, a triple extinguishing method is activated to prevent a situation similar to the one at the SK C&C Pangyo data center.
Kakao announced this plan at its annual developer conference, ‘if Kakao Dev 2022’, held on the same day. Namgoong Hoon (former Kakao CEO), who served as the keynote speaker, said, “I realized that Kakao’s top priority is to provide stable services,” and pledged to “always keep this in mind.”
Lee Hak-yeong (Greb CEO), an outside expert who chairs the Cause Investigation Subcommittee, cited insufficient redundancy between data centers and in service operation management tools as reasons for the delayed recovery at the time, along with a shortage of available resources and manpower after the switch to redundant systems. Subcommittee Chairman Lee explained that the lack of floor space in the data center required for redundancy was the most critical factor, and that there was no control tower to oversee recovery and response in the early stage of the incident.
Reporter Kim Min-seok