Apache Zookeeper
Apache ZooKeeper is a coordination service built to manage large distributed systems. It coordinates the activities of different hosts and the use of common data with robust synchronization techniques.
While presenting itself externally as a single service, ZooKeeper forms a cluster of multiple nodes and server instances. This so-called Zookeeper Ensemble organizes itself by choosing a master node which takes the lead in synchronizing the cluster and managing consistency.
In a cluster, ZooKeeper provides the following services:
- Naming service to identify and address nodes in a cluster
- Cluster management to add or remove individual nodes
- Synchronization service to manage the saving and changing of distributed data
- Redundancy service to ensure high availability of data and services despite individual node failures
- Information service to provide real-time node status information
- Configuration service to provide real-time node configuration data
- Procedure for appointing a Master Node
Originally developed by Yahoo, ZooKeeper became a sub-project of Hadoop at Apache before becoming a standalone project in 2008. Today, ZooKeeper is a kind of standard for organizing distributed services and is used by HBase, Hadoop and similar frameworks.
- Learn more
- Official website
Related articles
Internship in Big Data infrastructure with TDP
Categories: Infrastructure, Learning | Tags: Cyber Security, DevOps, Java, Hadoop, IaC, Internship, TDP
Job Description Big Data and distributed computing is at Adaltasā core. We support our partners in the deployment, maintenance and optimization of some of Franceās largest clusters. Adaltas is also anā¦
By Daniel HARTY
Oct 25, 2021
Lightweight containerization with Tupperware
Categories: Containers Orchestration, Open Source Summit Europe 2017, Infrastructure | Tags: Btrfs, LXD, Red Hat, Systemd, Zookeeper, Cloud, Consensus
In this article, I will present lightweight containerization set up by Facebook called Tupperware. What is Tupperware Tupperware is a homemade framework written and used internally at Facebookā¦
Nov 3, 2017
Advanced multi-tenant Hadoop and Zookeeper protection
Categories: Big Data, Infrastructure | Tags: DoS, iptables, Operation, Scalability, Zookeeper, Clustering, Consensus
Zookeeper is a critical component to Hadoopās high availability operation. The latter protects itself by limiting the number of maximum connections (maxConns = 400). However Zookeeper does not protectā¦
Jul 5, 2017
Composants for CDH and HDP
Categories: Big Data | Tags: Flume, Hortonworks, Hadoop, Hive, Oozie, Sqoop, Zookeeper, Cloudera, CDH, HDP
I was interested to compare the different components distributed by Cloudera and HortonWorks. This also gives us an idea of the versions packaged by the two distributions. At the time of this writtingā¦
By David WORMS
Sep 22, 2013