In the simplest terms, Kafka is an open-source distributed streaming platform that serves data continuously. It was developed at LinkedIn, is written in Java and Scala, and was donated to the Apache Software Foundation, where it became a top-level project in 2012. LinkedIn remains an active contributor.
Unlike batch data, which is typically historical data stored in databases, streaming data is generated continuously by multiple sources and is ordered in time. At LinkedIn, Kafka was initially created to simplify activity tracking and to gather logs and application information. That was back in 2011. So Kafka is not a new technology, but a proven one that has stood the test of time and improved over the years.