Today's Question:  What does your personal desk look like?        GIVE A SHOUT

 BIG DATA


  Redis Cluster and Common Partition Techniques in Distributed Cache

In this post, I will discuss a few common partition techniques in distributed cache. Especially, I will elaborate on my understanding on the use of Redis Cluster.Please understand that at the time of writing, the latest version of Redis is 4.0.10. Many articles on the same topic have a different idea from this post. This is mainly because, those articles are probably outdated. In particular, they may refer to the Redis Cluster implementation in Redis 3. Redis Cluster has been improved a lot since Redis 4.Common Partition TechniquesHere, we refer to horizontal partitioning, which...

5,478 0       REDIS DISTRIBUTED CACHE CLOUD COMPUTING


  Hadoop or Spark: Which One is Better?

What is Hadoop?Hadoop is one of the widely used Apache-based frameworks for big data analysis. It allows distributed processing of large data set over the computer clusters. Its scalable feature leverages the power of one to thousands of system for computing and storage purpose. A complete Hadoop framework comprised of various modules such as:Hadoop Yet Another Resource Negotiator (YARNMapReduce (Distributed processing engine)Hadoop Distributed File System (HDFS)Hadoop CommonThese four modules lie in the heart of the core Hadoop framework. There are many more modules available over the interne...

2,467 0       COMPARISON HADOOP SPARK


  Why Most of us Get Confuse With Data Quality Solutions and Bad Data?

How to fix this misunderstanding is what Big Data professionals will explain in this post.The C-level executives are using data collected by their BI and analytics initiatives to make strategic decisions to offer the company a competitive advantage. The case gets worse if the data is inaccurate or incorrect. It’s because the big data helps the company to make big bets, and it impacts the direction and future together. Bad Data can yield inappropriate results and losses.Some interesting facts and statistics about big data, data warehousing, and data quality-90% of US companies are applyin...

1,309 0       BIGDATA


  Data Scientists and Their Harder Skills than Big Data

The field of data science is often confused with that of big data. Data science is an aid to decision makers in a company with a logical approach. Who is a Data Scientist? A Data Scientist reviews a huge collection of data(that may extend to a couple of terabytes of disk space or thousands of excel sheets). This humongous chunk of data is not feasible for being handled, sorted and analyzed by a single person.Here we require the help of data science, and most recently, the field of Artificial Intelligence has gained considerate limelight. With the use of efficient algorithms we can so...

3,296 0       BIG DATA


  How Cloud Computing is changing the Face of Business

The world of information is getting bigger and bigger and so does the need for cloud computing is felt broadly across various industries and platforms. The ever growing popularity and adoption are due to the fact that cloud computing is efficient, reliable and secure than any other business model. However, the way cloud computing is adopted across different enterprises may vary.How cloud computing has been adopted worldwide by companies- let us have a look at few statistics that would blow your mind.·         The cloud market valued $148 billion i...

2,832 0       CLOUD COMPUTING SOFTWARE DEVELOPMENT CLOUD SERVICES CLOUD SOLUTION SOFTWARE SOLUTIONS


  How Google Utilizes Big Data for SERP

Google is an expert when it comes to big data. This is evident in their development of various techniques and open source tools which are used by the big data industry professionals. These tools and technique allow Google to sift through millions of different websites and enormous amounts of data in order to provide users with correct answers in a matter of milliseconds. But how does Google accomplish that with such precision? To answer that, we need to focus on the complex activities that go on behind every search query.Entering the search queryGoogle has always wanted to make a search engine...

2,240 0       GOOGLE BIG DATA


  Cleansing data with Pig and storing JSON format to HBase with Pig UDF

IntroductionThis post will explain you the way to clean data and store JSON format to HBase. Hadoop architect experts also explain Apache Pig and its advantages in Hadoop in this post. Read more and find out how they do it.This post contains steps to do some basic clean the duplication data and convert the data to JSON format to store to HBase. Actually, we have some built-in lib to parse JSON in Pig but it is important to manipulate the JSON data in Java code before store to HBase.Apache Pig is data flow language and is built on the top of Hadoop, it helps to process, extract, loading, cleans...

9,010 3       JSON HADOOP ARCHITECT APACHE HBASE PIG UDF


  IBM acquires Ustream to propel its cloud business

On January 21, 2016, IBM acquired Ustream, a leading live and on-demand video solution company, to propel its cloud service business. This acquisition will make IBM capable of providing enterprise live video stream service to the world. With this, a new member joining the IBM cloud service family.Ustream provides cloud-based video streaming to enterprises and broadcasters for everything from corporate keynotes to live music concerts. The company streams live and on-demand video to about 80 million viewers per month for customers such as NASA, Samsung, Facebook, Nike, HBO and The Discovery Chan...

1,561 0       IBM CLOUD IBM CLOUD USTREAM