ASSIGNMENT: 1
TOPIC: BIG DATA
HITESHKUMAR 2K15/MC/027

A proper definition of “big data” is difficult to achieve because projects, vendors, developers, and business professionals use it quite differently.
With these things in mind, generally speaking, big data is:
· A collection of large datasets, or
· A category of computing strategies and technologies that are used to handle large datasets.

Here, a “large dataset” means a dataset too large to reasonably process on a single computer or store with traditional tooling. This means that the common scale of big datasets is constantly shifting and may vary significantly from organization to organization.

Big data is also a term used to describe data that is high volume, high velocity, and/or high variety; that requires new technologies and techniques to capture, store, and analyze it; and that is used to enhance decision making, provide insight and discovery, and support and optimize processes.

One common use of big data is to better understand customers and their behaviors and preferences. Companies are keen to expand their traditional datasets with social media data, browser logs, text analytics, and sensor data to get a more complete picture of their customers.
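The idea that a large dataset cannot be handled in one piece can be illustrated with a small sketch. The Python code below is only an illustration under stated assumptions, not a description of any particular big data tool: the record format ("user,event") and the helper names `read_in_chunks` and `count_events` are hypothetical. It processes records in fixed-size chunks and merges partial results, which is the basic pattern many big data systems apply across many machines.

```python
from collections import Counter
from typing import Iterable, Iterator, List


def read_in_chunks(records: Iterable[str], chunk_size: int) -> Iterator[List[str]]:
    """Yield successive lists of at most chunk_size records."""
    chunk: List[str] = []
    for record in records:
        chunk.append(record)
        if len(chunk) == chunk_size:
            yield chunk
            chunk = []
    if chunk:
        # Emit the final, possibly smaller, chunk.
        yield chunk


def count_events(records: Iterable[str], chunk_size: int = 1000) -> Counter:
    """Count event types chunk by chunk, merging the partial counts."""
    totals: Counter = Counter()
    for chunk in read_in_chunks(records, chunk_size):
        # Assumed record format: "user,event" (hypothetical).
        partial = Counter(line.split(",")[1] for line in chunk)
        totals.update(partial)
    return totals


if __name__ == "__main__":
    sample = ["u1,photo", "u2,comment", "u1,photo", "u3,message", "u2,photo"]
    print(count_events(sample, chunk_size=2))
```

Because only one chunk is held in memory at a time, the same function works whether `records` is a short list or a file object with many lines.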
Big Data Sources. Big data sources are repositories of large volumes of data. … This brings more information to users’ applications without requiring that the data be held in a single repository or cloud vendor proprietary data store.
Examples of big data sources are Amazon Redshift, HP Vertica, and MongoDB.

The general consensus of the day is that there are specific attributes that define big data. In most big data circles, these are called the four V’s: volume, variety, velocity, and veracity. (You might consider a fifth V, value.)

Big data analytics technology is especially important to health care. By analyzing large amounts of information, both structured and unstructured, quickly, health care providers can provide lifesaving diagnoses or treatment options almost immediately.
Big data tools: Talend Open Studio. Talend offers an Eclipse-based IDE for stringing together data processing jobs with Hadoop. Its tools are designed to help with data integration, data quality, and data management, all with subroutines tuned to these jobs.
So, ‘Big Data’ is also data, but of a huge size. ‘Big Data’ is a term used to describe a collection of data that is huge in size and yet growing exponentially with time. In short, such data is so large and complex that none of the traditional data management tools are able to store it or process it efficiently.

Statistics show that 500+ terabytes of new data are ingested into the databases of the social media site Facebook every day.
This data is mainly generated through photo and video uploads, message exchanges, comments, etc.

Benefits of Using Big Data Analytics
· Identifying the root causes of failures and issues in real time.
· Fully understanding the potential of data-driven marketing.
· Generating customer offers based on their buying habits.
· Improving customer engagement and increasing customer loyalty.
· Reevaluating risk portfolios quickly.