Every day, large organizations are updated with the technologies that facilitate and adapt better to each company, facing great challenges that allow them to discover and analyze, in addition to the tools that are used on a daily basis, it is for them that what was created was created. is known as Big Data or in Spanish massive data, which are large-scale data storage systems.
This storage phenomenon is framed in the new information and communication technologies. Big Data is the one that occupies all the activities related to the systems that store a large set of data.
One of the main features is that it handles a lot of information, collecting, classifying and storing it. The purpose of this collection is to create statistical reports for use by organizations, whether as analysis of business plans, advertising, espionage, among others.
The storage margin has grown over the years, since 2008 the storage level has been measured in petabytes to zetabytes of data. Specialists periodically look for new storage measures because there are certain areas where large amounts of data need to be stored and existing programs are not very suitable.
There are thousands of tools to perform and manage Big Data, but not all are the same, there are three types of Data, which are:Structured Data: are those in which the data have a very particular structure, such as dates, numbers, among others. An example of them is spreadsheets. Unstructured data: it is usually data that has a specific format and cannot be stored in a spreadsheet, much less manipulate the information, such as PDF documents. Semi-structured data: this type of data does not have a particular format, as it has its own semi-structured metadata, an example of which is HTML codes.