Saturday, July 2, 2022
Techiexpert.com

How Big Data Has Created A Big Crisis In Science

by Srikanth
March 14, 2019
in Big Data
Reading Time: 3 mins read

Scholars in many areas of science are increasingly concerned that well-known published results cannot be reproduced, and the problem can be severe. In 2011, for example, Bayer HealthCare reviewed 67 of its projects and found that fewer than 25 percent could be replicated; about two-thirds of the projects showed major inconsistencies. A more recent investigation, reported in November, found that only about half of the studies it examined could be replicated. Other fields, such as economics and medicine, have reported similar findings. These striking results have badly damaged the credibility of scientists.

Many factors contribute to this problem, and big data is one of them. From a statistical point of view, there is a huge difference in the way scientific discoveries are made in the big data era. The reproducibility crisis is driven in part by invalid statistical analyses of data-driven hypotheses, which is the opposite of the way things were traditionally done.

Scientific Method:

In the traditional scientific method, statisticians and scientists work together. The scientist first conducts an experiment to collect data, and the statistician then analyzes the data that was collected. A famous example is the “lady tasting tea” story. In the 1920s, among a group of academics, a woman claimed that she could tell by taste whether the tea or the milk had been added to the cup first. Ronald Fisher doubted that she could. He built a model based on a probability distribution called the hypergeometric distribution. He hypothesized that, out of eight cups of tea, four prepared with milk added first and four with tea added first, the number of her correct guesses would follow that distribution if she were guessing at random.

To run the experiment, the eight cups of tea were presented to the lady in random order. To everyone's surprise, she identified all eight cups correctly, which was strong evidence against Fisher's hypothesis: the chance of getting every cup right by random guessing is only about 1.4 percent. That sequence, in which a hypothesis is stated first, then data is gathered, and then the data is analyzed, is rare in the era of big data. Today's technology can collect a huge amount of data, on the order of 2.5 exabytes a day.
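The 1.4 percent figure can be checked directly from the hypergeometric model. The sketch below assumes the lady knows that exactly four cups are milk-first and names four cups; the function and parameter names are illustrative and do not come from the article.

```python
from math import comb

def p_exact_correct(k, n_cups=8, n_milk_first=4):
    """Probability of naming exactly k of the milk-first cups by pure guessing.

    Assumes the guesser picks n_milk_first cups out of n_cups at random,
    which is the hypergeometric setup of the tea-tasting experiment.
    """
    ways = comb(n_milk_first, k) * comb(n_cups - n_milk_first, n_milk_first - k)
    return ways / comb(n_cups, n_milk_first)

# Only one of the C(8, 4) = 70 possible selections is entirely correct.
p_all_correct = p_exact_correct(4)
print(f"{p_all_correct:.4f}")  # 1/70, i.e. about 1.4 percent
```

Getting all four milk-first cups right automatically gets the four tea-first cups right as well, so "all eight correct" corresponds to `k = 4`.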

Science, however, often develops much more slowly than data collection, so researchers may not know the right hypothesis to state when they analyze the data. For example, scientists can now collect tens of thousands of gene expressions from people, but it is very difficult to decide whether a particular gene should be included in or excluded from the hypothesis. In this case, it is tempting to form the hypothesis based on the data. While such hypotheses may seem compelling, conventional inferences drawn from them are generally invalid. This is because, compared with the lady tasting tea process, the order of forming the hypothesis and seeing the data has been reversed.

Problems with Data:

To see the problem in a big data setting, consider 100 ladies tasting tea. Suppose none of the 100 can actually tell the difference, and each simply guesses after tasting all eight cups. There is still about a 75.6 percent chance that at least one of them would luckily guess every cup correctly. A scientist who sees only the data for that one lady may be surprised by her perfect score and conclude that she can tell the difference.
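The chance that at least one of the 100 guessers is lucky follows from the single-lady probability, assuming the ladies guess independently. The short sketch below uses illustrative names; it shows that the quoted 75.6 percent corresponds to the rounded single-lady figure of 1.4 percent, while the exact value 1/70 gives a slightly higher number.

```python
from math import comb

def p_at_least_one_lucky(n_ladies, p_single):
    """Chance that at least one of n_ladies independent guessers is perfect."""
    return 1 - (1 - p_single) ** n_ladies

# Exact single-lady probability: 1 / C(8, 4) = 1/70.
exact = p_at_least_one_lucky(100, 1 / comb(8, 4))
# Rounded single-lady probability of 1.4 percent, as quoted in the text.
rounded = p_at_least_one_lucky(100, 0.014)

print(f"exact: {exact:.3f}, rounded: {rounded:.3f}")  # about 0.763 and 0.756
```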

But the result is not reproducible. If the same lady repeats the experiment, she will very likely sort the cups wrongly, because she got lucky the first time and cannot really tell whether the tea or the milk was added first. This small example illustrates how scientists can “luckily” see interesting but spurious signals in a dataset. They may formulate hypotheses after seeing these signals, then use the same dataset to draw conclusions, claiming that the signals are real. It may take a while before they discover that their conclusions cannot be reproduced. This problem is especially common in big data analysis because, with so much data, some spurious signals will occur just by chance. Worse, a scientist may deliberately exploit this process to produce the most publishable result.
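A small simulation makes the failure to replicate concrete. It is a sketch under stated assumptions: every lady guesses at random, the cup labels and the pool of 20,000 ladies are arbitrary choices made here so that enough lucky "winners" appear, and none of the names come from the article.

```python
import random

# Cups 0-3 are the milk-first cups; the labelling is arbitrary.
MILK_FIRST = frozenset(range(4))

def guesses_all_correctly(rng):
    """One lady with no real ability picks four of the eight cups at random."""
    picked = rng.sample(range(8), 4)
    return frozenset(picked) == MILK_FIRST

def screen_then_retest(n_ladies, seed=0):
    """Count perfect scores in a first round, then retest only those ladies."""
    rng = random.Random(seed)
    winners = sum(guesses_all_correctly(rng) for _ in range(n_ladies))
    repeat_winners = sum(guesses_all_correctly(rng) for _ in range(winners))
    return winners, repeat_winners

winners, repeat_winners = screen_then_retest(20_000)
print(f"perfect in round one: {winners}, perfect again on retest: {repeat_winners}")
```

Each first-round winner has the same 1-in-70 chance on the retest as anyone else, so only a small fraction of them are expected to repeat their perfect score.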

Strong Analysis:

How can scientists avoid these problems and achieve reproducible results? The answer is to be more careful. Statisticians need to design new procedures that provide valid inferences. If scientists want reproducible results from data-driven hypotheses, they need to account carefully for the data-driven process in their analysis.
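One standard way to account for having looked at many candidates before choosing a hypothesis is a multiple-testing correction. The sketch below applies the Bonferroni correction to the 100-ladies example; it is only an illustration of the idea, not a procedure the article prescribes, and the 0.05 significance level and function name are assumptions made here.

```python
from math import comb

def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold under the Bonferroni correction."""
    return alpha / n_tests

# p-value of a perfect score under random guessing: 1 / C(8, 4) = 1/70.
p_perfect = 1 / comb(8, 4)

# A single lady tested on her own: a perfect score is significant at 0.05.
significant_alone = p_perfect < 0.05

# The best of 100 ladies: the corrected threshold is 0.05 / 100 = 0.0005,
# so the same perfect score is no longer significant.
significant_among_100 = p_perfect < bonferroni_threshold(0.05, 100)

print(significant_alone, significant_among_100)  # True False
```

With only eight cups, no score can clear the corrected threshold, which matches the intuition that a single perfect score picked out of 100 guessers is weak evidence.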

Statistics is about the optimal way to extract information from data. By its nature, it is a field that evolves as data evolves, and the problems of the big data era are just one example of that evolution. They are also an opportunity to develop new statistical techniques that produce scientific discoveries that are both interesting and valid.

Tags: BigData analytics
© 2022 All Rights Reserved
