Sunday, June 18, 2017

Big Data & Machine learning

   
       Today we learned about Big Data and machine learning. Mr Uthayasangar (a lecturer at the University of Moratuwa) taught us about communication, machine learning, and Big Data.

💥Machine learning




             Machine learning focuses on the development of computer programs that can change when exposed to new data. The process of machine learning is similar to that of data mining. In machine learning, computers apply statistical learning techniques to automatically identify patterns in data. These patterns can then be used to make predictions about new data, often with high accuracy.

 💥What are the different types of machine learning?

     The following commonly used algorithms can be applied to almost any data problem:
  • Linear Regression.
  • Logistic Regression.
  • Decision Tree.
  • SVM (Support Vector Machine).
  • Naive Bayes.
  • KNN (k-Nearest Neighbors).
  • K-Means.
  • Random Forest.
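
To make the first algorithm in the list concrete, here is a minimal sketch of linear regression in plain Python. The data (hours studied against exam score) and the function name `fit_line` are made up for illustration; in practice a library would normally do this work.

```python
def fit_line(xs, ys):
    """Least-squares fit of a straight line y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance of x and y, and variance of x (both without the 1/n factor,
    # which cancels in the ratio below).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Made-up training data: hours studied and the resulting exam score.
hours = [1, 2, 3, 4, 5]
scores = [50, 60, 70, 80, 90]

slope, intercept = fit_line(hours, scores)

# Use the learned line to predict the score for an unseen input (6 hours).
predicted = slope * 6 + intercept
print(slope, intercept, predicted)
```

The program is never told the rule behind the scores; it recovers the slope and intercept from the examples and then applies them to a new input.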

    💥A categorization of machine learning tasks arises when one considers the desired output of a machine-learned system:


  • In classification, inputs are divided into two or more classes, and the learner must produce a model that assigns unseen inputs to one or more of these classes. This is typically tackled in a supervised way. Spam filtering is an example of classification, where the inputs are email (or other) messages and the classes are "spam" and "not spam".

  • In regression, also a supervised problem, the outputs are continuous rather than discrete.

  • In clustering, a set of inputs is to be divided into groups. Unlike in classification, the groups are not known beforehand, making this typically an unsupervised task.
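
The spam example above can be sketched with KNN, one of the algorithms listed earlier. This is only a toy illustration: the two features (number of links and number of exclamation marks in a message) and all of the training data are assumptions invented for the example, not a real spam filter.

```python
import math
from collections import Counter


def knn_predict(train, query, k=3):
    """Return the majority label among the k training points nearest to query.

    train is a list of (features, label) pairs; features and query are
    tuples of numbers of the same length.
    """
    nearest = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical labelled messages: (links, exclamation marks) -> class.
training_messages = [
    ((8, 6), "spam"),
    ((7, 9), "spam"),
    ((9, 7), "spam"),
    ((0, 1), "not spam"),
    ((1, 0), "not spam"),
    ((2, 1), "not spam"),
]

# Classify two unseen messages.
first = knn_predict(training_messages, (6, 8))
second = knn_predict(training_messages, (1, 1))
print(first, second)
```

This is supervised learning because every training message comes with its class. A clustering algorithm such as K-Means would receive the same points without the labels and would have to find the two groups on its own.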

💥What is meant by machine learning algorithms?
   Evolved from the study of pattern recognition and computational learning theory in artificial intelligence, machine learning explores the study and construction of algorithms that can learn from and make predictions on data. Instead of following strictly static program instructions, such algorithms make data-driven predictions or decisions.


💥 BIG DATA
Data that is very large in size is called Big Data. Normally we work with data measured in megabytes (Word documents, Excel sheets) or at most gigabytes (movies, code), but data on the scale of petabytes (1 PB = 10^15 bytes) is called Big Data.
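
To get a feel for these sizes, here is a small sketch that formats byte counts with decimal units (powers of 1000, matching the 10^15 figure above). The helper name `human_size` is made up for this example.

```python
UNITS = ["B", "KB", "MB", "GB", "TB", "PB"]


def human_size(num_bytes):
    """Format a byte count using decimal (power-of-1000) units up to PB."""
    size = float(num_bytes)
    for unit in UNITS:
        # Stop at the first unit where the number is below 1000,
        # or at the largest unit we know about.
        if size < 1000 or unit == UNITS[-1]:
            return f"{size:g} {unit}"
        size /= 1000


document = human_size(5 * 10**6)   # a 5 MB office document
petabyte = human_size(10**15)      # the Big Data scale mentioned above

# How many 1 GB movies fit into one petabyte?
movies_per_petabyte = 10**15 // 10**9

print(document, petabyte, movies_per_petabyte)
```

One petabyte holds a million files of one gigabyte each, which is why such data no longer fits on a single ordinary computer.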

💥Where does this data come from?

This data comes from many sources, such as:

  • Social networking sites: Facebook, Google, and LinkedIn all generate huge amounts of data every day, as they have billions of users worldwide.

  • E-commerce sites: Sites like Amazon, Flipkart, and Alibaba generate huge amounts of logs from which users' buying trends can be traced.

  • Weather stations: Weather stations and satellites produce huge amounts of data, which are stored and processed to forecast the weather.

  • Telecom companies: Telecom giants like Airtel and Vodafone study user trends and publish their plans accordingly, and for this they store the data of their millions of users.

  • Share market: Stock exchanges across the world generate huge amounts of data through their daily transactions.

💥Hadoop
Hadoop is an open-source framework provided by Apache to process and analyze very large volumes of data. It is written in Java and is used by companies such as Google, Facebook, LinkedIn, Yahoo, and Twitter.
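
Hadoop's classic processing model is MapReduce: a map step turns each input record into key/value pairs, the pairs are grouped by key, and a reduce step combines the values for each key. The sketch below only imitates that idea in plain Python on one machine. It does not use the Hadoop API, and the function names and sample lines are made up for illustration.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle: group all emitted values by their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: combine all values for one key into a single count."""
    return key, sum(values)


# Made-up input; on a real cluster these lines would be spread over many machines.
log_lines = ["big data needs big tools", "hadoop processes big data"]

mapped = [pair for line in log_lines for pair in map_phase(line)]
counts = dict(reduce_phase(key, values)
              for key, values in shuffle_phase(mapped).items())
print(counts)
```

Because each line is mapped independently and each key is reduced independently, the work can be split across many computers, which is what makes this model suitable for very large data.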
💥Communication



Communication is the act of expressing (or transmitting) ideas, information, knowledge, thoughts, and feelings, as well as understanding what is expressed by others.
💥How to Develop Good Communication Skills
  

💥How do you communicate well with others?

  1. Pause before responding. 
  2. Be trustworthy and honest. 
  3. Don't rush communication.
  4. Adapt your ideas to others. 
  5. Stay in the moment. 
  6. Pay attention to non-verbal cues. 
  7. Intend to understand. 
  8. Be patient and open-minded.
