Skip to main content

Posts

Showing posts with the label Machine Learning

Why Evaluation Metrics Matters

This is a follow up article on "The Importance of F1 Score" in which we understood the technical aspects of evaluating a Machine Learning model. In this article we will understand how different evaluation metrics can help us in designing solutions based on the problem statement and domain. I will like to distinguish evaluation metrics with respect to the following criteria: When there is life involved : In case of aircraft or ships, where a Machine Learning algorithm misses to detect a failure can be a costly affair, since there is life involved. Also in the case of detecting cancer, a failure to predict a true positive can lead to delay in treatment, and hence can be life threatening. In cases such as detecting failures in aircrafts, even if we get false positives is acceptable, but not able to detect true positives can be too expensive in terms of loss of life. Thus it becomes important to use Recall as a measure, as any positive detection becomes important, even...

Understanding Projection Pursuit Regression

The following article gives an overview of the paper "Projection Pursuit Regression” published by Friedman J. H and Stuetzle W. You will need basic background of Machine Learning and Regression before understanding this article. The algorithms and images are taken from the paper. ( http://www.stat.washington.edu/courses/stat527/s13/readings/FriedmanStuetzle_JASA_1981.pdf )  What is Regression? Regression is a machine learning technology used to predict a response variable given multiple predictor variables or features. The main distinction is that the response to be predicted is any real value and not just any class or cluster name. Hence though similar to Classification in terms of making a prediction, it is largely different given what it’s predicting.  A simple to understand real world problem of regression would be predicting the sale price of a particular house based on it’s square footage, given that we have data of similar houses sold in that area ...

The Value of Data

Although, there is not a simple answer for what came first - Chicken or Egg?, in Machine Learning, there is an easy answer. Data came first before any function. Machine learning is all about learning from data. The learning algorithm tries to learn a function that can either classify the data into different categories, or learn the function itself that plots the data. There are two popular ways in which a Machine Learning algorithm can be taught to learn a function, but in both cases it needs data. Supervised Learning : We give the algorithm a lot of data with both input and output, and it learns the function. In case of regression problems, the function approximately plots the function that understands the data. In case of classification problems, the function tries to classify the data. Unsupervised Learning : We give the algorithm a lot of input data with no output, and it tries to find patterns in the data. The algorithm classifies the data based on the similarities ...

GPU - The brain of Artificial Intelligence

Machine Learning algorithms require tens and thousands of CPU based servers to train a model, which turns out to be an expensive activity. Machine Learning researchers and engineers are often faced with the problem of running their algorithms fast. Although initially invented for processing graphics in computer games, GPUs today are used in machine learning to perform feature detection from vast amount of unlabeled data. Compared to CPUs, GPUs take far less time to train models that perform classification and prediction. Characteristics of GPUs that make them ideal for machine learning Handle large datasets Needs far less data centre infrastructure Can be specialized for specific machine learning needs Perform vector computations faster than any known processor Designed to perform data parallel computation NVIDIA CUDA GPUs today are used to build deep learning image processing tools for  Adobe Creative Cloud. According to NVIDIA blog future Adobe appli...

Dynamics of Selecting your Open Source AI

The landscape of open source AI is big. To identify suitable open source tools to make your AI dream product is a herculean task. Selecting an AI toolkit for your product might turn out costly when you need to scale your software, thus it turns out to be a strategic decision. We at CereLabs have developed a criteria to choose Open Source AI Toolkit. Vision/ Reason for open source If you need to  trust an open source platform, you need to start with the vision statement with which the open source AI platform is launched. The  vision statement portrays the commitment of the company or community towards the toolkit.     Following are the visions of few of the reputed AI Open Source Platforms:       OpenCog : “OpenCog is a unique and ambitious open-source software project. Our aim is to create an open source framework for Artificial General Intelligence , intended to one day express general intelligence at the human level ...

Why Tensorflow

You have a lot of data which you want to make sense of, learn patterns, but you don't have the necessary expertice to develop algorithms that will learn through the data. Ofcourse you can develop your own machine learning algorithms to make sense of the data. There might be benifits in developing your own algorithms, proprietary being one, but you might have to invest time and money. What if you have access to ready made machine learning algorithms which you just have to use in your products? Google's Tensorflow offers such tried and tested algorithms using APIs that you just have to call in your programs. All you have to provide is data, and Tensorflow will take care of the intelligence to learn. Tensorflow adds the following capabilities to your products 1. Access to machine learning algorithms such as Neural Networks. 2. Increase performance of your models using multiple CPUs and GPUs without change in code. 3. Do numerical computations using data flow g...