{"id":31842,"date":"2018-07-21T07:30:52","date_gmt":"2018-07-21T07:30:52","guid":{"rendered":"https:\/\/ismartnetwork.co.uk\/?p=31842"},"modified":"2018-07-21T07:30:52","modified_gmt":"2018-07-21T07:30:52","slug":"4-questions-to-ask-before-you-start-a-machine-learning-project","status":"publish","type":"post","link":"https:\/\/ismartnetwork.co.uk\/?p=31842","title":{"rendered":"4 Questions to Ask Before You Start a Machine Learning Project"},"content":{"rendered":"<p><img decoding=\"async\" src=\"http:\/\/bdcstatic.business.com\/images\/content\/5b4\/3996d1039eb645a8b45e0\/280-280-\" width=\"280\" align=\"left\"><\/p>\n<p>AI and machine learning are making a significant impact on multiple industries and changing the landscape of our society. These are not just hot trends; they are here to stay.<\/p>\n<p>Still, machine learning is not a magical solution that applies to every single use case. So often companies embark on an AI development journey without a clear understanding of the value it should bring to their business. As a result, many data science and machine learning projects don\u2019t have clear KPIs and simply drain R&amp;D budgets.<\/p>\n<p>That\u2019s why managers have to ask themselves four key questions to justify the need for machine learning development.<\/p>\n<p>1. Do you really need machine learning?<\/p>\n<p>Machine learning projects are costly and take up much time. Additionally, the estimates for AI R&amp;D projects are often quite vague and unrealistic. Most importantly, not all companies need machine learning in the first place. For instance, we argue in\u00a0<a href=\"https:\/\/www.n-ix.com\/machine-learning-in-finance-why-what-how\" target=\"_blank\">this article<\/a>\u00a0that solid data engineering alone can be enough to find invaluable business insights for companies across numerous industries.<\/p>\n<p>The number of AI engineers is also an enormous challenge, as talent availability is a significant constraint across the globe. Machine learning still have certain limitations, and it currently doesn\u2019t fit into every business case of every domain. According to <a href=\"https:\/\/techcrunch.com\/2016\/10\/23\/wtf-is-machine-learning\/\" target=\"_blank\">TechCrunch<\/a>, one can split all machine learning use cases into two categories:<\/p>\n<p>Classification models are used to break down large datasets into meaningful subsets. The examples would be image recognition and natural language processing.<\/p>\n<p>Regression models identify trends to make predictions. The use cases would be sales forecasts that take into account thousands of factors from macroeconomic indicators to weather forecasts to political threats.<\/p>\n<p>Research institutions and tech companies have made massive progress in certain areas of machine learning, including computer vision, speech recognition, and <a href=\"https:\/\/en.wikipedia.org\/wiki\/Natural_language_processing\" target=\"_blank\">natural language processing<\/a>. Still, this technology is not a silver bullet. As of 2018, you cannot apply machine learning to every business case you might have in mind.<\/p>\n<p>For instance, a computer vision engine that can identify a particular bottle in a row of identical bottles can prove to be too costly. It\u2019s only logical to have a small team of data scientists investigate the use case before proceeding with development.<\/p>\n<p>2. What type of machine learning do you need?<\/p>\n<p>Currently, there are three major types of machine learning: supervised, unsupervised, and reinforcement learning. Let\u2019s check out the use cases for each one of them.<\/p>\n<p>Supervised<\/p>\n<p>Nearly 90\u00a0percent of current machine learning development projects deal with supervised learning.<\/p>\n<p>You have an input data X and a target variable Y that you want to predict.<\/p>\n<p>For instance an X could be parameters\u00a0that describe a person like gender, age and personal preferences. Looking at this input data, you want to predict Y, how likely the person is to click your marketing ad in Facebook.<\/p>\n<p>This technique is valid when you\u2019ve got some big datasets of customer information and historical records that reveal who clicked your ads in the past. A supervised machine learning model analyzes that input data to find patterns and predict what demographic groups are most likely to click your ad.<\/p>\n<p>Other use cases for supervised learning would be credit scoring, underwriting, equipment failure detection, and more.<\/p>\n<p>Unsupervised<\/p>\n<p>With unsupervised learning, there\u2019s just input data X and no target variables. Machine learning models then groups input data according to its reasoning. AI algorithms work through huge datasets and often find patterns and dependencies that humans cannot identify.<\/p>\n<p>This technique is often used for marketing clustering. For instance, we can take the input data from the example above, and let the AI engine group people according to demographics and personal interests.<\/p>\n<p>Reinforcement learning<\/p>\n<p>With reinforcement learning, data scientists specify the rules of the \u201cgame\u201d, the environment where the \u201cgame\u201d takes place, and the final reward (in chess analogy, that would be the victory). As machine learning algorithms start \u201cplaying the game\u201d, they try different strategies and learn from their previous experience to maximize the final reward. One of the most famous examples of reinforcement learning is Google\u2019s <a href=\"https:\/\/deepmind.com\/research\/alphago\/\" target=\"_blank\">AlphaGo<\/a>.<\/p>\n<p>Deep learning<\/p>\n<p>Deep learning, a technique that utilizes artificial neural networks, is applicable to all three machine learning types, but is most often used in supervised learning. Deep learning is excellent at classifying objects based on their features. For instance, it can be used to categorize pictures of cats and dogs with high precision.\u00a0<\/p>\n<p>Deep learning is behind Facebook\u2019s Face Recognition technology, which is 99\u00a0percent accurate. The same technology powers advanced natural language processing (NLP), image and speech recognition software, which can be used in\u00a0document processing (e.g., legal documents), sentiment analysis and word-processing software.<\/p>\n<p>3. Are you ready for extensive data engineering?<\/p>\n<p>Machine learning and data science depend heavily on data engineering. Before going for data science, you need to extract data from fragmented sources, transform it into usable datasets, and load it to the AI engine. The bad news is these tasks often cannot be automated. Different sources have unique types of data, so adjusting them requires a lot of manual work.<\/p>\n<p>Even after the data has been extracted, transformed and loaded, it might not be good enough for data science. So next step is to clean the dataset by removing noisy data and adding any missing entries. ETL (extract, transform, and load) and data cleaning usually take up about 80 percent of the project\u2019s time.<\/p>\n<p>4. Do you need custom development, API software, or startup acquisition?<\/p>\n<p>There are three strategies for companies to adopt machine learning.<\/p>\n<p>1. Build a machine learning solution from scratch. This is probably the riskiest option, as only an estimated 10 percent of machine learning R&amp;D projects succeed. It is still the most viable option for some narrow machine learning cases in specific domains.<\/p>\n<p>2.\u00a0Explore machine learning with cloud engines from Google, Amazon and the like. This is the easiest way to gain access to machine learning technology. On the downside, you cannot freely configure system parameters.\u00a0For instance, Amazon uses only logistic regression models, so is practically useless if you need to use different models for a particular project. That means, more sophisticated machine learning projects require custom solutions development. Furthermore, 80\u00a0percent of machine learning development is still about big data engineering. This is something you cannot delegate to Amazon.<\/p>\n<p>3.\u00a0Buy a machine learning startup. This is the most expensive option that suits only big companies.<\/p>\n<p>Data science and machine learning often produce unexpected results and give invaluable insights. This technology is here to stay, and it\u2019s going to evolve at an extremely fast pace. Answering the above questions will help you start your machine learning development journey.<\/p>\n<div style=\"clear:both;\"><\/div>\n<p>Read more: <a href=\"https:\/\/www.business.com\/articles\/machine-learning-project-questions\/?utm_medium=referred&#038;utm_source=rss&#038;utm_campaign=rss_site\">business.com<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>AI and machine learning are making a significant impact on multiple industries and changing the landscape of our society. These are not just hot trends; they are here to stay. Still, machine learning is not a magical solution that applies to every single use case. So often companies embark on an AI development journey without &hellip;<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"footnotes":""},"categories":[1],"tags":[11408,46,3910],"class_list":["post-31842","post","type-post","status-publish","format-standard","","category-uncategorized","tag-computers","tag-software","tag-technology"],"jetpack_featured_media_url":"","_links":{"self":[{"href":"https:\/\/ismartnetwork.co.uk\/index.php?rest_route=\/wp\/v2\/posts\/31842","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/ismartnetwork.co.uk\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/ismartnetwork.co.uk\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/ismartnetwork.co.uk\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/ismartnetwork.co.uk\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=31842"}],"version-history":[{"count":0,"href":"https:\/\/ismartnetwork.co.uk\/index.php?rest_route=\/wp\/v2\/posts\/31842\/revisions"}],"wp:attachment":[{"href":"https:\/\/ismartnetwork.co.uk\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=31842"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/ismartnetwork.co.uk\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=31842"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/ismartnetwork.co.uk\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=31842"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}