Category: ML / NLP

March 20, 2020 Greggory Elias No comments exist

10 Questions to Ask Before Starting a Machine Learning Project Over 80% of data science projects fail to go beyond testing and into production. If everyone is starting a machine learning project, where is it going wrong? Undoubtedly, ML solutions increase efficiencies for those who are in the business of gathering or analyzing large swaths…

January 30, 2020 Asya Sharrow No comments exist

The new wave in journalism – robot writers? So, how prevalent are robot reporters? What dangers do robot reports pose? What is the likelihood that the article you are reading wasn’t written by a human? A 2015 report stated the AP was generating about 3,000 articles a quarter, all related to financial and business reports….

December 5, 2019 Asya Sharrow No comments exist

Topic Modeling for Product Managers What is Topic Modeling? Topic modeling is a type of natural language processing (NLP) used to find “topics,” or commonly occurring words or groups of words, within a set of documents. Topic models are critical to product managers because they enable them to sort and analyze the huge amounts of…

November 11, 2019 Asya Sharrow No comments exist

10 Best Practices for Storing Labeled Data You just had your big idea. You read a lot, and you thought it would be interesting to have a classifier that labels a speaker’s tone and determines their political affiliation. How would you begin to break down the problem so that you can use machine learning to…

October 23, 2019 Asya Sharrow No comments exist

Are We Really Making Progress on Neural Recommendation Approaches? A summary of Maurizio Ferrari Dacrema, et al.’s Recent Article at RecSys 2019​ Neural Recommendation Algorithms Recommendation algorithms have become ubiquitous across commercial fields, from the Amazon “yourstore” splash page to Netflix’s matching % scores. Recommendation algorithms in essence filter large sets of data, i.e. song…

September 10, 2019 Asya Sharrow No comments exist

Searching for top 20% facts We all know that there’s an infinite amount of data on the internet for essentially any topic. Don’t believe me? Let’s test it. It took Google 0.46 seconds to give me 460 MILLION results for “multi-label text classification.” You get the point. So when we built the algorithm the enables…

August 26, 2019 Asya Sharrow No comments exist

Real-time production models – How do they differ from benchmark tests? 1. What are Real-Time Production Models and Benchmark Tests? Real-time production models are models that enable users to take data collected during production and analyze both current production capabilities and predict future production outputs. These are models meant to optimize production and assess performance…