Tag: how to

December 29, 2020 Greggory Elias

Tutorial: How to pre-train ELECTRA for Spanish from Scratch Originally published by Skim AI’s Machine Learning Researcher, Chris Tran. Introduction This article shows how to pre-train ELECTRA, another member of the Transformer pre-training method family, for Spanish to achieve state-of-the-art results in Natural Language Processing benchmarks. It is Part III in a series on training…

December 28, 2020 Greggory Elias

Tutorial: How to Fine-Tune BERT for Extractive Summarization Originally published by Skim AI’s Machine Learning Researcher, Chris Tran 1. Introduction Summarization has long been a challenge in Natural Language Processing. To generate a short version of a document while retaining its most important information, we need a model capable of accurately extracting the key points…

July 27, 2020 Greggory Elias

Natural Language Generation and Its Business Applications Natural Language Generation (NLG) As a continued exploration of AI Authors and Robot-Generated news, it is worthwhile to explore some of the technology driving these algorithms. AI designed to generate documents that read as if a human wrote them relies on Natural Language Generation (NLG) algorithms. NLG algorithms are…

March 20, 2020 Greggory Elias

10 Questions to Ask Before Starting a Machine Learning Project Over 80% of data science projects fail to go beyond testing and into production. If everyone is starting a machine learning project, where is it going wrong? Undoubtedly, ML solutions increase efficiencies for those who are in the business of gathering or analyzing large swaths…

December 5, 2019 Asya Sharrow

Topic Modeling for Product Managers What is Topic Modeling? Topic modeling is a type of natural language processing (NLP) used to find “topics,” or commonly occurring words or groups of words, within a set of documents. Topic models are critical to product managers because they enable them to sort and analyze the huge amounts of…

November 11, 2019 Asya Sharrow

10 Best Practices for Storing Labeled Data You just had your big idea. You read a lot, and you thought it would be interesting to have a classifier that labels a speaker’s tone and determines their political affiliation. How would you begin to break down the problem so that you can use machine learning to…