Tutorial: How to train a RoBERTa Language Model for Spanish
SpanBERTa: How We Trained a RoBERTa Language Model for Spanish from Scratch. Originally published by Skim AI's Machine Learning Research Intern, Chris Tran. Introduction: Self-training methods with transformer models have...
Tutorial: Fine-tuning BERT for Sentiment Analysis
Originally published by Skim AI's Machine Learning Researcher, Chris Tran. Introduction: In recent years the NLP community has seen many breakthroughs in Natural Language Processing,...
10 Questions to Ask Before Starting a Machine Learning Project
Over 80% of data science projects fail to go beyond testing and into production. If everyone is starting a machine learning project, where is it going wrong? Undoubtedly, ML solutions increase efficiencies...
How to use Skim AI in your research process
It’s very likely that your organization’s current methods for managing the research process, the data collected from it and the content produced as a result are coming up short. Tools like Google Drive, Evernote, and...
The new wave in journalism – robot writers?
So, how prevalent are robot reporters? What dangers do robot reports pose? What is the likelihood that the article you are reading wasn’t written by a human? A 2015 report stated the AP was generating about 3,000 articles a...
Product Update – Smarter Searching
With smart search functionality and a sleeker design, Skim AI v3.0 makes your research efforts even more streamlined. Install for free today.
Topic Modeling for Product Managers
What is Topic Modeling? Topic modeling is a type of natural language processing (NLP) used to find “topics,” or commonly occurring words or groups of words, within a set of documents. Topic models are critical to product managers...
10 Best Practices for Storing Labeled Data
You just had your big idea. You read a lot, and you thought it would be interesting to have a classifier that labels a speaker’s tone and determines their political affiliation. How would you begin to break down the problem...
What You Should Know Before You Select a Sentiment Analysis Dataset
Every sentiment model requires training data, referred to as a sentiment analysis dataset. There are a few things you should know before you make your decision as to which popular dataset to use.
Are We Really Making Progress on Neural Recommendation Approaches?
A summary of Maurizio Ferrari Dacrema et al.’s recent article at RecSys 2019. Neural Recommendation Algorithms: Recommendation algorithms have become ubiquitous across commercial fields, from the Amazon...
One Tool for (All) Researchers
It's common practice to classify things into different categories to simplify a concept or a problem. But how often does that classification lead us to apply our preconceived notions of one category to the overall concept? For example,...
Searching for top 20% facts
We all know that there’s an infinite amount of data on the internet for essentially any topic. Don’t believe me? Let’s test it. It took Google 0.46 seconds to give me 460 MILLION results for “multi-label text classification.” You get the...
Real-time production models – How do they differ from benchmark tests?
What are Real-Time Production Models and Benchmark Tests? Real-time production models are models that enable users to take data collected during production and analyze both current production...
Product Update – Take Notes Your Way
Skim AI is making it even easier to take notes your way. Read more about our exciting new features that allow you to customize your Skim AI experience even further.
50,000 Websites and 10,000 Hours
Skim AI was born from two numbers: 50,000, the number of new websites with new information published every day, and 10,000, the number of hours it takes to become an expert at something, like mining information from 50,000 new websites a day.
How to Summarize the News
Learn how to summarize news for any situation with Skim AI’s top tips.
5 steps to writing a top-notch thesis statement
We here at Skim AI know how to say a lot with few words, and that’s the goal of every thesis statement. But it’s not easy. Seriously, how do you break down the main idea of a five- to six-page research paper into one sentence? Well, we’ve broken down that whole process into five easy steps just for you.
How to Painlessly Write a Summary
At Skim AI, we’re experts at getting to the point. We know the summarization process is tough, so we’ve put together a guide to help you painlessly write a summary, whatever your purpose. Before you start, determine why...