#ExpandHashtags for NLP

Expand hashtags with upper() and lower() letters for Natural Language Processing The infamous hashtag can present issues when processing text data for NLP. The annoyances begin with “those” people who overuse hashtags on a regular basis. The bothersome nature continues when millennials use them in casual conversation, or when the entire purpose is muted, suchContinue reading “#ExpandHashtags for NLP”

NLP Preprocess Tweets

Natural language processing, also known as NLP, combines computer science and linguistics to understand and process the relationships contained within communication languages. Words, characters, documents, sentences, and punctuation can play a factor in how humans understand language, and using this information, computers are capable of also learning and understanding how humans communicate by analyzing theseContinue reading “NLP Preprocess Tweets”

Audio File Conversion

When it comes to processing audio files for classification in Python, having a .WAV format is critical to creating visualizations and extracting features from the data/audio file. Common audio file formats include .FLAC, .MP3, and several other codecs. To convert files from their original format to .wav formatted files, the Pydub library provides a convenientContinue reading “Audio File Conversion”

Visualizing Sound in Python

An audio file, or any sound really, has the following properties: Frequency, Wavelength, Amplitude, Speed, Direction These are characteristics that can be used to distinguish sounds from each other in neural networks, not unlike in the human brain. Visualizing audio files is an important task in data science, due to the means of processing classificationContinue reading “Visualizing Sound in Python”

Label by Subfolder

Importing data into machine learning projects differs depending on how the data is delivered. For data that is split beforehand, into train, test, and validation folders, there are often subfolders, sometimes within subfolders containing data, whether it be image, audio, or video files, labels are necessary for supervised learning models. If the data is organizedContinue reading “Label by Subfolder”

Design a site like this with WordPress.com
Get started