Bdim

Understanding Embedding Layers

Imagine you’re in a classroom with many students, each with their own name. The teacher wants to assign tasks based on students’ names, but the names themselves are meaningless and can’t help the teacher make decisions directly. So, the teacher assigns each student a number, like Xiaoming is No. 1 and Xiaohong is No. 2. These numbers act as “labels” for the students, helping the teacher organize them better. However, the numbers alone aren’t enough, as they don’t carry much information....

# NLP # Deep Learning # RNN

OMR Related Paper Review: Practical End-to-End Optical Music Recognition for Pianoform Music

Core Ideas 1. Background of the Study: This paper explores how to implement an end-to-end Optical Music Recognition (OMR) system using deep learning methods, with a focus on recognizing pianoform music. While recent progress has been made in monophonic music recognition, existing OMR models struggle to handle the multi-voice and multi-staff nature of piano music. The complexity of piano music stems from its independent parallel voices, which can freely appear and disappear within a composition....

# OMR # Deep Learning # Image Recognition

How to Understand Batch Normalization?

Batch Normalization (BN) When I first tried to understand Batch Normalization (BN), I went through a lot of resources, but I still found the specific implementation and purpose somewhat unclear. A few days ago, I asked my teacher for an explanation, and he gave me an example that made it much easier to grasp. I found it very helpful, and after going back to review other materials, everything started to make sense....

# Deep Learning # Batch Normalization # BN

My note on learning LeNet 5

Some words “The more I get, the less I know.”——bdim Recently, I am trying to learn more about image recognition. One of the topics that I am interested in is LeNet, which is a classic convolutional neural network. In this article, I will try to re-implement a LeNet using PyTorch and then abstractly understand some parts that I do not quite understand. The contents of this article may be inaccurate and will continue to update the details....

# LeNet # Deep learning # CNN # macOS # PyTorch

Documenting the Recovery of a Lost Video File Due to FX3 Power Failure

Today, while recording a video for a client, the fake battery ran out of power, causing the camera to shut down abruptly and the video file to be unsaved. I didn’t realize this immediately because my impression of the FX3 was that it would save the video file even if the power was forcibly cut off during recording. It wasn’t until I got home and opened the memory card that I was shocked: the first video file was missing!...

# FX3 # CFA # Video File Loss # Recovery # DAT