Multimodal AI Algorithm Expert-EMG / Interaction Perception – Pico
Company | ByteDance |
---|---|
Location | San Jose, CA, USA |
Salary | $Not Provided – $Not Provided |
Type | Full-Time |
Degrees | Master’s |
Experience Level | Senior, Expert or higher |
Requirements
- Master’s degree or above in machine learning, signal processing, computer science, statistics, speech and language technology, or a related field
- Proficient in the fundamentals of deep learning, with substantial experience in training and deploying deep models
- Experience in fine-tuning end-to-end speech recognition frameworks (e.g., Conformer, RNN-T, LAS, CTC)
- Familiarity with 2D/3D perception algorithms in computer vision
- Familiarity with sEMG signal acquisition and processing, including signal denoising (e.g., filtering algorithms) and feature extraction (e.g., time-domain and frequency-domain features)
- Experience with common digital signal processing techniques, such as digital filtering, Fourier transforms, correlation analysis, and modulation.
Responsibilities
- Research and develop innovative deep learning models with a focus on the intersection of surface electromyography (sEMG), computer vision, and IMU technologies
- Design and implement sEMG signal acquisition pipelines, optimize signal quality, and perform data preprocessing tasks such as denoising and feature extraction
- Explore spatiotemporal feature fusion methods (e.g., Transformer, LSTM, Spatiotemporal Convolutional Networks) to achieve efficient multimodal data fusion
- Handle sensor noise and interference in various environments, optimize model performance, and improve the generalization of algorithms
Preferred Qualifications
- Experience in analyzing and modeling high-dimensional time-series data, such as speech signals, neural signals, physiological signals, videos, or other sensor data
- Publications in top academic conferences (e.g., CVPR, ICCV, ECCV) or participation in competitions like Kaggle, COCO, ImageNet, ActivityNet, and 3D-related challenges.