Details of a Researcher - TANAKA, Keitaro

TANAKA, Keitaro

Scopus Paper Info

Paper Count: 18 Citation Count: 69 h-index: 5

The data was downloaded from the Scopus API on July 22, 2026, via http://api.elsevier.com and http://www.scopus.com .

Google Scholar Information (Citations per year)

Citation Count: 131 h-index: 7 i10-index: 4

Affiliation

Faculty of Science and Engineering, Waseda Research Institute for Science and Engineering

Job title

Junior Researcher (Assistant Professor)

Degree

Doctor of Engineering (2025.03, Waseda University)

Research Experience

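2025.04 - Now

Junior Researcher, Waseda Research Institute for Science and Engineering, Faculty of Science and Engineering, Waseda University

2022.04 - 2025.03

Japan Society for the Promotion of Science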

Education Background

2022.04 - 2025.03

Waseda University Graduate School of Advanced Science and Engineering

2020.04 - 2022.03

Waseda University Graduate School of Advanced Science and Engineering

2016.04 - 2020.03

Waseda University School of Advanced Science and Engineering

Committee Memberships

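2025.04 - Now

Steering Committee Member, Special Interest Group on Music and Computer (SIGMUS), Information Processing Society of Japan (IPSJ)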

Professional Memberships

ISCA
APSIPA
Information Processing Society of Japan (IPSJ)
IEEE

Research Areas

Perceptual information processing

Research Interests

Audio-Visual Multimodal
Deep Learning
Music Information Retrieval

Awards

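FY2025 Yamashita SIG Research Award (provisionally selected)

2025.09 Information Processing Society of Japan (IPSJ)

Winner：田中啓太郎
Student Excellence Award

2025.08 Meeting on Image Recognition and Understanding (MIRU)

Winner：佐々木馨, 佐藤和仁, 山口周悟, 田中啓太郎, 森島繁生
Audience Award

2025.08 Meeting on Image Recognition and Understanding (MIRU)

Winner：佐々木馨, 佐藤和仁, 山口周悟, 田中啓太郎, 森島繁生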
ACM Student Research Competition Semi-Finalist

2025.06 SIGGRAPH

Winner： Kaoru Sasaki, Kazuhito Sato, Shugo Yamaguchi, Keitaro Tanaka, Shigeo Morishima
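Azusa Ono Memorial Academic Award

2025.03 Waseda University

Winner：吉永朋矢, 田中啓太郎, 坂東宜昭, 井本桂右, 森島繁生
FY2024 COCOTONE Award, Special Interest Group on Music and Computer (SIGMUS)

2025.03 Information Processing Society of Japan (IPSJ)
Excellent Student Presentation Award, 2024 Autumn Meeting

2024.12 Acoustical Society of Japan (ASJ)
Student Research Encouragement Award, Electroacoustics (EA) Research Committee

2024.11 Acoustical Society of Japan (ASJ)
Student Presentation Award, Special Interest Group on Computer Graphics and Visual Informatics (CGVI)

2024.09 Information Processing Society of Japan (IPSJ)
VC Student Research Award

2024.09 Visual Computing
Best Presentation Award (Best Research category), 141st SIGMUS Summer Symposium

2024.08 Information Processing Society of Japan (IPSJ)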
Best Student Paper Contest Finalist

2023.08 European Signal Processing Conference (EUSIPCO)
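Student Encouragement Award, 85th National Convention

2023.03 Information Processing Society of Japan (IPSJ)

Winner：神庭有花, 田中啓太郎, 平田明日香, 森島繁生
Student Encouragement Award, 85th National Convention

2023.03 Information Processing Society of Japan (IPSJ)

Winner：柏木爽良, 田中啓太郎, 森島繁生
Master's Thesis Award in Physics and Applied Physics (Miyabe Award)

2022.03 Waseda University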
Japan Student Conference Paper Award

2021.12 IEEE Signal Processing Society (SPS)
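Azusa Ono Memorial Academic Award

2021.03 Waseda University
Student Encouragement Award, 83rd National Convention

2021.03 Information Processing Society of Japan (IPSJ)
Best Presentation Award, 128th SIGMUS Summer Symposium

2020.08 Information Processing Society of Japan (IPSJ)
Convention Encouragement Award, 82nd National Convention

2020.05 Information Processing Society of Japan (IPSJ)
Student Encouragement Award, 82nd National Convention

2020.03 Information Processing Society of Japan (IPSJ)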

Papers

ビット操作に基づく学習不要かつ高セキュリティな3D Gaussian Splattingステガノグラフィ

佐々木馨, 佐藤和仁, 山口周悟, 田中啓太郎, 森島繁生

Visual Computing (VC) 2025.09 [Refereed]
Training Onset-and-Offset Aware Sound Event Detection on a Heterogeneous Dataset via Probabilistic Sequential Modeling

Tomoya Yoshinaga, Yoshiaki Bando, Keitaro Tanaka, Keisuke Imoto, Masaki Onishi, Shigeo Morishima

Annual Conference of the International Speech Communication Association (Interspeech) 2025.08 [Refereed]
Cross-lingual Data Selection Using Clip-level Acoustic Similarity for Enhancing Low-resource Automatic Speech Recognition

Shunsuke Mitsumori, Sara Kashiwagi, Keitaro Tanaka, Shigeo Morishima

Annual Conference of the International Speech Communication Association (Interspeech) 2025.08 [Refereed]
Hide A Bit: A Training-Free and High-Fidelity Steganography Method for 3D Gaussian Splatting Based on Bit Manipulation and RSA Encryption

Kaoru Sasaki, Kazuhito Sato, Shugo Yamaguchi, Keitaro Tanaka, Shigeo Morishima

ACM International Conference and Exhibition on Computer Graphics and Interactive Techniques (SIGGRAPH) Posters 70 1 - 2 2025.08 [Refereed]
数式ドリブン事前学習に基づくロボットの方策学習の検証

岩片彰吾, 元田智大, 山田亮佑, 牧原昂志, 中條亨一, 田中啓太郎, 片岡裕雄, 森島繁生

画像の認識・理解シンポジウム (MIRU) ポスター発表 2025.08
Hide A Bit: 3D Gaussian Splattingに対するビット操作とRSA暗号に基づく学習不要な高品質ステガノグラフィ

佐々木馨, 佐藤和仁, 山口周悟, 田中啓太郎, 森島繁生

画像の認識・理解シンポジウム (MIRU) 口頭発表 2025.07 [Refereed]
口パク動画の発話内容推測モデルの学習における効率的な他言語データ活用に向けて

三森俊祐, 柏木紗良, 田中啓太郎, 森島繁生

画像の認識・理解シンポジウム (MIRU) ポスター発表 2025.07
AcousticPerformer: 再構成音を用いた双ドメイン損失による楽器演奏モーション生成

西澤大樹, Seong Jong Yoo, 田中啓太郎, 山口周悟, 馮起, 森島繁生

画像の認識・理解シンポジウム (MIRU) ポスター発表 2025.07
Formula-Supervised Sound Event Detection: Pre-Training Without Real Data

Yuto Shibata, Keitaro Tanaka, Yoshiaki Bando, Keisuke Imoto, Hirokatsu Kataoka, Yoshimitsu Aoki

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 1 - 5 2025.04 [Refereed]
SyncViolinist: Music-Oriented Violin Motion Generation Based on Bowing and Fingering

Hiroki Nishizawa, Keitaro Tanaka, Asuka Hirata, Shugo Yamaguchi, Qi Feng, Masatoshi Hamanaka, Shigeo Morishima

情報処理学会第142回音楽情報科学研究会国際既発表セッション 25 ( 2 ) 2025.03
低リソース言語の自動音声認識における他言語データの効率的利用

三森俊祐, 柏木爽良, 田中啓太郎, 森島繁生

情報処理学会第87回全国大会 2025.03
音響イベント検出のための隠れセミマルコフモデルに基づくユニバーサル・イベント単位学習

吉永朋矢, 田中啓太郎, 坂東宜昭, 井本桂右, 大西正輝, 森島繁生

情報処理学会第87回全国大会 2025.03
SyncViolinist: Music-Oriented Violin Motion Generation Based on Bowing and Fingering

Hiroki Nishizawa, Keitaro Tanaka, Asuka Hirata, Shugo Yamaguchi, Qi Feng, Masatoshi Hamanaka, Shigeo Morishima

IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 5419 - 5428 2025.03 [Refereed]
Unsupervised Pitch-Timbre-Variation Disentanglement of Monophonic Music Signals Based on Random Perturbation and Re-entry Training

Keitaro Tanaka, Kazuyoshi Yoshii, Simon Dixon, Shigeo Morishima

APSIPA Transactions on Signal and Information Processing 14 ( 1 ) 2025.02 [Refereed]
Capturing Dynamic Identity Features for Speaker-Adaptive Visual Speech Recognition

Sara Kashiwagi, Keitaro Tanaka, Shigeo Morishima

Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) 1 - 6 2024.12 [Refereed]
汎用事前学習済みモデルを用いた音響イベント検出のためのHSMMに基づくイベント単位学習

吉永朋矢, 田中啓太郎, 坂東宜昭, 井本桂右, 大西正輝, 森島繁生

日本音響学会応用/電気音響(EA)研究会 2024.11
Onset-and-Offset-Aware Sound Event Detection via Differentiable Frame-to-Event Mapping

Tomoya Yoshinaga, Keitaro Tanaka, Yoshiaki Bando, Keisuke Imoto, Shigeo Morishima

IEEE Signal Processing Letters 32 186 - 190 2024.11 [Refereed]
音源信号の数式ドリブン合成に基づく音響イベント検出の事前学習

柴田優斗, 田中啓太郎, 坂東宜昭, 井本桂右, 片岡裕雄, 青木義満

日本音響学会第152回 (2024年秋季) 研究発表会 2024.09
音響イベント検出のための隠れセミマルコフモデルに基づくイベント単位損失

吉永朋矢, 坂東宜昭, 田中啓太郎, 井本桂右, 大西正輝, 森島繁生

日本音響学会第152回 (2024年秋季) 研究発表会 2024.09
話者固有の発話特性に着目したマルチタスク学習に基づく読唇精度向上手法

柏木爽良, 田中啓太郎, 森島繁生

Visual Computing (VC) Long Track 42 2024.09 [Refereed]
変分オートエンコーダを用いた単旋律音楽信号の音高・音色・変動への分解

田中啓太郎, 吉井和佳, Simon Dixon, 森島繁生

情報処理学会第141回音楽情報科学研究会 2024.08
Keep Eyes on the Sentence: An Interactive Sentence Simplification System for English Learners Based on Eye Tracking and Large Language Models

Taichi Higasa, Keitaro Tanaka, Qi Feng, Shigeo Morishima

ACM CHI Conference on Human Factors in Computing Systems Late-Breaking Work 211 1 - 7 2024.05 [Refereed]
On the Use of Synthesized Datasets and Transformer Adaptors for Musical Instrument Recognition

Keitaro Tanaka, Yin-Jyun Luo, Kin Wai Cheuk, Kazuyoshi Yoshii, Shigeo Morishima, Simon Dixon

International Society for Music Information Retrieval (ISMIR) Late-Breaking Demo 2023.11 [Refereed]
Gaze-Driven Sentence Simplification for Language Learners: Enhancing Comprehension and Readability

Taichi Higasa, Keitaro Tanaka, Qi Feng, Shigeo Morishima

ACM International Conference on Multimodal Interaction (ICMI) workshops, Multimodal, Interactive Interfaces for Education 292 - 296 2023.10 [Refereed]
Detecting Unknown Multiword Expressions in Natural English Reading via Eye Gaze

Taichi Higasa, Asuka Hirata, Keitaro Tanaka, Qi Feng, Shigeo Morishima

Visual Computing (VC) Short Track 38 2023.09 [Refereed]
Audio-Visual Speech Enhancement With Preserving Specific Off-Screen Speech

Tomoya Yoshinaga, Keitaro Tanaka, Shigeo Morishima

Visual Computing (VC) Short Track 39 2023.09 [Refereed]
通常発声と無音発声の動画を用いた発話内容推測における距離学習に基づく精度差改善手法

柏木爽良, 田中啓太郎, 森島繁生

Visual Computing (VC) Long Track 37 2023.09 [Refereed]
Audio-Visual Speech Enhancement With Selective Off-Screen Speech Extraction

Tomoya Yoshinaga, Keitaro Tanaka, Shigeo Morishima

European Signal Processing Conference (EUSIPCO) 595 - 599 2023.09 [Refereed]
Improving the Gap in Visual Speech Recognition Between Normal and Silent Speech Based on Metric Learning

Sara Kashiwagi, Keitaro Tanaka, Qi Feng, Shigeo Morishima

Annual Conference of the International Speech Communication Association (Interspeech) 3397 - 3401 2023.08 [Refereed]
パッチ分割による拡散確率モデルのメモリ消費量削減の検討

荒川深映, 綱島秀樹, 堀田大地, 田中啓太郎, 森島繁生

画像の認識・理解シンポジウム (MIRU) ポスター発表 2023.07 [Refereed]
Memory Efficient Diffusion Probabilistic Models via Patch-based Generation

Shinei Arakawa, Hideki Tsunashima, Daichi Horita, Keitaro Tanaka, Shigeo Morishima

IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR) workshops, Generative Models for Computer Vision 9 2023.06 [Refereed]
口パク動画の発話内容推測における距離学習に基づく精度向上手法

柏木爽良, 田中啓太郎, 森島繁生

情報処理学会第85回全国大会 2023.03
覚醒度と感情価に基づく音楽による画像スタイル変換

神庭有花, 田中啓太郎, 平田明日香, 森島繁生

情報処理学会第85回全国大会 2023.03
動画内話者の音声強調における特定背景音声の透過

吉永朋矢, 田中啓太郎, 森島繁生

情報処理学会第85回全国大会 2023.03
視線情報と比喩度に基づく英語フレーズの理解度推定

樋笠泰祐, 平田明日香, 田中啓太郎, 森島繁生

インタラクティブシステムとソフトウェアに関するワークショップ (WISS) 2022.12
口パク動画の発話内容推測における距離学習に基づく精度向上手法の検討

柏木爽良, 田中啓太郎, 森島繁生

ビジュアルコンピューティングワークショップ (VCWS) 2 2022.11
入力動画に対する動画内話者と特定背景話者の同時音声抽出

吉永朋矢, 田中啓太郎, 森島繁生

ビジュアルコンピューティングワークショップ (VCWS) 3 2022.11
Unsupervised Disentanglement of Timbral, Pitch, and Variation Features From Musical Instrument Sounds With Random Perturbation

Keitaro Tanaka, Yoshiaki Bando, Kazuyoshi Yoshii, Shigeo Morishima

Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) 709 - 716 2022.11 [Refereed]
Patch-based Memory Efficient Diffusion Probabilistic Models

Shinei Arakawa, Hideki Tsunashima, Daichi Horita, Keitaro Tanaka, Shigeo Morishima

Visual Computing (VC) Posters 10 2022.10
運指と運弓を反映した音響信号からのヴァイオリン演奏アニメーションの自動生成

平田明日香, 田中啓太郎, 浜中雅俊, 森島繁生

Visual Computing (VC) Short Track 28 2022.10 [Refereed]
Audio-Driven Violin Performance Animation with Clear Fingering and Bowing

Asuka Hirata, Keitaro Tanaka, Masatoshi Hamanaka, Shigeo Morishima

ACM International Conference and Exhibition on Computer Graphics and Interactive Techniques (SIGGRAPH) Posters 7 1 - 2 2022.08 [Refereed]
視線情報を用いた英語フレーズの理解度推定

樋笠泰祐, 平田明日香, 田中啓太郎, 森島繁生

情報処理学会第84回全国大会 559 - 560 2022.03
弓遣いに基づく弦楽器演奏モーションの自動生成

平田明日香, 田中啓太郎, 島村僚, 森島繁生

Visual Computing (VC) Short Track 33 2021.09 [Refereed]
Bowing-Net: Motion Generation for String Instruments Based on Bowing Information

Asuka Hirata, Keitaro Tanaka, Ryo Shimamura, Shigeo Morishima

ACM International Conference and Exhibition on Computer Graphics and Interactive Techniques (SIGGRAPH) Posters 40 1 - 2 2021.08 [Refereed]
Manifold-Aware Deep Clustering: Maximizing Angles between Embedding Vectors Based on Regular Simplex

Keitaro Tanaka, Ryosuke Sawata, Shusuke Takahashi

Annual Conference of the International Speech Communication Association (Interspeech) 1134 - 1138 2021.08 [Refereed]
変分自己符号化器を用いた距離学習による楽器音の音高・音色分離表現

田中啓太郎, 錦見亮, 坂東宜昭, 吉井和佳, 森島繁生

情報処理学会第131回音楽情報科学研究会・第137回音声言語情報処理研究会共催研究会 2021.06
Pitch-Timbre Disentanglement of Musical Instrument Sounds Based on VAE-Based Metric Learning

Keitaro Tanaka, Ryo Nishikimi, Yoshiaki Bando, Kazuyoshi Yoshii, Shigeo Morishima

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 111 - 115 2021.06 [Refereed]
弓動作を反映した演奏モーションの自動生成

平田明日香, 田中啓太郎, 島村僚, 森島繁生

情報処理学会第83回全国大会 263 - 264 2021.03
弓動作に着目した弦楽器演奏モーションの自動生成

平田明日香, 田中啓太郎, 島村僚, 森島繁生

Visual Computing (VC) Posters 42 2020.12
Multi-Instrument Music Transcription Based on Deep Spherical Clustering of Spectrograms and Pitchgrams

Keitaro Tanaka, Takayuki Nakatsuka, Ryo Nishikimi, Kazuyoshi Yoshii, Shigeo Morishima

International Society for Music Information Retrieval (ISMIR) 327 - 334 2020.10 [Refereed]
スペクトログラムとピッチグラムの深層クラスタリングに基づく複数楽器パート採譜

田中啓太郎, 中塚貴之, 錦見亮, 吉井和佳, 森島繁生

情報処理学会第128回音楽情報科学研究会 2020.08
深層クラスタリングを用いた任意楽器パートの自動採譜

田中啓太郎, 中塚貴之, 錦見亮, 吉井和佳, 森島繁生

情報処理学会第82回全国大会 365 - 366 2020.03

Research Projects

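Deep Bayesian Automatic Music Transcription Considering a Generative Process Based on the Three Elements of Sound

Japan Society for the Promotion of Science (JSPS), Grants-in-Aid for Scientific Research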

Project Year: 2022.04 - 2025.03

Keitaro Tanaka

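This research addresses multi-instrument automatic music transcription, which estimates a score for every instrument that makes up a music audio signal. This fiscal year, we mainly worked on extending the three-element disentanglement method developed last fiscal year, which explicitly accounts for differences in playing technique. The previous model worked for isolated single-note instrument sounds, but for monophonic melodies with time-varying pitch and for singing voices, the desired latent features leaked into the other latent spaces.
To address this problem, we formulated a new probabilistic generative model that retains the previous model as a component module. Specifically, we proposed a training method in which the encoder and the decoder are each used twice, so that unnecessary information is gradually eliminated from each latent space. This improved the disentanglement accuracy in the latent spaces for diverse inputs.
In addition, we conducted research focused on musical instrument recognition. Instrument recognition is one of the mainstream problem settings in music information processing. However, accuracy has been evaluated only on a limited set of benchmark datasets, and the expected recognition accuracy is not obtained on other datasets, particularly those with little data. We therefore proposed an efficient way to use artificially created datasets and improved recognition accuracy on other datasets.
Furthermore, this fiscal year we also worked on extending derived techniques to other domains. Metric learning focused on homogeneity, the predecessor of the three-element disentanglement method, was successfully applied to a lip-reading method that is robust to differences in speaking style. The model structure focused on time-varying and time-invariant properties, which underlies the three-element disentanglement method, was successfully applied to an efficient and accurate audio-visual speech enhancement method.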