研究者詳細 - 渡辺　裕

写真a

ワタナベ　ヒロシ

渡辺　裕

Google Scholar 情報（Citations per year）

Citation: 4393 h-index: 22 i10-index: 48

Click to view the Google Scholar page.

所属

理工学術院基幹理工学部

職名

教授

学位

工学博士 ( 北海道大学 )

ホームページ

https://www.ams.giti.waseda.ac.jp

プロフィール

1980年北大・工・電子卒．1985年同大大学院博士課程修了．同年NTT入社．NTT研究所において画像の高能率符号化及びMPEG標準化に従事．2000年早稲田大学大学院国際情報通信研究科教授．現在、早稲田大学基幹理工学部情報通信学科教授．マルチメディアの符号化及び配信に関する研究に興味を持つ．工博．IEEE，電子情報通信学会，映像情報メディア学会，情報処理学会，画像電子学会各会員．

経歴

2013年04月

-

　

早稲田大学基幹理工学部情報通信学科教授
2001年04月

-

2013年03月

早稲田大学大学院国際情報通信研究科教授
2000年09月

-

2001年03月

早稲田大学国際情報通信研究センター教授
1995年04月

-

2000年09月

NTTヒューマンインタフェース研究所主幹研究員
1991年03月

-

1994年03月

NTTヒューマンインタフェース研究所特別研究員
1990年06月

-

1991年05月

Bell Communications Research 客員研究員
1987年04月

-

1989年03月

NTTヒューマンインタフェース研究所主任研究員
1987年04月

-

1989年03月

NTT ヒューマンインタフェース研究所画像通信研究部研究主任
1985年04月

-

1987年03月

NTT 横須賀電気通信研究所画像通信研究部研究員

▼全件表示

学歴

1980年04月

-

1985年03月

北海道大学大学院工学研究科電子工学
1976年04月

-

1980年03月

北海道大学工学部電子工学科

所属学協会

　

　

　

IEEE (The Institute of Electrical and Electronic Engineers)
　

　

　

情報処理学会
　

　

　

画像符号化シンポジウム
　

　

　

IEEE Circuits and Systems Society
　

　

　

電子情報通信学会
　

　

　

画像電子学会
　

　

　

映像情報メディア学会

▼全件表示

研究分野

知能情報学

研究キーワード

情報通信工学

受賞

情報処理学会情報規格調査会標準化貢献賞

2005年
日本規格協会標準化文献賞奨励賞

1998年
画像電子学会技術賞

1997年
映像情報メディア学会丹羽高柳著述賞

1997年
電子情報通信学会学術奨励賞

1988年

論文

GSA: guided supervised attention for enhancing real-time dense object detection via parameter-free scheme

Haoyuan Liu, Hiroshi Watanabe

SPIE digital library, Journal of Electronic Imaging 33 ( 06 ) 2024年11月 [査読有り]

DOI
Image Coding for Machines with Edge Information Learning Using Segment Anything

Takahiro Shindo, Kein Yamada, Taiju Watanabe, Hiroshi Watanabe

IEEE International Conference on Image Processing (ICIP 2024) 3702 - 3708 2024年11月 [査読有り]

DOI
Improving Image Coding for Machines through Optimizing Encoder via Auxiliary Loss

Kei Iino, Shunsuke Akamatsu, Hiroshi Watanabe, Shohei Enomoto, Akira Sakamoto, Takeharu Eda

IEEE International Conference on Image Processing (ICIP 2024) 3771 - 3777 2024年10月 [査読有り]

DOI
GEEG-YOLOv8: Gaussian Enhanced Euclidian Norm Ghost Attention for Real-time Polyp Detection

Phuong Thao Ngyuen, Hiroshi Watanabe

IEEE International Conference on Image Processing (ICIP 2024) 3057 - 3063 2024年10月 [査読有り]

DOI
Scalable Image Coding for Humans and Machines Using Feature Fusion Network

Takahiro Shindo, Yui Tatsumi, Taiju Watanabe, Hiroshi Watanabe

IEEE International Workshop on Multimedia Signal Processing (MMSP2024) 1994 - 2000 2024年10月 [査読有り]

DOI
Perceptual Image Compresion via Stable Diffusion at Low Bitrate

Luoxu Jin, Hiroshi Watanabe

IEEE International Conference on Multimedia Information Processing and Retrieval (MIPR 2024) 626 - 629 2024年08月 [査読有り]

DOI
[Paper] Tracked QEM Algorithm: Adding Temporal Consistency to Dynamic Mesh Simplification Based on Mesh Registration

Yunlong Liu, Jianfeng Xu, Kei Kawamura, Hiroshi Watanabe

ITE Transactions on Media Technology and Applications 12 ( 3 ) 175 - 189 2024年 [査読有り]

DOI
Accurate and Efficient Surface Reconstruction from Point Clouds via Geometry-Aware Local Adaptation

Eito Ogawa, Taiga Hayami, Hiroshi Watanabe

IIEEJ International Conference on Image Electronics and Visual Computing (IEVC 2026) 2026年03月 [査読有り]

DOI
Flow Residual Segmentation and Generative Reconstruction for Motion-Aware Video Coding

Ziyue Zeng, Yui Tatsumi, Hiroshi Watanabe

IIEEJ International Conference on Image Electronics and Visual Computing (IEVC 2026) 2026年03月 [査読有り]

DOI
Enhancing RGB-IR Object Detection: A Frozen Backbone Approach with Multi-Receptive Field Attention

Bingyu Lu, Haoyuan Liu, Hiroshi Watanabe

Springer, The Visual Computer 42 ( 164 ) 1 - 13 2026年02月 [査読有り]

DOI
Robust Bounding Box Regression Loss within an Interpolation-Based IoU Framework

Haoyuan Liu, Hiroshi Watanabe

Elsevier, Neurocomputing 666 ( 132230 ) 1 - 12 2026年02月 [査読有り]

DOI
Contour Information Aware 2D Gaussian Splatting for Image Representation

Masaya Takabe, Hiroshi Watanabe, Sujun Hong, Tomohiro Ikai, Zheming Fan, Ryo Ishimoto, Kakeru Sugimoto, Ruri Imichi

IEEE International Conference on Consumer Electronics (ICCE 2026) 2026年02月 [査読有り]

DOI
Training-Free Adaptive;Quantization for Variable Rate Image Coding for Machines

Yui Tatsumi, Ziyue Zeng, Hiroshi Watanabe

IEEE International Conference on Consumer Electronics (ICCE 2026) 2026年02月 [査読有り]

DOI
A Lightweight Channel Bias for Small Object Detection in Aerial Images

Hongrui Fan, Haoyuan Liu, Hiroshi Watanabe

Proc. SPIE, The International Workshop on Advanced Image Technology (IWAIT2026) 14072 1 - 4 2026年02月 [査読有り]

DOI
Time Step Generating: A Universal Synthesized Deepfake Image Detector

Ziyue Zeng, Yupei Guo, Haoyuan Liu, Dingjie Peng, Hiroshi Watanabe

ACM Multimedia Asia (MMAsia2025) 24 1 - 8 2025年12月 [査読有り]

DOI
構造保持パッチデコードの時間方向拡張による効率的映像表現

速見泰雅, 渡辺裕

画像符号化シンポジウム (PCSJ/IMPS2025) P3-05 2025年11月
局所サイズを適応調整する点群メッシュ化手法

小川英人, 速見泰雅, 渡辺裕

画像符号化シンポジウム (PCSJ/IMPS2025) P3-10 2025年11月
鏡面反射を考慮した 3D Convex Splatting による高リアリティレンダリング

中村悠大, 髙部雅矢, 速見泰雅, 渡辺裕

画像符号化シンポジウム (PCSJ/IMPS2025) P3-10 2025年11月
VFI-LoRA: Leveraging Video Diffusion Models for Video Interpolation Through LoRA Finetuning

Taiju Watanabe, Takahiro Shindo, Yui Tatsumi, Hiroshi Watanabe

IEEE International Conference on Internet of Things and Intelligence Systems (IoTaIS2025) 254 - 259 2025年11月 [査読有り]

DOI
Bidirectional Attention-Gated Motion Injection for Frame Interpolation

Ziyue Zeng, Yui Tatsumi, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2025) RAV 53 - 57 2025年09月 [査読有り]

DOI
SR-NeRV: Improving Embedding Efficiency of Neural Video Representation via Super-Resolution

Taiga Hayami, Kakeru Koizumi, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2025) OS-TMR(1) 881 - 884 2025年09月 [査読有り]

DOI
3D Gaussian Splatting With Semantic Information Based on Semantic Class Labels

Masaya Takabe, Taiju Watanabe, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2025) OS-TMR(1) 877 - 878 2025年09月 [査読有り]

DOI
Semantic Reconstruction for Unified Detection of Local and Logical Anomalies

Kakeru Koizumi, Taiga Hayami, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2025) OS-TMR(2) 1081 - 1084 2025年09月 [査読有り]

DOI
Depth-Wise Federated Learning Based on Model Parameter Reliability

Ryosuke Nutaba, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2025) OS-TMR(2) 1095 - 1086 2025年09月 [査読有り]

DOI
Clinically Prioritized Attention-Based Fusion of Multi-Plane Knee MRI for Robust Injury Detection

Taira Kunitomi, Taiga Hayami, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2025) OS-IMD(2) 485 - 488 2025年09月 [査読有り]

DOI
Seed Selection for Human-Oriented Image Reconstruction via Guided Diffusion

Yui Tatsumi, Ziyue Zeng, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2025) OS-TMR(3) 1281 - 1284 2025年09月 [査読有り]

DOI
Enhancing Continuous Emotion Recognition via Visually Diverse Frame Selection

Soki Saigo, Taiga Hayami, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2025) OS-TMR(3) 1275 - 1278 2025年09月 [査読有り]

DOI
Image-Adaptive Context Modeling for Compression Based on Implicit Neural Representations

Shoko Tanaka, Takahiro Shindo, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2025) OS-TMR(3) 1279 - 1280 2025年09月 [査読有り]

DOI
Brain Rot Detection in Users: Analyzing Behavioral Insights from Large-Scale User Interactions

Kein Yamada, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2025) POS-1A 118 - 122 2025年09月

DOI
Guided Diffusion for the Extension of Machine Vision to Human Visual Perception

Takahiro Shindo, Yui Tatsumi, Taiju Watanabe, Hiroshi Watanabe

IEEE International Workshop on Multimedia Signal Processing (MMSP2025) 1 - 6 2025年09月 [査読有り]

DOI
Explicit Residual-Based Scalable Image Coding for Humans and Machines

Yui Tatsumi, Ziyue Zeng, Hiroshi Watanabe

IEEE International Workshop on Multimedia Signal Processing (MMSP2025) 1 - 6 2025年09月 [査読有り]

DOI
Structure-Preserving Patch Decoding for Efficient Neural Video Representation

Taiga Hayami, Kakeru Koizumi, Hiroshi Watanabe

IEEE International Workshop on Multimedia Signal Processing (MMSP2025) 1 - 6 2025年09月 [査読有り]

DOI
Adapting Image-to-Video Diffusion Models for Large-Motion Frame Interpolation

Luoxu Jin, Hiroshi Watanabe

IEEE International Workshop on Multimedia Signal Processing (MMSP2025) 1 - 6 2025年09月 [査読有り]

DOI
Automating Task Monitoring in Industrial Settings Using Combined Gesture Recognition and Object Detection

Smita Priyadarshani, Kakeru Koizumi, Catherine Kuramitsu, Hiroshi Watanabe, Sooyoung Jeong

IFIP Advances in Information and Communication Technology 543 - 558 2025年08月 [査読有り]

DOI
Graphical Approach of Indentifying Patterns of Social Media Addiction

山田啓印, 渡辺裕

2025年映像情報メディア学会年次大会 ( 11B-4 ) 2025年08月
Improving the Accuracy of Pose-Guided Human Image Generation via Multi-Stage ControlNet Fine-Tuning

伊玖, 渡辺裕

2025年映像情報メディア学会年次大会 ( 13B-4 ) 2025年08月
A Study on 3D Human Pose Estimation via Diffusion Models

劉心毅, 渡辺裕

2025年映像情報メディア学会年次大会 ( 23-B3 ) 2025年08月
視聴と画像認識のための階層画像符号化における残差情報の活用とその有効性の検討

巽優衣, Ziyue Zeng, 渡辺裕

画像の認識・理解シンポジウム 2025 IS1-159 1 - 5 2025年07月
Image Coding for Object Recognition on Contour Feature Learning with Flexible Object Selection

Takahiro Shindo, Taiju Watanabe, Yui Tatsumi, Hiroshi Watanabe

IEEE ACCESS 13 106606 - 106617 2025年06月 [査読有り]

DOI
色情報を用いた点群分類における代表点選択

小野蒼平, 渡辺裕

電子情報通信学会総合大会 ( D-12-58 ) 2025年03月
A Refined DARTS-based Method for Speaker Recognition

Minghao Duan, Hiroshi Watanabe

電子情報通信学会総合大会 ( D-14-04 ) 2025年03月
モデルのパラメータの信頼度を考慮したFederated Learning

蒐場涼介, 渡辺裕

情報処理学会全国大会 ( 5X-07 ) 2025年03月
機械のための復号画像における顔認証精度の評価

進藤嵩紘, 渡部泰樹, 巽優衣, 渡辺裕

情報処理学会全国大会 ( 1X-05 ) 2025年03月
事前学習済みの拡散モデルを使用したフレーム補間

渡部泰樹, 進藤嵩紘, 巽優衣, 渡辺裕

情報処理学会全国大会 ( 6X-03 ) 2025年03月
Delta-ICM: Entropy Modeling with Delta Function for Learned Image Compression

Takahiro Shindo, Taiju Watanabe, Yui Tatsumi, Hiroshi Watanabe

IEEE International Conference on Consumer Electronics (ICCE 2025) 2025年01月 [査読有り]

DOI
Neural Video Representation for Redundancy Reduction and Consistency Preservation

Taiga Hayami, Takahiro Shindo, Shunsuke Akamatsu, Hiroshi Watanabe

IEEE International Conference on Consumer Electronics (ICCE 2025) 2025年01月 [査読有り]

DOI
ICM手法のプライバシー保護における有効性の検証

進藤嵩紘, 渡部泰樹, 巽優衣, 渡辺裕

画像符号化シンポジウム2024 ( P04-1 ) 2024年11月
A Study of Video Frame Interpolation Based on Pre-trained Video Diffusion Models Using Parameter-efficient Fine-tuning

Jin Louxu, Hiroshi Watanabe, Takeshi Chujoh, Zheming Fan, Sujun Hong, Ryo Ishimoto, Kakeru Sugimoto, Tomohiro Ikai

( P4-14 ) 2024年11月
空間と時間的一貫性のある動画表現の一検討

速見泰雅, 進藤嵩紘, 渡辺裕

画像符号化シンポジウム2024 ( P2-15 ) 2024年11月
特徴量差分を用いた視聴と画像認識のための階層画像符号化

巽優衣, 進藤嵩紘, 渡部泰樹, 渡辺裕

画像符号化シンポジウム2024 ( P2-01 ) 2024年11月
顔認識システムにおけるプロンプトベース画像符号化と検出器の組み合わせによる精度評価

藤浪優里佳, 渡辺裕

画像符号化シンポジウム2024 ( P1-12 ) 2024年11月
Distilled RSN: Lightweight Pose Estimation Using Knowledge Distillation

Jiu Yi, Haoyuan Liu, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2024) ( OS-AIP(3) ) 882 - 884 2024年11月 [査読有り]

DOI
Refining Coded Image in Human Vision Layer Using CNN-Based Post-Processing

Takahiro Shindo, Yui Tatsumi, Taiju Watanabe, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2024) ( OS-TMR(2) ) 181 - 182 2024年11月 [査読有り]

DOI
Integrating QR Code Characteristics into SUper-Resolution Method

Shoko Tanaka, Yui Tatsumi, Takahiro Shindo, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2024) ( OS-TMR(3) ) 328 - 329 2024年11月 [査読有り]

DOI
Classification in Japanese Sign Language Based on Dynamic Facial Expressions

Yui Tatsumi, Shoko Tanaka, Shunsuke Akamatsu, Takahiro Shindo, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2024) ( POS3A ) 1006 - 1007 2024年11月 [査読有り]

DOI
Enhancing Object Detection with Non-Local Modules in The Feature Pyramid Network

Hongrui Fan, Haoyuan Liu, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2024) ( OS-AIP(3) ) 890 - 893 2024年11月 [査読有り]

DOI
Adversarial Level of Face Images Generated by Prompt-Based Image Coding in Face Recognition System

Yurika Fujinami, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2024) ( OS-TMR(3) ) 332 - 333 2024年11月 [査読有り]

DOI
Implicit Neural Representation for Videos Based on Residual Connection

Taiga Hayami, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2024) ( OS-TMR(3) ) 330 - 331 2024年11月 [査読有り]

DOI
Acceletation Method for Super-Resolution Based on Diffusion Models by Intermediate Step Predicition

Jichen Ma, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2024) ( OS-TNR(2) ) 177 - 178 2024年11月 [査読有り]

DOI
Class Activation Mappingを用いた視線推定手法の評価および有効性の確認

杉山秀治, 福田大翔, 中山光典, 渡辺裕

映像情報メディア学会年次大会 ( 11A-4 ) 2024年08月
自動ボールストライク判定におけるフロートラッキングを用いた処理量削減法

福田大翔, 中山光典, 杉山秀治, 渡辺裕

映像情報メディア学会年次大会 ( 11A-5 ) 2024年08月
画素値の動的変化に基づく動画像表現

進藤嵩紘, 速見泰雅, 田中頌子, 渡辺裕

映像情報メディア学会年次大会 ( 13B-3 ) 2024年08月
Cross-Frame Attentionを用いた映像補間モデルの一検討

渡部泰樹, 進藤嵩紘, 巽優衣, 渡辺裕

映像情報メディア学会年次大会 ( 13B-4 ) 2024年08月
主成分分析を用いたエッジの移動抑制による点群平滑化の改善手法

中山光典, 杉山秀治, 福田大翔, 渡辺裕

映像情報メディア学会年次大会 ( 13B-5 ) 2024年08月
Bounding Box Aware Edge-Cloud Collaborative Method for Multiple Object Detection

Shunsuke Akamatsu, Hiroshi Watanabe

IEEE World Conference on Applied Intelligence and Computing (AIC2024) 1155 - 1159 2024年07月 [査読有り]

DOI
Event-based Robust 3D Pose Estimation Using Time Series Data

Kakeru Koizumi, Hiroshi Watanabe

IEEE World Conference on Applied Intelligence and Computing (AIC2024) 1223 - 1227 2024年07月 [査読有り]

DOI
Phase Repair for Time-Domain Convolutional Neural Networks in Music Super-Resolution

Yenan Zhang, Guilly Kolkman, Hiroshi Watanabe

Sound Music and Computing (SMC2024) ( 44 ) 1 - 6 2024年07月 [査読有り]
野球におけるボールストライク自動判定手法の提案

福田大翔, 足立翔平, 渡辺裕

ITE-SIP2024-7 20 - 23 2024年03月
アピアランスベースの視線推定における実現可能性を考慮した精度向上

杉山秀治, 福田大翔, 渡辺裕

情報処理学会全国大会 7U-02 2024年03月
ホモグラフィ変換に基づくホームベースの大きさ情報を活用したストライクゾーン取得精度の向上

福田大翔, 杉山秀治, 渡辺裕

情報処理学会全国大会 5S-04 2024年03月
野球試合映像からの高速な全投球映像抽出手法の検討

足立翔平, 福田大翔, 渡辺裕

映像情報メディア学会スポーツ情報処理研究会 ITE-SIP2024-6 16 - 19 2024年03月
A Real-time Polyp Detection Method Based on GhostAtt-YOLOv8

Phong Thao Nguyen, Hiroshi Watanabe

IIEEJ International Conference on Image Electronics and Visual Computing (IEVC2024) 2024年03月 [査読有り]
Object Detection Method for Drone Videos Using Optical Flow

Takahiro Shindo, Kein Yamada, Taiju Watanabe, Hiroshi Watanabe

IIEEJ International Conference on Image Electronics and Visual Computing (IEVC2024) ( LBP-05 ) 2024年03月
Image Coding for Machines with Objectness-based Feature Distillation

Kei Iino, Shunsuke Akamatsu, Hiroshi Watanabe, Shohei Enomoto, Akira Sakamoto, Takeharu Eda

IIEEJ International Conference on Image Electronics and Visual Computing (IEVC2024) ( LBP-07 ) 2024年03月
Edge-Cloud Collaborative Object Detection Model with Feature Compression

Shunsuke Akamatsu, Kei Iino, Hiroshi Watanabe, Shohei Enomoto, Akira Sakamoto, Takeharu Eda

IIEEJ International Conference on Image Electronics and Visual Computing (IEVC2024) ( LBP-12 ) 2024年03月
3D Pose Estimation Using Time Series Data in Event-based Video

Kakeru Koizumi, Hiroshi Watanabe

IIEEJ International Conference on Image Electronics and Visual Computing (IEVC2024) ( LBP-28 ) 2024年03月
Data Augmentation with 3D-rendered Models for Livestock Recognition Using Drone Footage

Kein Yamada, Takahiro Shindo, Taiju Watanabe, Hiroshi Watanabe

IIEEJ International Conference on Image Electronics and Visual Computing (IEVC2024) ( BP-29 ) 2024年03月
The Effect of Edge Information in Stable Diffusion Applied to Image Coding

Hiroshi Watanabe, Luoxu Jin, Taiga Hayami, Takeshi Chujoh, Yukinobu Yasugi, Sujun Hong, Zheming Fan, Tomohiro Ikai

IIEEJ International Conference on Image Electronics and Visual Computing (IEVC2024) ( LBP-15, ) 2024年03月
画像認識向け深層画像圧縮における補助損失の導入

飯野景, 赤松俊輔, 渡辺裕, 江田毅晴, 榎本昇平, 坂本啓

電子情報通信学会総合大会 ( D12A-37 ) 2024年03月
残差MLPと注意機構を組み合わせた点群分類手法の検討

范洪睿, 渡辺裕

電子情報通信学会総合大会 ( D-12B-13 ) 2024年03月
NeRFに基づくフレーム補間手法の品質改善

速見泰雅, 金洛旭, 渡辺裕

電子情報通信学会総合大会 ( D-11A-26 ) 2024年03月
エッジ・色情報を反映したプロンプトベースの画像符号化

渡辺裕, 金洛旭, 速見泰雅, 中條健, 八杉将伸, 洪秀俊, 范哲銘, 猪飼知宏

電子情報通信学会総合大会 ( D-11A-27 ) 2024年03月
Image Coding for Machines with Object Region Learning

Takahiro Shindo, Taiju Watanabe, Kein Yamada, Hiroshi Watanabe

IEEE Consumer Communications & Networking Conference (CCNC 2024) 2024年01月 [査読有り]

DOI
Enhancing Gaze Estimation through Fine-Grained Analysis of Eye Region

Hideharu Sugiyama, Hiroshi Watanabe

International Workshop on Advanced Image Technology (IWAIT 2024) ( No. 11 ) 2024年01月 [査読有り]

DOI
Attention-Enhanced Graph Convolutional Network for Assessing Rehabilitation Exercises

Smita Priyadarshani, Hiroshi Watanabe

International Workshop on Advanced Image Technology (IWAIT 2024) ( Vol. 73 ) 2024年01月 [査読有り]

DOI
Prompt-based Image Coding with Edge Information

Hiroshi Watanabe, Luoxu Jin, Taiga Hayami, Takeshi Chujoh, Tomoko Aono, Yukinobu Yasugi, Sujun Hong, Zheming Fan, Tomohiro Ikai

2023 Picture Coding Symposium, Image Media Symposium (PCSJ/IMPS2023) ( P1-12 ) 2023年11月
NeRFおよび特徴マップに基づくフレーム補間手法の特性評価

速見泰雅, 金洛旭, 渡辺裕, 中條健, 青野友子, 八杉将伸, 洪秀俊, 范哲銘, 猪飼知宏

2023年画像符号化シンポジウム・2023年映像メディア処理シンポジウム（PCSJ/IMPS2023) ( P2-05 ) 2023年11月
Post-processing Based Image Coding via Stable Diffusion

Luoxu Jin, Hiroshi Watanabe, Taiga Hayami, Takeshi Chujoh, Tomoko Aono, Yukinobu Yasugi, Sujun Hong, Zheming Fan, Tomohiro Ikai

2023 Picture Coding Symposium, Image Media Symposium (PCSJ/IMPS2023) ( P3-08 ) 2023年11月
Video Salient Object Detection Using Multi-Scale Self-Attention

Jiahao Liu, Haoyuan Liu, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2023) ( OS-VDP ) 370 - 373 2023年11月 [査読有り]

DOI
BigWavGAN: A Wave-To-Wave Generative Adversarial Network for Music Super-Resolution

Yenan Zhang, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2023) ( OS-AAT ) 596 - 6000 2023年11月 [査読有り]

DOI
Future Object Detection Using Frame Prediction

Taiju Watanabe, Takahiro Shindo, Kein Yamada, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2023) ( OS-TMR ) 948 - 949 2023年11月 [査読有り]

DOI
VVC Extension Scheme for Object Detection Using Contrast Reduction

Takahiro Shindo, Taiju Watanabe, Kein Yamada, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2023) ( OS-TMR ) 1104 - 1105 2023年11月 [査読有り]

DOI
Accuracy Consistency of Object Detection With Contrast Reduction by Pixel Value Limitation

Kein Yamada, Takahiro Shindo, Taiju Watanabe, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2023) ( OS-TMR ) 1108 - 1111 2023年11月 [査読有り]

DOI
Point Cloud Denoising and Outlier Detection with Local Geometric Structure by Dynamic Graph CNN

Kosuke Nakayama, Hiroto Fukuta, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2023) ( OS-AIP ) 156 - 159 2023年10月 [査読有り]

DOI
Three-Dimensional Baseball Strike Judgement by Monocular Video from Umpire’s Viewpoint

Hiroto Fukuta, Kosuke Nakayama, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2023) ( OS-AIP ) 160 - 163 2023年10月 [査読有り]

DOI
Feature Transfer Block for Feature Fusion in Lightweight Object Detectors

Haoyuan Liu, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2023) ( OS-AIR ) 304 - 307 2023年10月 [査読有り]

DOI
A Video Object Detection Method of ECNet Based on Frame Difference and Grid Cell Confidence

Shunsuke Akamatsu, Kei Iino, Hiroshi Watanabe, Shohei Enomoto, Xu Shi, Akira Sakamoto, Takeharu Eda

IEEE Global Conference on Consumer Electronics (GCCE2023) ( OS-VDP ) 366 - 369 2023年10月 [査読有り]

DOI
Measure the Difference Between Frames: Reconsidering the Evaluation Method in Dynamic Mesh Simplification

Yunlong Liu, Jianfeng Xu, Kei Kawamura, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2023) ( OS-VDP ) 374 - 377 2023年10月 [査読有り]

DOI
Accuracy Improvement of Object Detection in VVC Coded Video Using YOLO-v7 Features

Takahiro Shindo, Taiju Watanabe, Kein Yamada, Hiroshi Watanabe

IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET2023) ( No. 139 ) 1 - 5 2023年09月 [査読有り]
骨格情報に基づく動作対応付けの処理量削減に向けたフレーム削減手法の検討

足立翔平, 渡辺裕

電子情報通信学会総合大会 ( D-12-42 ) 2023年03月
特徴量圧縮モデルへの注意機構導入の検討

飯野景, 高橋美穂, 渡辺裕, 江田毅晴, 榎本昇平, 坂本啓, 史旭, 森永一路

電子情報通信学会総合大会 ( D-11-12 ) 2023年03月
VVCとCNNを組み合わせたYOLO-v7のための映像符号化手法

進藤嵩紘, 渡部泰樹, 渡辺裕

電子情報通信学会総合大会 ( D-11-9 ) 2023年03月
目の特徴量を付与したアピアランスベースの視線推定モデルの検討

杉山秀治, 渡辺裕

電子情報通信学会総合大会 ( D-12-52 ) 2023年03月
局所幾何構造を考慮した点群の外れ値検出

中山光典, 原潤一, 渡辺裕

電子情報通信学会総合大会, ( D-12-35 ) 2023年03月
骨格絶対長を考慮した三次元姿勢推定法の検討

福田大翔, 中島聖, 渡辺裕

電子情報通信学会総合大会 ( D-12-67 ) 2023年03月
符号化映像のブロック分割を用いた超解像ネットワークの検討

矢野礼美菜, 劉韞, 渡辺裕, 鈴木拓矢, 范哲銘, 中條健, 猪飼知宏

電子情報通信学会総合大会 ( D-11-10 ) 2023年03月
YOLOVを用いた物体予測検出の一検討

渡部泰樹, 進藤嵩紘, 渡辺裕

電子情報通信学会総合大会 ( D-11-11 ) 2023年03月
Research on Video Captioning with a Late Fusion Based Multimodal Transformer Network

Fei Bao, Hiroshi Watanabe

IPSJ National Convension ( IQ-08 ) 2-195 - 2-196 2023年03月
GANおよびTemplete Matchingを用いた水滴除去手法の検討

有本麻梨香, 原潤一, 渡辺裕

情報処理学会第120回オーディオビジュアル複合情報処理研究会 Vol.2023-AVM-120 ( No.3 ) 1 - 4 2023年03月
骨格の左右対称性を考慮した3次元姿勢推定法の検討

中島聖, 福田大翔, 渡辺裕

情報処理学会第120回オーディオビジュアル複合情報処理研究会 AVM-120 ( No. 4 ) 1 - 2 2023年03月
V-PCCエンコーダにおけるテクスチャ画像のパッチ間平滑化に関する一検討

足立翔平, 木谷佳隆, 河村圭, 渡辺裕

電子情報通信学会画像工学研究会(IE) ( IE2022-54 ) 17 - 18 2023年02月 [招待有り]
Category-based Memory Bank Design for Traffic Surveillance in Context R-CNN

Miho Takahashi, Kei Iino, Hiroshi Watanabe, Ichiro Morinaga, Shohei Enomoto, Xu Shi, Akira Sakamoto, Takeharu Eda

International Workshop on Advanced Image Technology 2023 (IWAIT 2023) ( Vol.12592 ) 125920G-1 - 125920G-4 2023年01月 [査読有り]

DOI
Novel CNN Approach for Video Prediction Based on FitVid

Taiju Watanabe, Takahiro Shindo, Hiroshi Watanabe

Proceedings of SPIE for IWAT2023 ( Vol. 12592 ) 1259210-1 - 1259210-4 2023年01月 [査読有り]

DOI
Memory Bandwidth Constrained Overlapped Block Motion Compensation for Video Coding

Yoshitaka Kidani, Kyohei Ueno, Kei Kawamura, Hiroshi Watanabe

ITE Transactions on Media Technology and Applications (MTA), Vol. 11 ( No. 1 ) 1 - 12 2023年01月 [査読有り]

DOI
A Prior-Guided Face Image Super-Resolution Network Based on Attention Mechanism

Yun Liu, Remina Yano, Hiroshi Watanabe, Takuya Suzuki, Zheming Fan, Takeshi Chujoh, Tomohiro Ikai

International Workshop on Advanced Image Technology 2023 (IWAIT 2023) 12592 ( No. 7 ) 2023年 [査読有り]

DOI
Application of Multi-modal Fusion Attention Mechanism in Semantic Segmentation

Yunlong Liu, Osamu Yoshie, Hiroshi Watanabe

Proceedings of the Asian Conference on Computer Vision (ACCV2022) 1245 - 1264 2022年11月 [査読有り]

DOI
Super Resolution for QR Code Images

Takahiro Shindo, Taiju Watanabe, Remina Yano, Marika Arimoto, Miho Takahashi, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2022) ( OS-TMR(2 ) 274 - 277 2022年10月 [査読有り]

DOI
符号化雑音環境下における物体検出精度の改善手法

進藤嵩紘, 渡部泰樹, 渡辺裕

2022年画像符号化シンポジウム・2022年映像メディア処理シンポジウム ( P1-01 ) 14 - 15 2022年10月
V-PCCエンコーダにおけるテクスチャ画像のパッチ間平滑化に関する一検討

足立翔平, 木谷佳隆, 河村圭, 渡辺裕

2022年画像符号化シンポジウム・2022年映像メディア処理シンポジウム ( P2-09 ) 72 - 73 2022年10月
Selective Use of Skeletal Information to Reduce Computational Complexity of Motion Matching

Shohei Adachi, Ryohei Osawa, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2022) ( OS-ADE ) 68 - 71 2022年10月 [査読有り]

DOI
Motion Matching Based on Pose Similarity Using Movement of Body Parts

Ryohei Osawa, Shohei Adachi, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2022) ( OS-ADE ) 83 - 86 2022年10月 [査読有り]

DOI
Inter-Feature-Map Differential Coding of Surveillance Video

Kei Iino, Miho Takahashi, Hiroshi Watanabe, Ichiro Morinaga, Shohei Enomoto, Xu Shi, Akira Sakamoto, Takeharu Eda

IEEE Global Conference on Consumer Electronics (GCCE2022) ( OS-ASP(1) ) 293 - 296 2022年10月 [査読有り]

DOI
Improving Accuracy of Position Information in Egocentric Pose Estimation Methods

Sho Nakashima, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2022) ( OS-DSC ) 688 - 691 2022年10月 [査読有り]

DOI
Geometric Partitioning Mode with Inter and Intra Prediction for Beyond Versatile Video Coding

Yoshtaka Kidani, Haruhisa Kato, Kei Kawamura, Hiroshi Watanabe

EICE Transaction Information & Systems Vol. E105-D ( No.10 ) 1691 - 1703 2022年10月 [査読有り]
CNNを応用した超解像技術とデノイズ処理による顕微鏡画像の高精細化技術の開発

矢野仁愛, 石川智愛, 渡辺裕, 安井正人

情報処理学会第84回全国大会 ( 6Q-06 ) 2-259 - 2-260 2022年03月
選手の姿勢類似度に基づくスポーツ動作照合

大澤遼平, 渡辺裕

情報処理学会オーディオビジュアル複合情報処理研究会 AVM-116 ( No. 5 ) 1 - 3 2022年02月
正常画像を用いた以上画像検知手法の検討

山野井祐介, 原潤一, 渡辺裕

情報処理学会オーディオビジュアル複合情報処理研究会 AVM-116, ( No.7 ) 1 - 4 2022年02月
GAN Using Capsule Network for Discriminator and Generator

Kanako Marusaki, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2021) ( OS-AIP(2) ) 726 - 729 2021年10月 [査読有り]

DOI
An Image Inpainting Method Considering Edge Connectivity of Defects

Marika Arimoto, Junichi Hara, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2021) ( OS-TMR ) 127 - 128 2021年10月 [査読有り]

DOI
Face Image Generation for Illustration by WGAN-GP Using Landmark Information

Miho Takahashi, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2021) ( OS-DSC ) 989 - 990 2021年10月 [査読有り]
Text Image Super Resolution Using Deep Attention Neural Network

Yun Liu, Remina Yano, Hiroshi Watanabe, Takuya Suzuki, Takeshi Chujoh, Tomohiro Ikai

IEEE Global Conference on Consumer Electronics (GCCE2021) ( OS-TMR(3) ) 306 - 308 2021年10月 [査読有り]

DOI
Variant Graph Convolutional Networks for Skeleton-Based Hand Action Recognition

Khin Sabai Htwe, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2021) ( OS-AIP(2) ) 730 - 731 2021年10月 [査読有り]

DOI
Multi Frame Super-Resolution for Versatile Video Coding

Toshiya Hori, Remina Yano, Hiroshi Watanabe, Takeshi Chujoh, Tomohiro Ikai, Eiichi Sasaki, Takuya Suzuki, Norio Ito

SPIE Procedings for IWAIT2021 ( 11766 ) 1 - 6 2021年04月 [査読有り]

DOI
Creation of 3D Environmental Map using Omnidirectional Camera Images

Yuya Kato, Junichi Hara, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2021) ( OS-VDP ) 522 - 525 2021年03月 [査読有り]

DOI
Generation and Extraction of Color Palettes with Adversarial Variational Auto Encoders

Ahmad Moussa, Hiroshi Watanabe

6th International Congress on Information and Communication Technology (ICICT2021) 1 - 8 2021年02月 [査読有り]

DOI
A Study of Video Super-Resolution Method Using Video Coded Data as Training Data

Remina Yano, Yun Liu, Hiroshi Watanabe, Takuya Suzuki, Takeshi Chujoh, Tomohiro Ikai

2021 IEEE 10th Global Conference on Consumer Electronics, GCCE 2021 278 - 279 2021年 [査読有り]

　概要を見る

This paper presents a study of video super resolution method by applying Deformable Convolution network to coded video. It is reported the effectiveness of using coded video as training data, and numerical and visual results of coded fine-tuned model. From those results, it is discussed the relationship between about characteristic of training data and especially in video's framerate. There it is shown that video sequence which has similar framerate with training data can perform higher PSNR, since low framerate means the motion between frames is large and Deformable Convolution can learn that large motion.

DOI
Multi Frame Super-Resolution for Versatile Video Coding

Toshiya Hori, Remina Yano, Hiroshi Watanabe, Takeshi Chujoh, Tomohiro Ikai, Eiichi Sasaki, Takuya Suzuki, Norio Ito

International Workshop on Advanced Image Technology (IWAIT2021) ( 45 ) 2021年01月 [査読有り]

DOI
超解像とMPEG NNR 深層学習メタデータを利用した符号化映像伝送

鈴木拓矢, 佐々木瑛一, 中條健, 猪飼知宏, 渡辺裕

2020画像符号化シンポジウム ( P3-B-1 ) 2020年11月
Pitching Motion Matching Based on Pose Similarity Using Dynamic Time Warping

Ryohei Osawa, Takaaki Ishikawa, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2020) 140 - 144 2020年09月 [査読有り]

DOI
Pseudo Ground Truth Segmentation Mask to Improve Video Prediction Quality

Muchien Hsu, Juichun Shyur, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2020) 151 - 152 2020年09月 [査読有り]

DOI
Facial Age Estimation by Curriculum Learning

Wei Wang, Takaaki Ishikawa, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2020) 203 - 204 2020年09月 [査読有り]

DOI
ECNet: A Fast, Accurate, and Lightweight Edge-Cloud Network System Based on Cascading Structure

Libo Hu, Tao Wang, Hiroshi Watanabe, Shohei Enomoto, Xu Shi, Akira Sakamoto, Takeharu Eda

IEEE Global Conference on Consumer Electronics (GCCE2020) 259 - 262 2020年09月 [査読有り]

DOI
Animal Behavior Classification Using DeepLabCut

Shiori Fujimori, Takaaki Ishikawa, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2020) 259 - 262 2020年09月 [査読有り]

DOI
Study on Improvement of Estimation Accuracy in Pose Estimation Model Using Time Series Correlation

Atsuya Yamakawa, Takaaki Ishikawa, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2020) 570 - 573 2020年09月 [査読有り]

DOI
Rollover Detection of Infants Using Posture Estimation Mode

Ayaka Okuno, Takaaki Ishikawa, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2020) 688 - 691 2020年09月 [査読有り]

DOI
Self-Attention Based Neural Network for Few Shot Classification

Ravi Jain, Hiroshi Watanabe

IEEE Global Conference on Consumer Electronics (GCCE2020) 739 - 740 2020年09月 [査読有り]

DOI
Transfer Rate Estimation in Edge-Cloud Neural Network Solution for Object Detection

Libo Hu, Tao Wang, Yucheng Zhou, Hiroshi Watanabe, Shohei Enomoto, Xu Shi, Akira Sakamoto, Takeharu Eda

IEICE General Conference ( D-11-20 ) 2020年03月
機械学習に基づくマルチステップ超解像を応用した高解像度化処理手法

矢野仁愛, 渡辺裕

電子情報通信学会総合大会 ( D-11-23 ) 2020年03月
符号化ノイズを含む画像へのPSNR適応超解像

堀隼也, 龚子臣, 梅田聖也, 渡辺裕, 中條健, 猪飼知宏, 佐々木瑛一, 伊藤典男

電子情報通信学会総合大会 ( D-11-33 ) 2020年03月
Exit-Point Setting in Edge-Cloud Solution for Object Detection

Tao Wang, Libo Hu, Hiroshi Watanabe, Shohei Enomoto, Xu Shi, Arkira Sakamoto, Takeharu Eda

IEICE General Conference ( D-12-14, ) 2020年03月
Expression Recognition Using Curriculum Learning Method

Wei Wang, Takaaki Ishikawa, Hiroshi Watanabe

IEICE General Conference ( D-12-49 ) 2020年03月
An Automatic Music Arrangement System Using Machine Learning

Tianyi Gao, Hangyu Song, Ahmad Moussa, Hiroshi Watanabe

IEICE General Conference ( D-14-3 ) 2020年03月
時系列相関性を用いた姿勢推定モデルの精度向上

山川敦也, 石川孝明, 渡辺裕

情報処理学会全国大会 ( 1R-05 ) 2020年03月
OpenPoseを用いた投球シーン照合の一検討

大澤遼平, 石川孝明, 渡辺裕

情報処理学会全国大会 ( 1R-06 ) 2020年03月
Image Quality Enhancement with Machine Learning Based Multi-Step Super-Resolution

Niai Yano, Hiroshi Watanabe

IEEE The 2nd International Conference on Artificial Intelligence in Information and Communication (ICAIIC 2020) ( 3B-6 ) 1 - 5 2020年02月 [査読有り]

DOI
Audio Translation with Conditional Generative Adversarial Networks

Ahmad Yunis Moussa, Hiroshi Watanabe

IEEE The 2nd International Conference on Artificial Intelligence in Information and Communication (ICAIIC 2020) ( 9B-1 ) 1 - 5 2020年02月 [査読有り]

DOI
Automatic Classification of Manga Characters using Density-Based Clustering

H. Yanagisawa, K. Kyogoku, R. Jain, H. Watanbe

International Workshop on Advanced Image Technology, IWAIT 2020 ( No. 20 ) 1 - 6 2020年01月 [査読有り]
CNN-based Super-Resolution Adapted to Quantization Parameters

T. Hori, Z. Gong, H. Watanabe, T. Ikai, T. Chujoh, E. Sasaki, N. Ito

International Workshop on Advanced Image Technology, IWAIT 2020 ( No. 42 ) 1 - 5 2020年01月 [査読有り]
A Selective Fusion Module for Video Super Resolution with Recurrent Architecture

Z. Gong, T. Hori, H. Watanabe, T. Ikai, T. Chujoh, E. Sasaki, N. Ito

International Workshop on Advanced Image Technology, IWAIT 2020 ( No. 43 ) 1 - 6 2020年01月 [査読有り]
OpenPoseとDynamic Time Warpingを用いた投球シーン照合の一検討

大澤遼平, 石川孝明, 渡辺裕

2019年映像情報メディア学会冬季大会 ( 11A3 ) 2019年12月
異なったピッチングスタイルに対するシーン照合の一検討

大澤遼平, 石川孝明, 渡辺裕

2019年映像情報メディア学会スポーツ情報処理時限研究会・メディア工学研究会 ME2019-117 ( Vol.43, No.39 ) 13 - 14 2019年11月
Capsule Network を生成器に使用した敵対的生成ネットワークの一検討

丸㟢佳奈子, 渡辺裕

2019年画像符号化シンポジウム・2019年映像メディア処理シンポジウム ( P-1-03 ) 2019年11月
Auto Arrangement and Threshold to Database for Face Recognition

MuChien Hsu, Ju Chun Shyur, Hiroshi Watanabe

( P-1-11 ) 2019年11月
Few Shot Classification by Learning to Learn

Ravi Jain, Hideaki Yanagisawa, Hiroshi Watanabe

2019年画像符号化シンポジウム・2019年映像メディア処理シンポジウム ( P-1-15 ) 2019年11月
敵対的生成ネットワークを用いた和菓子画像生成の検討

大山優香, 渡辺裕

2019年11月
全天球カメラを用いた 3 次元再構成と点群の位置合わせ手法の一検討

加藤裕也, 原潤一, 渡辺裕

2019年画像符号化シンポジウム・2019年映像メディア処理シンポジウム ( P-3-17 ) 2019年11月
密度ベースクラスタリングを用いた漫画キャラクタ顔画像の分類に関する一検討

柳澤秀彰, 京極健悟, Ravi Jain, 渡辺裕

2019年画像符号化シンポジウム・2019年映像メディア処理シンポジウム ( P-4-07 ) 2019年11月
Two-side Network for Person Detection and Person Re-identification

Libo Hu, Tao Wang, Hiroshi Watanabe

2019年画像符号化シンポジウム・2019年映像メディア処理シンポジウム ( P-4-12 ) 2019年11月
縮小拡大と符号化によるノイズを含む画像の超解像に関する検討

堀隼也, 龚子臣, 梅田聖也, 渡辺裕, 中條健, 佐々木瑛一, 猪飼知宏, 伊藤典男

2019年画像符号化シンポジウム・2019年映像メディア処理シンポジウム ( P-4-12 ) 2019年11月
Land cover classification and change detection analysis of multispectral satellite images using machine learning

Nyein Soe Thwal, Takaaki Ishikawa, Hiroshi Watanabe

Image and Signal Processing for Remote Sensing XXV 56 - 56 2019年10月 [査読有り]

DOI
漫画キャラクタ顔画像のクラスタリングにおける特徴表現の改良

柳澤, 京極, Jain, 渡辺

FIT2019 (第18回科学技術フォーラム) H-029 2019年09月
Hand Joints Detection on Noisy Hand Poses Using Variational Autoencoder

Khin Sabai Htwe, Takaaki Ishikawa, Hiroshi Watanabe

The 6th IIEEJ International Conference on Image Electronics and Visual Computing (IEVC 2019) 4B-2 2019年08月 [査読有り]
全天球カメラ映像からの3次元再構成と点群合成手法の一検討

加藤, 原, 渡辺

映像情報メディア学会年次大会 34B-3 2019年08月
学習型超解像と再構成型超解像の異なる組み合わせ順による時系列画像の画質改善

堀, 加藤, 渡辺

映像情報メディア学会年次大会 32B-1 2019年08月
スポーツシーン照合に関する一検討

大澤, 石川, 渡辺

映像情報メディア学会年次大会 31C-3 2019年08月
Deep Learning Based Speaker Recognition System

Hangyu Song, Hiroshi Watanabe

ITE Annual Conference 13B-1 2019年08月
Analyzing Hand Action Recognition using Spatial Temporal Graph Convolutional Networks (ST-GCN)

Khin Sabai Htwe, Takaaki Ishikawa, Hiroshi Watanabe

ITE Annual Conference 11B-3 2019年08月
画像変換による全天球カメラ映像のLSD-SLAMへの適用と点群合成

加藤, 原, 渡辺

情報処理学会オーディオ・ビジュアル複合情報処理（AVM）研究会 AVM105, No.14, pp.1-4 2019年06月
点群ノイズ処理のための点群特徴抽出

赤塚, 原, 渡辺

情報処理学会オーディオ・ビジュアル複合情報処理（AVM）研究会 AVM105, No.13, pp.1-4 ( 16 ) 73 - 75 2019年06月

CiNii
Manga character clustering with DBSCAN using fine-tuned CNN model

Hideaki Yanagisawa, Takuro Yamashita, Watanabe Hiroshi

International Workshop on Advanced Image Technology (IWAIT) 2019 24 - 24 2019年03月 [査読有り]

DOI
骨格推定と機械学習を利用した日本指文字の分類

小林, 石川, 渡辺

電子情報通信学会総合大会 D-12-62 2019年03月
機械学習による日本語話者の自動読唇の基礎検討

浅見, 石川, 渡辺

電子情報通信学会総合大会 D-12-61 2019年03月
機械学習を用いた複数ダンサーの3次元基準モデル作成

稲田, 石川, 渡辺

電子情報通信学会総合大会 D-12-60 2019年03月
スポーツ映像から取得した骨格座標データに対するアラインメント

横井, 石川, 渡辺

電子情報通信学会総合大会 D-12-59 2019年03月
GANによる和菓子画像の生成と評価

大山, 渡辺

電子情報通信学会総合大会 D-12-40 2019年03月
点群特徴抽出に関する一検討

赤塚, 原, 渡辺

電子情報通信学会総合大会 D-12-39 2019年03月
全天球画像における対応直線検出

青木, 原, 渡辺

電子情報通信学会総合大会 D-12-36 2019年03月
LSD-SLAMを用いた全天球カメラ映像からの３次元再構成

加藤, 原, 渡辺

電子情報通信学会総合大会 D-12-35 2019年03月
単眼RGBカメラを用いた手話の動作解析

柳澤, 石川, 渡辺

電子情報通信学会総合大会 D-12-34 2019年03月
Content-Independent Speaker Recognition System based on Neural Networks

H. Song, H. Watanabe

IEICE General Conference D-12-96 2019年03月
A Study on GAN Using Capsule Network

K. Marusaki, H. Watanabe

IEICE General Conference D-12-8 2019年03月
超解像によるビット深度拡張の特性

梅田, 渡辺, 猪飼, 橋本, 中條, 伊藤

電子情報通信学会総合大会 D-12-7 2019年03月
マルチステップ超解像技術による高画質化処理

矢野, 渡辺

電子情報通信学会総合大会 D-11-9 2019年03月
超解像を用いた時系列画像の画質改善

堀, 渡辺

電子情報通信学会総合大会 D-11-6 2019年03月
Comparison of Machine Learning Classifiers for Land Cover Mapping in Google Earth Engine

Nyein Soe Thwal, T. Ishikawa, H. Watanabe

IEICE General Conference BS-4-21 2019年03月
An Evaluation of The Impact of Dataset Bias in Pretrained VGG Network on The Performance of Neural Network Based Style Transfer

Z. Gong, H. Watanabe

IEICE General Conference BS-4-19 2019年03月
Complex Character Retrieval from Comics using Deep Learning

R. Jain, H. Watanabe

IEICE General Conference BS-4-18 2019年03月
主成分分析を用いた点群特徴抽出に関する一検討

赤塚, 原, 渡辺

情報処理学会オーディオビジュアル複合情報処理研究会 AVM104-003 2019年02月
スポーツ情報処理と映像表現への期待

渡辺裕

NHK技研 R&D No.173, pp.2-3 2019年01月
主要キャラクタの抽出を目的とした漫画キャラクタ画像のクラスタリング

柳澤, 山下,渡辺

映像情報メディア学会オンライン・ジャーナル Vol. 73, No.1, pp.199-204 2019年01月
Automatic Self-Improvement Scheme in Optical Flow-Based Motion Estimation for Sequential Fisheye Images

Arief Suryadi Satyawan, Junichi Hara, Hiroshi Watanabe

ITE Transactions on Media Technology and Applications Vol. 7, No. 1, pp.20-35 ( 1 ) 20 - 35 2019年01月 [査読有り]

DOI
Joint Super-Resolution and Bit Depth Extention by DNN

S. Umeda, H. Watanabe, T. Ikai, T. Hashimoto, T. Chujo, N. Ito

IWAIT-FMIA 2019 No.201 2019年01月 [査読有り]

DOI
スポーツ映像から抽出した時系列多次元データのアラインメント

横井, 石川, 渡辺

映像情報メディア学会スポーツ情報処理時限研究会 ME2018-115 (SIP2018-03-12), pp.43-46 2018年12月
縦方向の回転運動に対するOpenPose

川島, 石川, 渡辺

映像情報メディア学会スポーツ情報処理時限研究会 ME2018-110 (SIP2018-03-07), pp.25-28 2018年12月
OpenPoseを用いた複数ダンサーの姿態評価の基礎検討

稲田, 石川, 渡辺

映像情報メディア学会スポーツ情報処理時限研究会 ME2018-109 (SIP2018-03-06), pp.21-24 2018年12月
Mapping Urban Land Cover by Using Landsat Sentinel Images and Open Social Data

Nyein Soe Thwal, Takaaki Ishikawa, Hiroshi Watanabe

ITE Winter Annual Convention 22D-4 2018年12月
多段超解像処理による視覚的な画質改善

矢野, 渡辺

映像情報メディア学会冬季大会 14D-5 2018年12月
符号化画像に対する超解像処理の特性改善について

梅田, 矢野, 渡辺

映像情報メディア学会冬季大会 14D-4 2018年12月
OpenPoseを用いた回転不変動作の姿勢推定

川島, 石川, 渡辺

映像情報メディア学会冬季大会 13D-3 2018年12月
OpenPoseを用いた複数ダンサーの動作の同期および姿態評価

稲田, 石川, 渡辺

映像情報メディア学会冬季大会 13D-2 2018年12月
スポーツ映像における動作類似度を利用したイベント認識

横井, 石川, 渡辺

映像情報メディア学会冬季大会 13D-1 2018年12月
Improving Detection of Hand Joints in RGB Images using Maximum Confidence Value

Khin Sabai Htwe, Takaaki Ishikawa, Hiroshi Watanabe

ITE Winter Annual Convention 12D-5 2018年12月
生成画像品質を考慮したCapsGANによるデータ拡張

松田, 丸嵜, 渡辺

映像情報メディア学会冬季大会 12D-4 2018年12月
Comparison of CNN based Illustration Drawing-Style Classification Systems

Hangyu Song, Hiroshi Watanabe

ITE Winter Annual Convention 12D-3 2018年12月
全天球画像特徴を用いた3D点群ノイズ整形

赤塚, 原, 渡辺

映像情報メディア学会冬季大会 11D-5 2018年12月
全天球画像を用いたカメラ自己位置推定技術と環境地図作成の同時処理技術

青木, 渡辺, 原

映像情報メディア学会冬季大会 11D-4 2018年12月
OpenPoseを用いた複数ダンサーの動作のずれ評価の基礎検討

稲田, 石川, 渡辺

2018年画像符号化シンポジウム・2018年映像メディア処理シンポジウム（PCSJ/IMPS2018) P-5-7 2018年11月
複数の出力画像を用いた機械学習に基づく超解像処理技術

矢野, 渡辺

2018年画像符号化シンポジウム・2018年映像メディア処理シンポジウム（PCSJ/IMPS2018) P-4-8 2018年11月
全天球カメラを用いた自己位置推定と環境地図作成の基礎検討

青木, 渡辺, 原

2018年画像符号化シンポジウム・2018年映像メディア処理シンポジウム（PCSJ/IMPS2018) P-4-7 2018年11月
全天球画像特徴を用いた3D点群境界検出

赤塚, 原, 渡辺

2018年画像符号化シンポジウム・2018年映像メディア処理シンポジウム（PCSJ/IMPS2018) P-3-9 2018年11月
機械学習による日本語話者の自動読唇の基礎検討

浅見, 石川, 渡辺

2018年画像符号化シンポジウム・2018年映像メディア処理シンポジウム（PCSJ/IMPS2018) P-3-8 2018年11月
Hand Joints Detection Using Initial Hand Detector on Egocentric Views

Khin Sabai Htwe, Takaaki Ishikawa, Hiroshi Watanabe

2018年画像符号化シンポジウム・2018年映像メディア処理シンポジウム（PCSJ/IMPS2018) P-3-3 2018年11月
スポーツ映像における身体座標情報を利用したイベント認識

横井, 石川, 渡辺

2018年画像符号化シンポジウム・2018年映像メディア処理シンポジウム（PCSJ/IMPS2018) P-2-10 2018年11月
回転動作に対するOpenPoseを用いた回転不変の姿勢推定

川島, 石川, 渡辺

2018年画像符号化シンポジウム・2018年映像メディア処理シンポジウム（PCSJ/IMPS2018) P-2-7 2018年11月
Land Cover Classification Using Landsat and Sentinel Images by Integrating Pixel and Object-based Segmentation Approach

Nyein Soe Thwal, Takaaki Ishikawa, Hiroshi Watanabe

2018年画像符号化シンポジウム・2018年映像メディア処理シンポジウム（PCSJ/IMPS2018) P-2-3 2018年11月
Comparison of CNN based Illustration Drawing-Style Classification Systems

Hangyu Song, Hiroshi Watanabe

2018年画像符号化シンポジウム・2018年映像メディア処理シンポジウム（PCSJ/IMPS2018) P-1-10 2018年11月
映像符号化における超解像処理画像の画質改善

梅田, 矢野, 渡辺

2018年画像符号化シンポジウム・2018年映像メディア処理シンポジウム（PCSJ/IMPS2018) P-1-8 2018年11月
コミックのセリフの感情分析

盛, 渡辺

電子情報通信学会ソサイティ大会 A-10-18 2018年09月
The Performance of Self-Enhancement Method in Optical Flow-Based Motion Estimation for Sequential Fisheye Images with Sudden Movement of the Objects

Arief Suryadi Satyawan, Junichi Hara, Hiroshi Watanabe

IEICE Society Conference BS-7-8 2018年09月
CNNを用いた漫画キャラクタ顔画像クラスタリングシステムの改良に関する位置検討

柳澤, 山下, 渡辺

映像情報メディア学会年次大会 12B-2 2018年08月
HDR video super-resolution for future video coding

Seiya Umeda, Niai Yano, Hiroshi Watanabe, Tomohiro Ikai, Takeshi Chujoh, Norio Ito

2018 International Workshop on Advanced Image Technology, IWAIT 2018 1 - 4 2018年05月 [査読有り]

　概要を見る

New coding tools are emerging in Future Video Coding (FV C). A novel video coding system, which combines FVC-era coding tools with super-resolution is proposed, aiming to demonstrate the state of the art coding results in ultra-high definition video coding. Experimental results show the proposed system outperforms convention system in both SDR and HDR images.

DOI
Scene Flow Estimation from Squential Stereo Fisheye Images

Arief Suryadi Satyawan, Junichi Hara, Hiroshi Watanabe

IEEE International Conference on Signals and Systems 2018, (ICSygSys2018) DSP2-5 2018年05月 [査読有り]

DOI
Data Driven Cyber-Physical System for Landslide Detection

Zhi Liu, Toshitaka Tsuda, Hiroshi Watanabe, Satoko Ryuo, Nagateru Iwasawa

Mobile Networks and Applications No.11036 1 - 12 2018年03月

　概要を見る

Natural disaster is one of the most important research topics worldwide. In this paper, a data driven cyber-physical system is introduced to detect landslides. This system is composed of Wi-Sun acceleration sensors, which can detect the acceleration of the nearby environment in 3D domain, and the sensors are linked with the router (act as ’sink’ node) via Wi-Sun transmission (i.e. IEEE802.15.4g). The details of the detection system are explained and the landslide detection mechanism with low computational complexity is proposed. A traffic reduction method is proposed thereafter to help reduce the data needed for transmission by exploring the intra-correlations of the sensor data. This method can save the energy consumption without degrading the detection performance. Field test is conducted and the results show that the landslide can be detected and amount of data to be transmitted can be reduced, which verifies the system’s effectiveness.

DOI
深層学習を用いたマンガキャラクタの検出における顔変形の影響評価

山下, 柳澤, 渡辺

情報処理学会全国大会 5Y-03 2018年03月
全天球カメラを用いた位置推定手法及び距離測定手法の検討

青木, 渡辺, 宋

情報処理学会全国大会 2U-05 2018年03月
LiDARと全天球画像を用いた三次元再構成

赤塚, 渡辺

情報処理学会全国大会 2U-04 2018年03月
FVC量子化パラメタ制御による超解像の特性

梅田, 矢野, 渡辺, 猪飼, 中條, 伊藤

情報処理学会全国大会 2U-03 2018年03月
方向性を考慮した超解像処理技術によるFVC符号化画像の画質改善

矢野, 梅田, 渡辺, 猪飼, 中條, 伊藤

情報処理学会全国大会 2U-02 ( 1 ) 259 - 260 2018年03月

　概要を見る

近年,出力機器における解像度が増加し,より高精細な画像の出力が求められるようになってきている.この高精細な画像を取得する手法として,現在,長年研究されてきた超解像処理技術を符号化技術に適応する手法が研究され始めている.超解像処理技術とは入力された低解像度画像中に欠損した高周波成分を付加することによって高解像度画像を取得する技術であり,中でも学習型超解像は高画質な画像を取得できる手法として知られている.我々はこの学習型超解像であるAplus,ScSR,SRCNNについて方向性を考慮した超解像処理を行い,これらをH.265符号化画像に適応する手法を考案し,符号化画像の更なる画質向上について検討する.

CiNii
OpenPoseを用いた複数人のダンスの一致度評価

稲田, 渡辺, 石川

情報処理学会全国大会 2U-01 2018年03月
野球選手のスイング解析による打撃成績予測

横井, 石川, 渡辺

電子情報通信学会総合大会 D-12-45 2018年03月
姿勢推定を用いた歩きスマホ検出に用いる歩行検出手法の改善

加藤, 渡辺

電子情報通信学会総合大会 D-12-44 2018年03月
Prediction of Next Day’s Sea-Level Pressure by Support Vector Regression

Tin Nilar Lin, Hiroshi Watanabe

IEICE General Conference D-20-11 2018年03月
Instance Segmentation on Omni-directional Images Based on Mask R-CNN

Mengcheng Song, Junichi Hara, Hiroshi Watanabe

IEICE General Conference BS-2-13 2018年03月
An Improvement of Motion Estimation on Sequential Fisheye Images

Arief Suryadi Satyawan, Junichi Hara, Hiroshi Watanabe

IEICE General Conference BS-2-12 2018年03月

J-GLOBAL
Robust 3D Reconstruction with Omni-directional Camera based on Structure from Motion

Mengcheng Song, Junich Hara, Hiroshi Watanabe

International Workshop on Advanced Image Technology (IWAIT2018) No.105, pp.1-4 2018年01月 [査読有り]
Deep Learning Based Super-Resolution for Future Video Coding

Seiya Umeda, Niai Yano, Hiroshi Watanabe, Tomohiro Ikai, Takeshi Chujoh, Norio Ito

International Workshop on Advanced Image Technology (IWAIT2018) No.89, pp.1-4 2018年01月 [査読有り]
Scene Flow from Stereo Fisheye Images

Arief Suryadi Satyawan, Junichi Hara, Hiroshi Watanabe

International Workshop on Advanced Image Technology (IWAIT2018) No.77, pp.1-4 2018年01月 [査読有り]
A Study on Object Detection Method from Manga Images Using CNN

Hideaki Yanagisawa, Takuro Yamashita, Hiroshi Watanabe

International Workshop on Advanced Image Technology (IWAIT2018) No.16, pp.1-4 2018年01月 [査読有り]
姿勢推定と把持物体認識を用いた歩きスマホ認識手法の検討

加藤,渡辺

2017年映像情報メディア学会冬季大会 22B ( 2 ) 2017年12月
関節位置情報を用いた自転車選手識別手法に関する一検討

高木, 石川,渡辺

2017年映像情報メディア学会冬季大会 25C ( 5 ) 2017年12月
マンガキャラクター顔画像クラスタリングの改良のおける一検討

柳澤, 山下,渡辺

2017年映像情報メディア学会冬季大会 22B ( 8 ) 2017年12月
画像の方向性を考慮した超解像処理技術

矢野,梅田, 渡辺,猪飼, 中條,伊藤

2017年映像情報メディア学会冬季大会 14B ( 2 ) 14B-2 2017年12月

DOI
Weather data estimation by sensitive features selection

Tin Nilar Lin, H. Watanabe

2017 ITE Winter Annual Convention 13B ( 6 ) 2017年12月
多段ニューラルネットワークを用いた動物の品種識別

渡部, 渡辺

2017年映像情報メディア学会冬季大会 13B ( 1 ) 2017年12月
CNNを用いたマンガオブジェクト検出手法の比較

柳澤,渡辺

2017年画像符号化シンポジウム・2017年映像メディア処理シンポジウム（PCSJ/IMPS2017) P1 ( 10 ) 2017年11月
深層学習福笑い

山下, 柳澤,渡辺

2017年画像符号化シンポジウム・2017年映像メディア処理シンポジウム（PCSJ/IMPS2017) P5 ( 2 ) 2017年11月
姿勢情報を用いた自転車選手識別手法に関する一検討

高木, 石川,渡辺

映像情報メディア学会メディア工学研究会（スポーツ情報処理時限研究会合同開催） ME2017 ( 121 ) 25 - 27 2017年11月
畳込みニューラルネットによるマンガオブジェクト認識メカニズムの一検討

柳澤, 山下,渡辺

電子情報通信学会，パターン認識・メディア理解研究会 PRMU2017 ( 79 ) 2017年10月
V-SLAMとマーカARを併用したペン型デバイスによる空中描画システム

小林, 石川,渡辺

映像情報メディア学会年次大会 34B ( 3 ) 2017年09月
ステレオ全天球カメラからの距離計測の基礎検討

青木,松田, 原,渡辺

映像情報メディア学会年次大会 33B ( 5 ) 2017年09月

J-GLOBAL
姿勢情報を用いた自転車選手識別の基礎検討

高木, 石川,渡辺

映像情報メディア学会年次大会 14D ( 4 ) 2017年09月
関節角度情報を用いた歩きスマホ認識

加藤,渡辺

映像情報メディア学会年次大会映像情報メディア学会年次大会 13D ( 4 ) 2017年09月
ボロノイ図に基づく3次元優勢領域によるパスコース評価について

横井,浅見, 石川,渡辺

映像情報メディア学会年次大会 13D ( 3 ) 2017年09月
Ship Classification Using Faster Region Convolution Neural Network (Faster R-CNN) for Automatic Identification of Marine Vessels

K. J. Ngeno, H. Yanagisawa, H. Watanabe

FIT2017(第16回科学技術フォーラム) H ( 39 ) 2017年09月
生成画像品質を考慮したDCGANを用いたデータオーギュメンテーション

渡部,渡辺

FIT2017(第16回科学技術フォーラム) H ( 38 ) 2017年09月
姿勢推定による歩きスマホ認識のための歩行検出

加藤,渡辺

FIT2017(第16回科学技術フォーラム) H ( 36 ) 2017年09月
Water Level Prediction for Disaster Management Using Machine Learning Models

Tin Nilar Lin, H. Watanabe

FIT2017(第16回科学技術フォーラム) H ( 29 ) 2017年09月
Motion Estimation on Fish-Eye Images Using Modified Motion Model

Arief Suryadi Satyawan, J. Hara, H. Watanabe

FIT2017(第16回科学技術フォーラム) H ( 23 ) 2017年09月
Deep Learning特徴量を用いたマンガキャラクター顔画像の分類

柳澤,渡辺

FIT2017(第16回科学技術フォーラム) H ( 1 ) 2017年09月
姿勢推定による肘と肩の角度情報を用いた歩きスマホ認識

加藤, 渡辺

映像情報メディア学会スポーツ情報処理時限研究会 4 ( 7 ) 2017年07月
3次元優勢領域図を利用したパスコースの決定

横井, 浅見, 石川, 渡辺

映像情報メディア学会スポーツ情報処理時限研究会 4 ( 6 ) 2017年07月
[Foreword] Welcome to the Special Section on Sports-Information Technology

Watanabe Hiroshi

ITE Transactions on Media Technology and Applications 5 ( 2 ) 35 - 35 2017年04月

CiNii
Trajectory Visualization of Ego-Motion Videos with Pedestrian Based on Monocular Visual Odometry and Machine Learning

Y. Zhang, H. Watanabe

IEICE General Conference BS-1 ( 8 ) 2017年03月
Motion Estimation on Fish-Eye Images

A. S. Satyawan, J. Hara, H. Watanabe

IEICE General Conference BS-1 ( 9 ) 2017年03月
Ship Identification for Port Security using Faster Region Convolution Neural Network

K. J. Ngeno, Hiroshi Watanabe

IEICE General Conference BS-1 ( 12 ) 2017年03月
X-means法を用いたマンガキャラクターの自動分類に関する検討

柳澤, 渡辺

IEICE総合大会 D-12 ( 40 ) 2017年03月
Recognition of Panel Structure in Comic Images Using Faster R-CNN

H. Yahagisawa, H. Watanabe

International Workshop on Image Electronics and Visual Computing 2017 4C ( 2 ) 2017年03月 [査読有り]
Data Augmentation Using DCGAN for Breed Identification

H. Watabe, H. Watanabe

International Workshop on Image Electronics and Visual Computing 2017 4C ( 3 ) 2017年03月 [査読有り]
ボロノイ図を用いたサッカーの3次元優勢領域の決定

横井, 石川, 渡辺

映像情報メディア学会スポーツ情報処理時限研究会 3 ( 1 ) 2017年02月
Welcome to the special section on sports-information technology

Hiroshi Watanabe

ITE Transactions on Media Technology and Applications 5 ( 2 ) 35 2017年

DOI
Adaptive Video Streaming in Hybrid Landslide Detection System with D-S Theory

Zhi Liu, Kenji Kanai, Masaru Takeuchi, Toshitaka Tsuda, Hiroshi Watanabe

2017 IEEE INTERNATIONAL CONFERENCE ON SMART COMPUTING (SMARTCOMP) 352 - 357 2017年 [査読有り]

　概要を見る

Disaster detection is an important research topic and draws great attentions from both industry and academia. In this paper, we study the hybrid landslide detection system, which utilizes the video surveillance camera and multiple kinds of sensors and can detect the landslide automatically. Edge processing is adopted in this hybrid system to fuse the sensor data based on the Dempster-Shafer (D-S) theory, i.e. utilizing multiple sensors' information to calculate the possibility of the landslide. Edge processing can make faster decisions than the control center, the results of the edge processing are then used to schedule the sensor's transmission frequency and video transmission under the network constraints in this system. The simulation results show that the proposed scheme outperforms the competing schemes in typical network scenarios.
Adaptive Video Streaming in Hybrid Landslide Detection System with D-S Theory

Zhi Liu, Kenji Kanai, Masaru Takeuchi, Toshitaka Tsuda, Hiroshi Watanabe

2017 IEEE INTERNATIONAL CONFERENCE ON SMART COMPUTING (SMARTCOMP) 352 - 357 2017年 [査読有り]

　概要を見る

Disaster detection is an important research topic and draws great attentions from both industry and academia. In this paper, we study the hybrid landslide detection system, which utilizes the video surveillance camera and multiple kinds of sensors and can detect the landslide automatically. Edge processing is adopted in this hybrid system to fuse the sensor data based on the Dempster-Shafer (D-S) theory, i.e. utilizing multiple sensors' information to calculate the possibility of the landslide. Edge processing can make faster decisions than the control center, the results of the edge processing are then used to schedule the sensor's transmission frequency and video transmission under the network constraints in this system. The simulation results show that the proposed scheme outperforms the competing schemes in typical network scenarios.

DOI
Faster R-CNNを用いたマンガ画像の構造解析

柳澤, 渡辺

P-2 ( 10 ) 2016年11月
線路脇カメラと鉄道車載カメラによる障害物検出の検討

加藤, 渡辺

P-3 ( 16 ) 2016年11月
レンズ歪補正動き補償方式に関する一検討

高木, 渡辺, 河村, 内藤

P-3 ( 19 ) 2016年11月
DCGANによる生成画像品質の猫品種識別性能に及ぼす影響について

渡部, 渡辺

P-4 ( 16 ) 2016年11月
DCGANを用いたデータオーギュメンテーションによる猫の品種識別について

渡部, 渡辺

2016映像情報メディア学会年次大会 24D ( 5 ) 2016年09月
レンズ歪ブロックを用いた動き補償方式の検討

高木, 渡辺, 河村, 内藤

FIT2016(第15回科学技術フォーラム) I ( 033 ) 2016年09月
Faster R-CNNを用いたマンガ画像からのメタデータ抽出

柳澤, 渡辺

2016映像情報メディア学会年次大会 14B ( 1 ) 2016年09月
Trajectory data visualization of sports video based on SLAM

Y. Zhang, H. Watanabe

ITE Annual Convention 2016 34C ( 1 ) 2016年09月
BS-3-27 Empty Scene and Non-empty Scene Detection from Wi-Sun Acceleration Sensor(BS-3. Advanced Networking Technologies for Innovative Information Networks)

Liu Zhi, Sigg Stephan, Tsuda Toshitaka, Watanabe Hiroshi

電子情報通信学会総合大会講演論文集 2016 ( 2 ) "S - 64"-"S-65" 2016年03月

　概要を見る

Wireless sensor networks (WSNs) are widely deployed to help detect disaster such as landslides, etc. in countries like Japan due to their efficiency and low cost. But there are various problems in these systems, such as the different kind of noise (such as random visit from animals, which leads to unnecessary false alarms). Orthogonally, context aware activity recognition draws increasing attention recently, where different environmental or martial sensors are adopted for activity recognition in various application scenarios. This paper studies a disaster detection system utilizing the Wi-Sun acceleration sensors together with the Wi-Sun signal fluctuation and considers the detection of human beings in the scene of interest. Hence, the false alarms introduced by the animals' walk in, etc. could be detected using similar method. During the detection process, not only the wireless signal but also the acceleration value collected by the sensors are adopted for a better detection result. Then feature values are calculated for event detection based on the data samples. K nearest neighbors (KNN) is used to classify the events in the scene of interest: empty and non-empty. The detection results are promising as shown in the result section and the proposed method is applicable to a real landslide to avoid animal- induced false alarms.

CiNii
D-21-4 電子コミックトレーラーの作成手法の検討(D-21.マルチメディア情報ハイディング・エンリッチメント,一般セッション)

野口多紀, 渡辺裕

電子情報通信学会総合大会講演論文集 2016 ( 2 ) 217 - 217 2016年03月

CiNii
D-21-3 コミックにおける主要キャラクター同定の検討(D-21.マルチメディア情報ハイディング・エンリッチメント,一般セッション)

長尾一輝, 渡辺裕

電子情報通信学会総合大会講演論文集 2016 ( 2 ) 216 - 216 2016年03月

CiNii
D-12-12 マンガキャラクターのマルチビュー顔検出に関する検討(D-12.パターン認識・メディア理解A(パターンメディアの認識・理解・生成),一般セッション)

柳澤秀彰, 渡辺裕

電子情報通信学会総合大会講演論文集 2016 ( 2 ) 81 - 81 2016年03月

CiNii
D-11-53 レンズ歪み補正付き動き補償方式の検討(D-11.画像工学,一般セッション)

高木政徳, 渡辺裕, 河村圭, 内藤整

電子情報通信学会総合大会講演論文集 2016 ( 2 ) 53 - 53 2016年03月

CiNii
D-11-20 鉄道運行映像に対する背景差分法の適用(D-11.画像工学,一般セッション)

加藤君丸, 渡辺裕

電子情報通信学会総合大会講演論文集 2016 ( 2 ) 20 - 20 2016年03月

CiNii
Empty Scene and Non-empty Scene Detection from Wi-Sun Acceleration Sensor

Z. Liu, S. Sigg, T. Tsuda, H. Watanabe

IEICE General Conference BS-3 ( 27 ) 2016年03月
鉄道運行映像に対する背景差分法の適用

加藤, 渡辺

電子情報通信学会総合大会 D-11 ( 20 ) 2016年03月
レンズ歪み補正付き動き補償方式の検討

高木, 渡辺, 河村, 内藤

D-11 ( 53 ) 2016年03月
マンガキャラクターのマルチビュー顔検出に関する検討

柳澤, 渡辺

D-11 ( 12 ) 2016年03月
コミックにおける主要キャラクター同定の検討

長尾, 渡辺

D-21 ( 3 ) 2016年03月
電子コミックトレーラーの作成手法の検討

野口, 渡辺

電子情報通信学会総合大会 D-21 ( 4 ) 2016年03月
Convolutional Pose Machinesを用いた歩きスマホ認識

加藤, 渡辺

IEICE総合大会 D-12 ( 56 ) 4724 - 4732 2016年 [査読有り]

DOI
R-CNNを用いたマンガキャラクター検出に関する一検討

柳澤, 渡辺

I-4 ( 12 ) 1 - 2 2015年11月
Cat and dog face detection using deformable part model

H. Watanabe, H. Zhang

PCSJ-IMPS Symposium I-3 ( 12 ) 1 - 2 2015年11月
マンガ要約手法に関する一考察

藤岡, 渡辺

電子情報通信学会ソサイティ大会 A20 ( 11 ) 2015年09月
A-20-11 マンガ要約手法に関する一考察(A-20.スマートインフォメディアシステム,一般セッション)

藤岡恭平, 渡辺裕

電子情報通信学会基礎・境界ソサイエティ/NOLTAソサイエティ大会講演論文集 2015 151 - 151 2015年08月

CiNii
Sensor Data Characteristics in the Wireless Train Track Unusual Notification Sensor Network

Z. Liu, T. Tsuda, H. Watanabe

IEICE Technical Report on Communication Quality CQ2015 ( 39 ) 101 - 106 2015年07月

CiNii
An effective normalization algorithm at deep learning for passenger detection in railway operation videos

J. Zhang, H. Watanabe

The 1st International Conference on Advanced Imaging ( T109-2 ) 2015年06月 [査読有り]
多層畳み込みニューラルネットワークを用いた人物認識における正規化効果

張, 石井, 渡辺

電子情報通信学会総合大会 D-12-24 2015年03月
マンガの複数キャラクターに対する顔検出率について

柳澤, 石井, 渡辺

電子情報通信学会総合大会 D-12-31 2015年03月
ディープラーニングを用いたオブジェクト認識における整流・マックスプーリング効果

張, 石井, 渡辺

情報処理学会全国大会 2P-08 2015年03月
マンガの複数キャラクターに対する顔検出率について

柳澤秀彰, 石井大祐, 渡辺裕

電子情報通信学会総合大会講演論文集 D-12 ( 31 ) 83 - 83 2015年02月

CiNii
Effect of Normalization at Deep Convolutional Neural Networks for Human Recognition

張傑, 石井大祐, 渡辺裕

電子情報通信学会総合大会講演論文集 D-12 ( 24 ) 76 - 76 2015年02月

CiNii
Traffic Deduction Exploring Sensor Data's Intra-Correlations in Train Track Monitoring WSN

Zhi Liu, Toshitaka Tsuda, Hiroshi Watanabe

2015 IEEE SENSORS 7 ( 171 ) 1712 - 1715 2015年 [査読有り]

　概要を見る

WSNs are good options to help monitor the scene of interest and notify the unusual happening to control center. But sensors' high sampling rates lead to tremendous network traffic over the bandwidth-limited and energy-critical WSNs, hence how to reduce the network traffic while maintaining the unusual events monitoring function becomes important. In this paper, we investigate the intra-correlations of the data generated by each sensor at different time instances. And we propose a traffic deduction algorithm exploring the sensor data's intra-correlations which could reduce the data volume significantly and guarantee the parameters needed for unusual detection are delivered.
マンガの構成要素に基づく自動シーン分割処理に関する一検討

石井, 柳澤, 三原, 永森, 渡辺

情報処理学会AVM研究会研究報告 Vol.2014-AVM87 ( No.15 ) 1 - 4 2014年12月
A basic study on human detection for train operation video

J. Zhang, H. Watanabe

PCSJ-IMPS 2014 I-5-01 2014年11月
放送の「こと」づくり

渡辺裕

映像情報メディア学会誌 68 ( 10 ) k18 - k18 2014年10月

DOI CiNii
2値線画像解析のための2値方向パターンヒストグラム特徴量HBOP

石井, 渡辺

早稲田大学GITS/GITI紀要2013-1014 33 - 44 2014年10月

CiNii
Face detection for comic images with deformable part model

H. Yanagisawa, D. Ishii, H. Watanabe

The 4th International Workshop on Image Electronics and Visual Computing 2014 (IEVC2014) 4A-1 ( 2 ) 95 - 100 2014年10月

CiNii
マンガにおけるHOG+AdaBoostによる顔画像検出の性能評価

陳, 柳澤, 張, 石井, 渡辺

映像情報メディア学会年次大会 17-4 17 - 4-1_-_17-4-2_ 2014年09月

　概要を見る

In recent years, Japanese comics have been very popular in the world. In this paper, we applied the HOG and AdaBoost algorithm to detect character's face in comic images. We have recognized that learning process requires at least 80 times repeatedly and the detection success rate converges in about 65%.

DOI CiNii
マンガ画像からの顔検出におけるパーツ特徴量の一検討

柳澤, 石井, 陳, 渡辺

映像情報メディア学会年次大会 17-9 17 - 9-1_-_17-9-2_ 2014年09月

　概要を見る

In recent years, studies of extracting meta-data from comic images has been focused on for query and search application of e-comics. In this paper, we propose to apply Deformable Part Model, which is a detection method using parts placement of an object, to detect character' faces in comic images.

DOI CiNii
A Note on Face Detection of Comic Image with Different Background

M. Chen, H. Yanagisawa, D. Ishii, H. Watanabe

2014 IEICE General Conference BS-1-47 1 - 2 2014年03月
A note on content caching and delivery based on AVC and SVC

M. Takafuji, H. Nakazato, H. Watanabe

情報処理学会AVM研究会研究報告 Vol.2014-AVM84 ( No.3 ) 1 - 5 2014年02月

　概要を見る

Content cache effectiveness in video distribution is mathematically analyzed. Targeted video distribution models are two types. One is simulcast, the other is scalable approach. Simple two layer model is used for the analysis. Caching effect is studied focusing on the amount of required transmission data based on some assumptions with a certain cache hit probability. The result shows that the scalable approach cannot improve the caching effect.Content cache effectiveness in video distribution is mathematically analyzed. Targeted video distribution models are two types. One is simulcast, the other is scalable approach. Simple two layer model is used for the analysis. Caching effect is studied focusing on the amount of required transmission data based on some assumptions with a certain cache hit probability. The result shows that the scalable approach cannot improve the caching effect.

CiNii
2値線画特徴量を用いたマンガ画像解析に関する一検討

石井, 張, 石上, 渡辺

情報処理学会AVM研究会研究報告 Vol.2014-AVM84 ( No.5 ) 1 - 4 2014年02月

　概要を見る

マンガは 2 値線画からなる複雑な模様によって描かれている．従来，画像のエッジをとらえる画像特徴量として，HOG が使用されており，マンガにも一定の解析結果をもたらすことがわかってきている．その一方で，2 値の線画を扱ううえで，HOG 特徴量では形状をとらえるために冗長な次元がある．また，2 値の斜線は縦横 2 方向の合成によって構成されることから，方向の取得にも難点がある．本稿では我々の提案する 2 値線画像用に作成した特徴量 Histograms of Binary Oriented Pattern(HBOP) を用いてマンガ画像の解析を行う．マンガ画像解析において従来の HOG 特徴量と HBOP を用いた場合の比較結果を示す．Comic image was drawn into complex pattern using binary line. In conventional research, HOG it can treat image edge was utilized for to analyze Comic image as image feature. On the other hand HOG feature has redundant dimension for to treat binary line image. Moreover, diagonal line on binary image was consisted by combination of vertical and horizontal edge. Thus it is hard to get line angle. Histograms of Binary Oriented Pattern(HBOP) was proposed for to analyze binary line image by us. In this research, We analyze comic image using HBOP. Then, comparison results between HOG with HBOP on Comic image analysis was derived by examination.

CiNii
“HTML5 CanvasおよびSVGにおける自動選択アルゴリズムを用いた描画パフォーマンスの最適化

山本, 渡辺

情報処理学会AVM研究会研究報告 Vol.2014-AVM84 ( No.6 ) 1 - 4 2014年02月

　概要を見る

HTML5 では，ビットマップ形式のデータを扱う Canvas，ベクター形式のデータを扱う Inline SVG がサポートされ，JavaScript ベースで HTML 内に図形や画像を直接記述できる．しかし，Flash，Java などのプラグインを使用した場合と比べ，描画パフォーマンスが劣るという問題点がある．本研究では，Canvas および SVG における従来の描画パフォーマンス高速化手法の比較を行った．さらに，描画する要素に対して最適な手法を自動選択し，適切な手法で描画するアルゴリズムを提案する．In HTML5, Canvas dealing with data of the bitmap and Inline SVG dealing with Data for vector are supported. Thus we can directly describe image or graphic in HTML, by the JavaScript. However, drawing performance is not good compared with the plugins such as Flash or Java. In this paper, we sinvestigate the performance of traditional approaches to increase drawing speed. Finally, we propose an algorithm to select the best approach automatically.

CiNii
マンガからの自動人物検出と識別に関する一検討

石井大祐, 渡辺裕

画像電子学会論文誌 Vol. 42 ( No. 4 ) 457 - 465 2013年07月

　概要を見る

近年，マンガは日本国内のみならず海外においても多数の読者を持つコンテンツとなっている．また，電子書籍閲覧可能な端末の進化と電子書籍市場の発展により，電子化されたマンガの需要も高まっている．マンガは絵，文字，擬音などの多数の要素がすべて一枚の誌面上に描かれるため，大変複雑な構造を持つ．これら登場人物や多数の要素はメタデータとして取り出すことで，ディジタル化されたマンガの利便性を向上させることができる．マンガは基本的に白黒で描かれるため，画像の特性は自然画像と異なる．このため，マンガの登場人物解析を行うためには，マンガの登場人物の特徴をうまく捉える手法を用いる必要がある．本稿では，HOG (Histograms of Oriented Gradient) とSVM (Support Vector Machine) による登場人物の瞳，顔の検出処理と，登場人物の識別処理を組み合わせることで，マンガのページ上から，特定登場人物の検出を行う手法を提案する．結果として，提案手法による特定登場人物の検出で最大0.93 のPrecision を得た．

DOI CiNii
マンガ固有の特徴を利用したマンガ登場人物識別に関する一検討

石井, 山﨑, 渡辺

情報処理学会AVM研究会研究報告 Vol.2013-AVM80 ( No.1 ) 1 - 4 2013年04月
マンガ上のキャラクター識別に関する一検討

石井, 山﨑, 渡辺

情報処理学会全国大会 3D-2 2013年03月
フーリエ記述子と色情報を用いた野菜識別手法

山﨑, 石井, 渡辺

情報処理学会AVM研究会研究報告 Vol.2012-AVM79 ( No.27 ) 1 - 4 2012年12月
Multi Size Eye Detection on Digitized Comic Image

D. Ishii, T. Yamazaki, H. Watanabe

IIEEJ 3rd Image Electronics and Visual Computing Workshop (IEVC 2012) 2012年11月 [査読有り]
マンガ画像における瞳検出の解像度依存性に関する検討

石井, 渡辺

2012年度画像電子学会年次大会（Media Computing Conference 2012） R2-4 2012年06月
マンガ画像からの自動キャラクター位置検出に関する検討

石井, 渡辺

Vol.2012-AVM76 ( No.2 ) 1 - 5 2012年02月
A Modified Parabolic Prediction Based Fractional Pixel Motion Estimation Using Preset Corrector

Chang-Uk Jeong, Hiroshi Watanabe

18TH ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS (APCC 2012): GREEN AND SMART COMMUNICATIONS FOR IT INNOVATION 526 - 527 2012年 [査読有り]

　概要を見る

In general, a motion estimator in today's video coding includes fractional pixel motion estimation (FME) as well as integer pixel motion estimation (IME). FME can get better quality performance at the cost of higher computational complexity than IME alone. In this paper, a modified parabolic prediction based FME for H.264 video coding is proposed. The proposed method uses specific correction coefficients to improve the PSNR performance of the existing prediction based algorithm. In the simulation results, compared with the original algorithm, our technique shows more accurate predictive power, meanwhile, it does not require the interpolation process.
Interpolation-Free Fractional Pixel Motion Estimation Based on Data Trend Approximation

Chang-Uk Jeong, Hiroshi Watanabe

2012 INTERNATIONAL CONFERENCE ON SIGNALS AND ELECTRONIC SYSTEMS (ICSES) 52 - 62 2012年 [査読有り]

　概要を見る

High computational complexity of a fractional pixel motion estimation (FME) module can in no way be negligible although the module improves visual quality after the integer pixel motion estimation (IME) process. Most of the conventional FME methods must include the interpolation procedure to form fractional pixel search points from the information of the integer pixel matching error costs. The interpolation, however, requires a considerable memory usage and certain amount of processing time. In this paper, therefore, we propose interpolation-free fractional pixel motion estimation techniques by using the data trend approximation and the error cost scaling. The results of our simulation show that the proposed methods produce similar or better performances compared with the existing FME methods, whereas they do not need any additional search points.
Automatic Scale Detection for Contour Fragment Based on Difference of Curvature

Kei Kawamura, Daisuke Ishii, Hiroshi Watanabe

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS E94D ( 10 ) 1998 - 2005 2011年10月 [査読有り]

　概要を見る

Scale-invariant features are widely used for image retrieval and shape classification. The curvature of a planar curve is a fundamental feature and it is geometrically invariant with respect it the coordinate system. The curvature-based feature varies in position when multiscale analysis is performed. Therefore, it is important to recognize the scale in order to detect the feature point. Numerous shape descriptors based on contour shapes have been developed in the field of pattern recognition and computer vision. A curvature scale-space (CSS) representation cannot be applied to a contour fragment and requires the tracking of feature points. In a gradient-based curvature computation, although the gradient computation considers the scale, the curvature is normalized with respect to not the scale but the contour length. The scale-invariant feature transform algorithm that detects feature points from an image solves similar problems by using the difference of Gaussian (DoG). It is difficult to apply the SIFT algorithm to a planar curve for feature extraction. In this paper, an automatic scale detection method for a contour fragment is proposed. The proposed method detects the appropriate scales and their positions on the basis of the difference of curvature (DoC) without the tracking of feature points. To calculate the differences, scale-normalized curvature is introduced. An advantage of the DoC algorithm is that the appropriate scale can be obtained from a contour fragment as a local feature. It then extends the application area. The validity of the proposed method is confirmed by experiments. The proposed method provides the most stable and robust scales of feature points among conventional methods such as curvature scale-space and gradient-based curvature.

DOI
Sub-pixel position prediction techniques based on Bazier curve

C. Jeong, H. Watanabe

IPSJ AVM Technigal Report Vol.2011-AVM73 ( No.8 ) 1 - 6 2011年07月

　概要を見る

The high computation load of fractional-pixel motion estimation (FME) cannot be ignored although it improves visibly the PSNR performance after the integer-pixel motion estimation (IME) process. Most of conventional FME methods must include the interpolation procedure to form the fractional-pixel points from the integer-pixel information. However, the interpolating requires a considerable memory usage and any amount of processing time. In this paper, therefore, we propose interpolation-free fractional-pixel position prediction techniques based on Bezier curve and post the results of the simulation which show a remarkable performance improvement compared to the existing FME methods in terms of PSNR.The high computation load of fractional-pixel motion estimation (FME) cannot be ignored although it improves visibly the PSNR performance after the integer-pixel motion estimation (IME) process. Most of conventional FME methods must include the interpolation procedure to form the fractional-pixel points from the integer-pixel information. However, the interpolating requires a considerable memory usage and any amount of processing time. In this paper, therefore, we propose interpolation-free fractional-pixel position prediction techniques based on Bezier curve and post the results of the simulation which show a remarkable performance improvement compared to the existing FME methods in terms of PSNR.

CiNii
Automatic preview generation of comic episodes for digitized comic search.

Keiichiro Hoashi, Chihiro Ono, Daisuke Ishii, Hiroshi Watanabe

MM’11 Proceedings of the 19th ACM international conference on Multimedia 1489 - 1492 2011年 [査読有り]

DOI
A Study on Frame Position Detection of Digitized Comics Images

D. Ishii, H. Watanabe

Workshop Picture Coding Symposium WP3-17 124 - 125 2010年12月 [査読有り]
A Linear Model-Based Sub-Pixel Motion Estimation Method For H.264/AVC Standard

C. Jeong, H. Watanabe

Workshop Picture Coding Symposium WP1-5 11 - 12 2010年12月 [査読有り]
多重記述符号化における冗長係数群の最適化手法

石川, 渡辺

情報処理学会AVM 研究会研究報告 Vol.2010-AVM71 ( No.21 ) 1 - 6 2010年12月
A Vertically Symmetrical Linear Model-Based Fractional-Pixel Motion Estimation Algorithm for H.264/AVC Encoder

C. Jeong, H. Watanabe

IPSJ SIG Technical Report Vol.2010-AVM71 ( No.20 ) 1 - 5 2010年12月
全変動最小化の高速計算手法

河村, 石井, 渡辺

電子情報通信学会論文誌 Vol. J93-D ( No.3 ) 326 - 335 2010年03月

　概要を見る

全変動は輝度変化が滑らかであるという仮定を導入することなく画像を扱える正則化基準である.全変動最小化によって入力画像から骨格成分を取得する手法として,(1)骨格成分とは双対の関係にある振動成分に着目し,半陰的再急降下法により解く手法や,(2)ノイズエネルギーが既知の値以下であるという制約条件付き最小二乗問題と全変動を画像の分類から決定される値にする凸最適化問題の二つを交互に解く手法が提案されている.しかし,これらの手法は計算コストが高いという問題がある.本論文では,全変動最小化を高速に計算する手法を提案する.まず,全変動を二次元画像に適用して離散化する際に,平方根計算が必要ないノルムを用いる.次に,劣こう配を用いた反復処理で骨格成分を計算する.双対変数を計算しないこと,FFTを含む射影関数や平方根計算が不必要,浮動小数点の計算回数の削減により計算量を削減する.計算時間を従来手法の(1)に対して53%,(2)に対して5%へと大幅に削減できることを実験結果により示す.提案手法は従来手法と同等の性能を維持したまま,陽に忠実化項と平滑化項から構成される反復処理を可能とする.

CiNii
コミック画像の文字位置特定に関する検討

河村, 石井, 渡辺

電子情報通信学会総合大会 D-12-52 2010年03月
過完備変換係数の最適化における画質評価に関する考察

石川, 渡辺, 坂東, 高村, 如澤

電子情報通信学会総合大会 D-11-87 2010年03月
四元数を用いた画像符号化方式の検討

石川, 渡辺

画像符号化シンポジウム PCSJ2009 P-4.03 75 - 76 2009年10月
領域分割に基づくベクタ表現を用いた画像符号化方式の検討

河村, 石井, 渡辺

画像符号化シンポジウムPCSJ2009 P-3.09 2009年10月
フレーム理論に基づく信号の冗長度制御方式

石川, 渡辺

FIT2009 I-045 331 - 332 2009年09月
線画の混在する画像におけるテキスト領域抽出の改善手法

河村, 石井, 渡辺

FIT2009 H-027 2009年09月
［招待講演］モバイルディジタルテレビジョンのためのビデオ符号化

渡辺

信学技報 MoMuC2009-28 vol.109 ( no.204 ) 9 - 14 2009年09月
フレーム展開による画像の多重記述符号化に関する検討

石川, 渡辺

情報処理学会 AVM 研究会研究報告 Vol.2009-AVM65 ( No.16 ) 1 - 6 2009年07月
コミック画像のコマ分割処理における制御パラメータに関する検討

石井, 河村, 渡辺

電子情報通信学会パターン認識・メディア理解研究会 PRMU2009-34 ( 34 ) 187 - 192 2009年05月

CiNii
[チュートリアル招待講演]MPEGビデオ符号化方式

渡辺裕

電子情報通信学会技術研究報告 NS2009-14 71 - 76 2009年04月

CiNii
グラデーションメッシュ最適化手法の検討

河村, 石井, 渡辺

電子情報通信学会総合大会 D-11-36 2009年03月
角度と複比を用いた複数車線検出に関する一検討

谷, 河村, 渡辺

電子情報通信学会総合大会 D-11-68 2009年03月
並列木複素ウェーブレット変換のブロック歪に関する考察

石川, 渡辺, 坂東, 高村, 上倉, 八島

電子情報通信学会総合大会 AS-3-3 2009年03月
Noise Shapingを用いた並列木複素wavelet変換符号化に関する基礎的考察

坂東, 高村, 上倉, 八島, 石川, 渡辺

電子情報通信学会総合大会 D-11-57 2009年03月
高ダイナミックレンジ画像生成におけるゴースト除去手法に関する検討

高木, 渡辺

電子情報通信学会総合大会 D-11-51 2009年03月
Automatic Scale Detection Based on Difference of Curvature

K. Kawamura, D. Ishii, H. Watanabe

2009 International Workshop on Advanced Image Technology (IWAIT 2009) 2009年01月 [査読有り]

CiNii
グラデーションメッシュによる画像符号化の基礎検討

河村, 石井, 渡辺

画像符号化シンポジウム PCSJ2008 P-5.04 2008年10月
アンチエイリアシングを考慮したベクタ表現による濃淡画像符号化に関する一検討

石井, 河村, 渡辺

画像符号化シンポジウム PCSJ2008 P-2.04 2008年10月
画像の不確定性符号化におけるレート制御方式に関する検討

石川, 渡辺

画像符号化シンポジウム PCSJ2008 P-2.02 2008年10月

CiNii
フレーム展開による画像の不確定性符号化に関する検討

石川, 渡辺

FIT2008 I-016 2008年09月
複比を用いた複数車線検出に関する検討

谷, 河村, 渡辺

FIT2008 H-040 2008年09月
スケール不変な曲線特徴に関する検討

河村, 石井, 渡辺

河村, 石井, 渡辺 H-025 2008年09月
移動物体が存在する複数画像からの高ダイナミックレンジ画像生成に関する検討

高木, 渡辺

情報処理学会AVM研究会 2008AVM62-03 2008年09月
SPORT: An algorithm for Divisible Load Scheduling with result collection on heterogeneous systems

Abhay Ghatpande, Hidenori Nakazato, Olivier Beaumont, Hiroshi Watanabe

IEICE TRANSACTIONS ON COMMUNICATIONS E91B ( 8 ) 2571 - 2588 2008年08月

　概要を見る

Divisible Load Theory (DLT) is an established mathematical framework to study Divisible Load Scheduling (DLS). However, traditional DLT does not address the scheduling of results back to source (i.e., result collection), nor does it comprehensively deal with system heterogeneity. In this paper, the DLSRCHETS (DLS with Result Collection on HET-erogeneous Systems) problem is addressed. The few papers to date that have dealt with DLSRCHETS, proposed Simplistic LIFO (Last In, First Out) and FIFO (First In, First Out) type of schedules as solutions to DLSRCHETS. In this paper, a new polynomial time heuristic algorithm, SPORT (System Parameters based Optimized Result Transfer), is proposed as a solution to the DLSRCHETS problem. With the help of simulations, it is proved that the performance of spoRT is significantly better than existing algorithms. The other major contributions of this paper include, for the first time ever, (a) the derivation of the condition to identify the presence of idle time in a FIFO schedule for two processors, (b) the identification of the limiting condition for the optimality Of FIFO and LIFO schedules for two processors, and (c) the introduction of the concept of equivalent processor in DLS for heterogeneous systems with result collection.

DOI
フレーム展開による信号の冗長度制御方式に関する検討

石川, 渡辺

電子情報通信学会画像工学研究会技術報告 IE2008-43 Vol.108 ( No.127 ) 61 - 66 2008年07月

　概要を見る

多重記述符号化の冗長度制御方式としてフレーム展開を用いる手法を提案する.ウェーブレット変換後の係数群をラティスにより複数のパケットに均等分割する従来手法では,パケット間の冗長度がラティスの構造に依存しており信号自体の冗長度には着目していない.そこで本検討では,信号を直接的に冗長表現するためにウェーブレット係数群をフレーム展開する.従来方式と比較して,パケットロスが生じる場合のエラー耐性が高まることを実験で確認した.

CiNii
Analysis of Divisible Load Scheduling with result collection on heterogeneous systems

Abhay Ghatpande, Hidenori Nakazato, Olivier Beaumont, Hiroshi Watanabe

IEICE TRANSACTIONS ON COMMUNICATIONS E91B ( 7 ) 2234 - 2243 2008年07月 [査読有り]

　概要を見る

Divisible Load Theory (DLT) is an established framework to study Divisible Load Scheduling (DLS). Traditional DLT ignores the result collection phase, and specifies no solution to the general case where both the network speed and computing capacity of the nodes are heterogeneous. In this paper, the DLS with Result Collection on HETerogeneous Systems (DLSRCHETS) problem is formulated as a linear program and analyzed. The papers to date that have dealt with result collection, proposed Simplistic LIFO (Last In, First Out) and FIFO (First In, First Out) type of schedules as solutions. The main contributions of this paper are: (a) A proof of the Allocation Precedence Condition, which is inconsequential in LIFO or FIFO, but is important in a general schedule. (b) A proof of the Idle Time Theorem, which states that irrespective of whether load is allocated to all available processors, in the optimal solution to the DLSRCHETS problem, at the most one processor that is allocated load has idle time, and that the idle time exists only when the result collection begins immediately after the completion of load distribution.

DOI
曲率空間尺度を用いたベクタ変換の評価手法に関する検討

河村, 石井, 渡辺

情報処理学会AVM研究会 2007AVM60-05 2008年03月
直感的な曲線操作を実現するベクタ変換の検討

河村, 石井, 渡辺

情報処理学会全国大会 4B-6 2008年03月
複素ウェーブレットの疎表現における係数選択手法の検討

高橋, 渡辺

情報処理学会全国大会 3Y-2 2008年03月
ベクタ表現を用いた画像のエッジ再現手法に関する一検討

石井, 河村, 渡辺

情報処理学会全国大会 3Y-1 2008年03月
グローバル動き補償を用いた背景フレーム補間手法に関する検討

東, 渡辺

情報処理学会全国大会 3W-4 2008年03月
全変動ノルムの離散化手法に関する検討

河村, 石井, 渡辺

電子情報通信学会総合大会 D-11-82 2008年03月
Divisible Load Scheduling with result collection on heterogeneous systems

Abhay Ghatpande, Hidenori Nakazato, Hiroshi Watanabe, Olivier Beaumont

2008 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-8 372 - + 2008年 [査読有り]

　概要を見る

Divisible Load Theory (DLT) is an established mathematical framework to study Divisible Load Scheduling (DLS). However traditional DLT does not comprehensively deal with the scheduling of results back to source (i.e., result collection) on heterogeneous systems. In this paper, the DLSRCHETS (DLS with Result Collection on HETerogeneous Systems) problem is addressed. The few papers to date that have dealt with DLSRCHETS, proposed simplistic LIFO (Last In, First Out) and FIFO (First In, First Out) type of schedules as solutions to DLSRCHETS. In this paper, a new heuristic algorithm, ITERLP, is proposed as a solution to the DLSRCHETS problem. With the help of simulations, it is proved that the performance of ITERLP is significantly better than existing algorithms.
位置ずれを持つ画像に対する高ダイナミックレンジ画像の生成手法に関する検討

高木, 渡辺

情報処理学会AVM研究会 2007AVM59-17 ( 64 ) 101 - 104 2007年12月

CiNii
疑似TVノルムの数値計算とその領域分割への応用に関する検討

河村, 石井, 渡辺

情報処理学会AVM研究会 2007AVM59-10 IE2007 - 122 2007年12月

CiNii
方向性フィルタバンクによる動画像符号化方式に関する基礎検討

石川, 渡辺

画像符号化シンポジウム PCSJ2007 P-3.02 2007年11月
人工的な多値画像のベクター変換

河村, 石井, 渡辺

画像符号化シンポジウム PCSJ2007 P-2.21 2007年11月
動画像の時空間ベクタ表現に関する一検討

石井, 河村, 渡辺

画像符号化シンポジウム PCSJ2007 P-1.03 2007年11月

CiNii
ベクター表現に適した小数画素精度の改良境界線検出に関する検討

河村, 石井, 渡辺

FIT2007 I-045 2007年09月

CiNii
人工的な多値画像のベクタ変換を考慮した領域分割に関する検討

河村, 石井, 渡辺

情報処理学会AVM研究会 2007AVM58-07 2007年09月
双対木複素ウェーブレットにおける Energy Compaction の検討

高橋, 渡辺

情報処理学会AVM研究会 2007AVM58-06 2007年09月
コミックのコマ分割処理に関する一検討

石井, 河村, 渡辺

電子情報通信学会論文誌D Volume J90-D ( No.7 ) 1667 - 1770 2007年07月

　概要を見る

2分割を繰り返すことによりコミック画像を各コマに分割する手法が提案されている.本論文では,幅をもつ検査帯により分割線候補の検出を行い,分割線適合検査によりコマ分割を決定する手法を提案し,実験によりしきい値と分割精度について評価を行った.

CiNii
ベクター表現に適した少数画素精度の改良境界線検出に関する検討

河村, 石井, 渡辺

情報処理学会AVM研究会 2007 ( 22 ) 61 - 64 2007年03月

　概要を見る

画像入力装置の普及により，容易に紙面のディジタル化が可能になった．さらに，画像入力時間の短縮のために解像度は低い方が良い．一方で，解像度の異なる様々な端末に画像を表示するためには解像度変換が必須である．我々はこれまでに，解像度変換が容易なベクター表現に着目し，ベクター変換を考慮した小数画素精度の2値化を提案してきた．しかし，線に振動が生じてベクター変換効率を低下させる場合がある．そこで，本稿では，均等色領域に隣接するエッジに対して，その方向性を考慮した1次元フィルタによる補間を行い，2値化により境界線を取得する手法を提案する．1次元フィルタには解像度変換において線の振動を低減するためにステップ関数を用いる．Digitizing paper have been convenient since image input devices are disseminated. Further, an input resolution is lower, an input speed is higher. On the other hand, a resolution conversion is mandatory technique in order to display the image on any panels which have varying resolutions. We have proposed subpixel binarization for vectorization, which is suitable for resolution conversion. However, vibrations of line drawings or border lines may occur, and then vectorization efficiency decreases. In this paper, we propose subpixel binarization based on a directive one-dimensional filtering along a border region of homogeneous tone. A step function is adopted as the one-dimensional filter for suppression of the vibration.

CiNii
ベクター表現による文字や網点を含む画像符号化システム

河村, 山本, 石井, 渡辺

情報処理学会全国大会 3B-1 2007年03月

CiNii
コミック画像におけるコマ分割処理の高速化に関する検討

石井, 河村, 渡辺

情報処理学会全国大会 1P-2 2007年03月

CiNii
複雑背景における物体検出法

伊谷, 渡辺

電子情報通信学会総合大会 D-12-152 2007年03月
照明変化のある環境下での移動物体検出の検討

谷, 伊谷, 渡辺, 富永

電子情報通信学会総合大会 D-11-102 102 - 102 2007年03月

CiNii
複数画像によるダイナミックレンジ向上に関する検討

高木, 河村, 渡辺, 富永

電子情報通信学会総合大会 D-11-67 2007年03月
縮小画像を用いたフレーム補間におけるギャップ処理手法に関する検討

東, 渡辺

電子情報通信学会総合大会 D-11-65 65 - 65 2007年03月

CiNii
電子透かしを埋め込んだベクター画像の客観評価手法に関する検討

河村, 山本, 石井, 渡辺

電子情報通信学会総合大会 D-11-42 2007年03月
方向性フィルタバンクを用いた多重記述符号化における係数補間に関する検討

石川, 渡辺

電子情報通信学会総合大会 D-11-32 32 - 32 2007年03月

CiNii
時空間画像のJPEG 2000符号化によるPSNRの周期的変動の低減に関する検討

加藤, 渡辺

電子情報通信学会総合大会 D-11-28 2007年03月
エッジ成分のベクトル化による空間スケーラブル符号化

山本, 河村, 渡辺

電子情報通信学会総合大会 D-11-8 2007年03月
FGS映像における受信画質劣化防止に関する一検討

秋間, 渡辺, 富永

電子情報通信学会総合大会 D-11-7 2007年03月
View Interpolation を用いた多視点符号化の効率改善に関する一検討

後藤, 内藤, 渡辺

電子情報通信学会総合大会 D-11-6 2007年03月
複雑な背景における物体検出法の検討

伊谷, 渡辺

電子情報通信学会PRMU研究会 2006PRMU12-09 2006年12月
FGSビデオ受信時におけるビットストリームの信頼性に関する検討

秋間, 渡辺, 富永

情報処理学会AVM研究会 2006AVM55-20 ( 67 ) 51 - 54 2006年12月

CiNii
方向性フィルタバンクによる多重記述符号化に関する検討

石川, 渡辺

情報処理学会AVM研究会 2006AVM55-19 2006年12月
複素ウェーブレットを用いた画像符号化における情報量削減手法に関する検討

高橋, 河村, 渡辺

画像符号化シンポジウム PCSJ2006 P-5.09 2006年11月
画像の不確定性符号化について

石川, 渡辺

画像符号化シンポジウム PCSJ2006 P-3.02 2006年11月

CiNii
コミックのコマ分割処理に関する一検討

石井, 河村, 渡辺

画像符号化シンポジウム PCSJ2006 P-2.07 2006年11月
ベクター表現によるマンガ符号化システム

河村, 山本, 石井, 渡辺

画像符号化シンポジウム PCSJ2006 P-2.06 2006年11月
奥行き情報を用いた多視点映像符号化の符号量削減

後藤, 内藤, 渡辺

画像符号化シンポジウム PCSJ2006 P-1.03 2006年11月
動きオブジェクトベースのフレーム補間手法に関する検討

東, 渡辺

FIT2006 J-069 2006年09月
分割線選択によるコミックのコマ分割処理に関する検討

石井, 河村, 渡辺

FIT2006 J-033 2006年09月

CiNii
文字や網点を含む低解像度多値画像のベクター表現に関する検討

河村, 山本, 渡辺

FIT2006 J-032 261 - 262 2006年09月

CiNii
複素ウェーブレット変換を用いた画像符号化に関する一検討

高橋, 河村, 渡辺

FIT2006 J-031 259 - 260 2006年09月

CiNii
JPEG特許に関する最近の話題

渡辺

情報処理学会誌 Vol.47 ( No.9 ) 2006年09月
SPORT: Extended Simulations and Results for Divisible Load Scheduling on Heterogeneous Systems

A. Ghatpande, H. Nakazato, H. Watanabe

IEICE Society Conference BS-15-3 2006年09月
ベクター表現による文字や網点を含む画像符号化システム

河村, 山本, 石井, 渡辺

情報処理学会AVM研究会 2006AVM54-06 2006年09月
イントラ符号化にベクター表現を用いた動画像圧縮に関する検討

河村, 山本, 渡辺

情報処理学会全国大会 2F-2 2006年03月
ベクトル表現を用いた空間スケーラブル符号化

山本, 河村, 渡辺

電子情報通信学会総合大会 D-11-1 2006年03月
映像シーンに適応したスケーラビリティ選択手法

高橋, 石川, 渡辺, 富永

電子情報通信学会総合大会 D-11-19 2006年03月
フレーム補間におけるギャップ処理に関する検討

東, 渡辺, 永吉, 富永

電子情報通信学会総合大会 D-11-20 2006年03月
カラー画像の特性を考慮した超解像手法に関する検討

早雨, 河村, 永吉, 渡辺, 富永

電子情報通信学会総合大会 D-11-92 2006年03月
線画の混在する画像におけるテキスト領域の抽出に関する検討

河村, 山本, 渡辺

電子情報通信学会総合大会 D-11-99 2006年03月
人物のふるまいに着目した動的特徴量からの人物照合に関する一検討

守屋, 河村, 渡辺, 富永

電子情報通信学会総合大会 D-12-96 2006年03月
逆転音声による音声・音楽の半開示スクランブルに関する検討

本多, 河村, 渡辺, 富永

電子情報通信学会総合大会 D-14-15 2006年03月
A study on spatial scalable coding using vector representation

Yuki Yamamoto, Kei Kawamura, Hiroshi Watanabe

2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS MA1-P2.4 145 - + 2006年 [査読有り]

　概要を見る

The major advantage of vector representation of an image is that the image quality is maintained for arbitrary scaling. In recent years, a demand for scalable image coding has been increasing because of the wide variety of available digital contents and display terminals. Conventional scalable coding schemes are based on raster representation, and thus, line drawings deteriorate in quality when expanded and shrunk. In this paper, we propose an edge reconstruction method using vector representation for the purpose of keeping a consistent spatial scalability on transmission and display. We take an anti-aliasing into account in edge areas for approximation of luminance values around the edge. The proposed method can improve PSNR by up to 2 dB as compared to the conventional methods when image is expanded and shrunk.
Bit rate reduction of vector representation of binary images

Yuki Yamamoto, Kei Kawamura, Hiroshi Watanabe

2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS WP-P3.7 3105 - + 2006年 [査読有り]

　概要を見る

Vector representation of binary images has an advantage of keeping high image quality for arbitrary scaling as well as editing capability of an object. However, the vector representation suffers from low compression efficiency compared with JBIG. In this paper, we show the main cause reducing coding efficiency and propose two methods to improve it. The proposed methods can reduce the file size of a binary image up to about 30-40 percent.
ベクトル表現を用いた空間スケーラブル符号化に関する一検討

山本, 河村, 渡辺

情報処理学会AVM研究会 2005AVM51-14 75 - 80 2005年12月
動画像圧縮におけるベクター表現の符号化に関する検討

河村, 山本, 渡辺

情報処理学会AVM研究会 2005AVM51-15 81 - 86 2005年12月
品質変動低減のためのJPEG2000符号量制御方式

佐野, 内藤, 渡辺

情報処理学会AVM研究会 2005AVM51-17 93 - 98 2005年12月
複雑な環境における物体検出法の一検討

伊谷, 渡辺

画像符号化シンポジウム PCSJ2005 2005年11月
ベクトル表現を用いた時空間スケーラブル符号化におけるエッジ再構成に関する検討

山本, 河村, 渡辺

画像符号化シンポジウム PCSJ2005 2005年11月
イントラ符号化にベクター表現を用いた動画像圧縮に関する検討

河村, 山村, 渡辺

画像符号化シンポジウム PCSJ2005 2005年11月

CiNii
映像シーンに適応したスケーラビリティ選択手法に関する検討

高橋, 後藤, 石川, 渡辺, 富永

画像符号化シンポジウム PCSJ2005 P-5.13 2005年11月
動き補償付時間方向フィルタリングにおける適応的Update 手法に関する一検討

後藤, 石川, 渡辺

画像符号化シンポジウム PCSJ2005 2005年11月
時空間画像のJPEG 2000 符号化におけるSNR の周期的変動に関する考察

加藤, 石川, 渡辺

画像符号化シンポジウム PCSJ2005 P-2.18 2005年11月
ディジタルシネマのフリッカー低減を目的とした符号化手法

石川, 渡辺

画像符号化シンポジウム PCSJ2005 P-2.17 2005年11月

CiNii
H.264 Inter予測における動き補償ブロックサイズ高速決定手法

常松, 渡辺

情報処理学会AVM研究会 2005AVM50-3 ( 9 ) 1979 - 1981 2005年10月

CiNii
ベクトル変換におけるパスの本数と点の個数に関する検討

河村, 山本, 渡辺

FIT2005 J-075 2005年09月
時空間スケーラビリティを考慮したセルアニメーションにおけるエッジ再構成に関する一検討

山本, 河村, 渡辺

FIT2005 J-068 2005年09月
JPEG2000によるHDTV低遅延符号化のための符号量制御方式

佐野, 内藤, 渡辺

FIT2005 J-062 2005年09月
Motion JPEG 2000における前後フレームを用いたフリッカー低減手法の検討

伊谷, 渡辺

FIT2005 J-061 2005年09月
時空間画像のJPEG 2000符号化におけるSNR揺らぎについて

加藤, 石川, 渡辺

FIT2005 J-060 2005年09月
SPORT: A Near-Optimal Solution to Divisible Load Scheduling on Heterogeneous Systems

A.Ghatpande, H.Nakazato, H.Watanabe

2005 IEICE Society Conference BS-9-4 2005年09月
H.264/AVC におけるIntra 予測モードの決定手法に関する検討

常松, 渡辺

電子情報通信学会論文誌D-II Vol. J88-D2 ( No.9 ) 1979 - 1981 2005年09月

　概要を見る

H.264/AVCは符号化処理量が多い問題点がある. 多くはInter予測に関する処理であるが, 2種類の予測サイズが利用できるIntra予測も10%以上を占める. そこで異なる予測サイズのIntra予測結果を利用した処理量削減手法を提案する.

CiNii
空間スケーラビリティを用いた画像解像度変換法

石川, 渡辺

画像電子学会誌 Vol. 34 ( No. 5 ) 548 - 557 2005年09月
Motion JPEG 2000における静動領域判定に基づくフリッカー低減手法の検討

伊谷, 渡辺

情報処理学会AVM研究会 2005AVM49-14 ( 164 ) 77 - 82 2005年07月

CiNii
“フィルムグレインを考慮したH.264/AVC符号化方式に関する一検討

後藤, 常松, 渡辺

情報処理学会AVM研究会 2005AVM49-06 2005年07月
H.264における動き補償ブロックサイズ高速決定手法に関する一検討

常松, 後藤, 渡辺

情報処理学会AVM研究会 2005AVM49-04 ( 66 ) 19 - 24 2005年07月

　概要を見る

H.246/MPEG-4 AVCは高い符号化効率を実現する動画像符号化方式であるが、符号化処理量が多い問題点がある。符号化処理量の多くはInter予測に関連する処理が占めており、H.264/MPEG-4 AVCでは特にその比率が高い。従来の動画像符号化方式もInter予測の占める割合は高かったが、H.264/MPEG-4 AVCでは可変ブロックサイズ動き補償、2枚以上の複数参照フレームに加え1/4画素精度の動き補償が利用できるようになっており、符号化処理量増加の主たる原因となっている。そこで本稿では符号化対象マクロブロックと探索中に参照するマクロブロックの画素値の差分を利用することで動き補償ブロックサイズの探索を効率よく削減する手法を提案する。The performance improvement in H.264/MPEG-4 AVC is provided at the expense of higher computational complexity. Most of the complexity is caused by Inter prediction. To improve coding efficiency,some functions are added in H.264/MPEG-4 AVC,such as variable block size motion compensation,multi reference frame and quarter-pel motion compensation. A fast macroblock partition decision method is proposed in this paper. The macroblock size is efficiently determined by using the pixel value difference between encoding and the referred macroblock.

CiNii
時空間画像の符号化に関する一考察

加藤, 渡辺, 安田

情報処理学会全国大会 3W5 2005年03月
通過点削減によるベクトル表現の符号化効率改善

山本, 河村, 渡辺, 富永

情報処理学会全国大会 1L5 2005年03月
ベクター変換における符号化効率の改善

河村, 山本, 渡辺

情報処理学会全国大会 3W4 2005年03月
時空間画像の符号化による時間軸ノイズの低減に関する一考察

加藤, 渡辺, 安田

電子情報通信学会総合大会 D-11-67 2005年03月
Motion JPEG 2000におけるフリッカー低減手法の検討

伊谷, 石川, 渡辺, 富永

電子情報通信学会総合大会 D-11-61 61 - 61 2005年03月

CiNii
フィルムグレインを考慮したH.264/AVCモード決定手法に関する一検討

後藤, 常松, 渡辺, 富永

電子情報通信学会総合大会 D-11-56 56 - 56 2005年03月

CiNii
モードの偏りを用いたH.264/AVC FRExt Intra モード決定手法

常松, 渡辺

電子情報通信学会総合大会 D-11-55 55 - 55 2005年03月

CiNii
二値画像のベクトル表現における符号化効率改善

山本, 河村, 渡辺, 富永

電子情報通信学会総合大会 D-11-30 2005年03月
ベクター表現における階層的変換手法の検討

河村, 山本, 渡辺

電子情報通信学会総合大会 D-11-6 6 - 6 2005年03月

CiNii
Gradation approximation for vector based compression of comic images

K Kawamura, Y Yamamoto, H Watanabe

2005 International Conference on Image Processing (ICIP), Vols 1-5 WedAmPO6 2845 - 2848 2005年 [査読有り]

　概要を見る

In this paper, we propose a method to vectorize comic images including halftone dots. Our method can prevent the jaggy and moire phenomena when the images are enlarged and shrank. The method contains three modules: halftone dots separation, gradation approximation, and vectorization. At the first module, small isolated areas and altered areas by dilation and erosion operations are separated into halftone dots images. At the second module, areas of halftone are approximated by contours and gradation parameters. At the last module, both halftone areas and line drawings are vectorized and approximated by smooth contours. The size of compressed file produced by our method is equal or smaller than JBIG compression. Validity of the proposed method is confirmed by experimental results.
H.264/AVC FRExtにおけるIntra予測モードの決定手法に関する一検討

常松, 渡辺

情報処理学会AVM研究会 2004AVM47-25 ( 77 ) 55 - 60 2004年12月

CiNii
Motion JPEG2000を利用したマルチ解像度変換システム

石川, 渡辺

情報処理学会AVM研究会 2004AVM47-25 2004年12月
Motion JPEG2000低遅延符号化のための時間方向ビット配分

佐野, 内藤, 渡辺

情報処理学会AVM研究会 2004AVM47-24 ( 77 ) 43 - 48 2004年12月

CiNii
ベクター表現の階層化に関する検討

河村, 山本, 渡辺

情報処理学会AVM研究会 2004AVM47-23 2004年12月
Distributed Video Encoding on Single-level Heterogeneous Processor Trees

A.Ghatpande, H.Nakazato, H.Watanabe

画像符号化シンポジウム PCSJ2004 P-5.11 2004年11月
二値画像のベクトル化における符号量削減に関する検討

山本, 河村, 渡辺, 富永

画像符号化シンポジウム PCSJ2004 P-2.04 2004年11月

CiNii
H.264/AVCにおけるIntra予測モードの決定手法に関する一検討

常松, 渡辺

画像符号化シンポジウム PCSJ2004 P-5.16 2004年11月

CiNii
Distributed Video Encoding on Single-level Heterogeneous Processor Trees

A.Ghatpande, H.Nakazato, H.Watanabe

画像符号化シンポジウム PCSJ2004 P-5.11 2004年11月
ベクター表現における符号量制御に関する検討

河村, 山本, 渡辺

画像符号化シンポジウム PCSJ2004 P-5.04 2004年11月

CiNii
JPEG2000 Precinct構造を用いた低遅延HDTV符号化方式

佐野, 内藤, 渡辺

情報処理学会AVM研究会 2004AVM46-02 2004年10月

CiNii
ベクター変換における曲線最適化アルゴリズムの一検討

河村, 渡辺

FIT2004 J-025 2004年09月

CiNii
JPEG2000 Precinct構造による低遅延HDTV符号化方式

佐野, 内藤, 渡辺

FIT2004 J-076 2004年09月
網点を含む2値画像のベクトル表現による高能率符号化に関する検討

河村, 渡辺

画像電子学会年次大会 2004年06月
スプライトを適応的に利用したMPEG-4映像符号化システム

秦泉寺, 岡田, 八島, 渡辺

画像電子学会誌 No.33, Vol.3 ( 3 ) 333 - 342 2004年06月

　概要を見る

著者らは，MPEG-4 Main Profileにおけるスプライトという新しい符号化ツールに着目し，動画像におけるカメラ操作のみを反映した背景領域にスプライトを適用する方法を検討してきた．スプライトが効果的な場合は，従来符号化法に比べて劇的な符号量の削減をもたらすことを明らかにしてきた．一方で，スプライトがどのような画像にも効果があるとは限らないということも知られており，適応的な利用が望まれていた．本稿では，スプライトを適応的に符号化へとりこみ，MPEG-4 Main Profile準拠の符号化システムとして開発した．本符号化システムは，スプライトが効果があると推定されるショットを自動的に抽出するスプライトショット抽出法を取り入れ，符号化品質の向上を図った．また，ストリーミング配信への適用も考慮し，低遅延スプライトに適用する2層ビデオオブジェクトレート制御法を新たに考案した．これにより，コンスタントビットレートの符号化データの生成を可能とした．また，カメラの動きのあるスポーツ映像を用いて，本提案符号化システムの評価を行い，その優位性と適応条件について考察した．<br>

DOI CiNii
モバイル環境におけるユーザクラスタリングを用いた情報推薦システムの検討

金田, 渡辺

情報処理学会AVM研究会 2003AVM44-04 2004年03月
マンガ符号化における網点の多階調近似処理の評価

河村, 渡辺, 富永

情報処理学会全国大会 3Z-1 2004年03月
マンガ符号化における濃度勾配検出法の改良

河村, 渡辺, 富永

情報処理学会AVM研究会 2003AVM44-03 ( 25 ) 13 - 18 2004年03月

　概要を見る

我々は網点を階調表現に変換し，線画と共にベクトル表現に変換するマンガ符号化を提案してきた．階調近似処理において，従来手法では，固定サイズの小領域を複数個用いてグラデーションの向き，勾配，濃度を決定してきたが，モアレの発生により正しく濃度を決定できない場合があった．また，入力解像度や網点の線数の変化に対するロバスト性が欠けていた．本稿では，2値画像において網点周期を検出する手法を検討し，網点周期を利用してモアレの発生を低減させた濃度勾配検出法を提案する．シミュレーション実験により，提案手法の有効性を確認した．Comic coding by gradation approximation and vector representation has been proposed. When gradation approximation is processed in conventional method, we used some small areas with constant size. However, in some cases, we could not detect correctly because of the moire. In this paper, we study a detection method using a period of halftone dots. Then, we propose an improved method for density-gradient detection. Our approach decreases occurrence of moire. Through some simulation experiments, we have confirmed that our proposed method is valid.

CiNii
ディジタルシネマにおける階層符号化に関する検討

石川, 田邉, 渡辺

電子情報通信学会総合大会 D-11-15 2004年03月
Motion-JPEG2000の符号量制御のためのROI符号化に関する検討

佐野, 渡辺, 富永

電子情報通信学会総合大会 D-11-17 2004年03月
Motion JPEG2000映像配信のためのマルチ解像度変換法

田邉, 石川, 渡辺

情報処理学会AVM研究会 2003AVM44-19 2004年03月

CiNii
Motion JPEG2000におけるフリッカー雑音とタイリングの関係について

加藤, 石川, 渡辺

情報処理学会AVM研究会 2003AVM44-18 2004年03月
2値画像のベクトル表現における網点の階調近似処理に関する検討

河村, 渡辺, 富永

電子情報通信学会総合大会 D-11-106 2004年03月
Vector representation of binary images containing halftone dots

K Kawamura, H Watanabe, H Tominaga

2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3 TP2-1 335 - 338 2004年 [査読有り]

　概要を見る

Vector representation of graphics has an advantage of being displayed at any size. When resolution of a bitmap image is changed, lack of a line segment arises. In addition, moire occurs when resolution of an image with halftone dots is resized. In this paper, we propose a new technique to convert the binary image with halftone dots into the vector representation. Resolution conversion of binary image can easily be performed without moire by continuous tone approximation of halftone dots. First, we separate the area of halftone dots and line drawings in the image. Next, a continuous tone approximation is applied to the area of halftone dots. Then, the conventional vectorization is applied to both continuous tone areas and line drawings. Finally these components are mixed and reconstructed. Our approach provides an efficient way of displaying cartoon like images at any size with limited amount of data.
マンガ符号化における網点のグラデーション処理の検討

河村, 渡辺, 富永

情報処理学会AVM研究会 2003AVM43-8 2003年12月
網点を含んだ2値画像のベクトル表現に関する検討

河村, 渡辺, 富永

画像符号化シンポジウム PCSJ2003 P-2.13 2003年11月

CiNii
ROIを利用したMotionJPEG2000の符号量制御に関する検討

佐野, 渡辺, 富永

画像符号化シンポジウム PCSJ2003 P-2.14 39 - 40 2003年11月

CiNii
Motion JPEG 2000における最適解像度変換法

田邉, 渡辺, 富永

画像符号化シンポジウム PCSJ2003 P-1.02 2003年11月

CiNii
An Architecture for Distributed Video Encoding on the Internet

A.Ghatpande, H.Nakazato, H.Watanabe

画像符号化シンポジウム PCSJ2003 P-2.02 2003年11月
Distributed computing for real-time video processing

H.Watanabe, A.Ghatpande, H.Nakazato

1st International Conference on Ubiquitous Computing (ICUC) 2003, 2003年10月 [査読有り]
マンガの超高圧縮符号化に関する検討

河村, 渡辺, 富永

情報処理学会AVM研究会 2003AVM42-2 ( 99 ) 7 - 16 2003年10月

　概要を見る

マンガは線画を中心に構成されているため，ベクトル化が有効な符号化手法である．しかし，単純なベクトル化は，網点の存在により逆に符号量の増加をまねく．また，網点を含んだままの解像度変換はモアレの発生原因となる．そこで，本稿では，マンガをベクトル表現する際に網点部分を分離して多階調近似し，残りの線分と共に符号化する手法について検討する．Vectorization is an effective technique for comic image coding, since comics are mainly consist of line drawings. However, an existence of halftone-dots causes an increase of coding bitrate if simple vectorization is used. In addition, moires occur when a resolution of the image with halftone-dots is changed. In this paper, we propose a new technique to achieve highly efficient comic image coding. First, we separate the area of halftone-dots and line drawings from an image. Then, a continuous tone approximation is applied to the area of halftone-dots. Next, the conventional vectorization is applied to line drawings. Finally, these two components are mixed together.

CiNii
大規模コンテンツ配信のためのアプリケーションレベル・マルチキャストツリー構築に関する検討

清水, 亀山, 渡辺

FIT2003 L-011 2003年09月
モバイル環境におけるユーザの嗜好と位置を利用した情報推薦システムの検討

金田, 亀山, 渡辺

FIT2003 M-092 2003年09月
Motion JPEG2000におけるフリッカー雑音と量子化誤差の関係について

加藤, 田邉, 渡辺, 富永

FIT2003 J-039 2003年09月

CiNii
JPEG2000におけるテキスト分離符号化方式に関する検討

石川, 富永, 渡辺

FIT2003 J-036 2003年09月
Transcoding by automatic ROI extraction from JPEG2000 bitstream

H.Watanabe, T.Ogita

PCS2003 WP1.4-7 2003年04月 [査読有り]
IDG基礎から学ぶ情報圧縮技術

亀山, 金子, 渡辺

（株）IDGジャパン ISBN4-87280-486-4 2003年04月
差分画像を利用したアニメーション映像からのオブジェクト抽出

中神, 渡辺, 富永

情報処理学会AVM研究会 2003AVM40-06 2003年03月
ホストの相対的位置情報を利用したコンテンツ配信に関する検討

清水, 亀山, 渡辺

電子情報通信学会総合大会 B-6-170 2003年03月
Wavelet符号化を動画像に適用した際の視覚的歪みに関する検討

加藤, 田邉, 渡辺, 富永

電子情報通信学会総合大会 D-11-15 2003年03月

CiNii
Motion JPEG2000における最適解像度変換法の検討

田邉, 渡辺, 富永

情報処理学会AVM研究会 2003AVM40-04 ( 24 ) 19 - 23 2003年03月

　概要を見る

デジタルシネマやHDTVなどの超高解像度映像に対する符号化方式としてMotion JPEG2000の期待が高まっている．Motion JPEG2000ではwavelet変換を用いているため，解像度スケーラビリティが容易に実現できるといわれている．しかし，実際に用いられているwaveletフィルタは，現在解像度変換に用いられているデシメーションフィルタと比べると性能が悪く，また任意サイズに対応した解像度変換が実現できないといった問題がある．本研究では，最適解像度変換法の提案を目的とし，フィルタの性能比較や画質比較などの基礎検討を行った．Motion JPEG2000 is regarded as a promising coding system for an ultra high quality image like digital-cinema and HDTV. Wavelet transform used in Motion JPEG2000 enables to realize spatial scalability easily. However, the performance of the wavelet filter used in Motion JPEG2000 is worse than those of decimation filters used in image resampling and resizing. What is worse, the wavelet filter can only resize images to half resolution. We aim to provide of the best resampling method for Motion JPEG2000. In this paper, we investigate the performance of filters and its relationship with image quality.

CiNii
ディジタル放送教科書（上）- IDG情報通信シリーズMPEG-1/MPEG-2/ MPEG-4

（株）IDGジャパン ISBN4-87280-477-5 2003年02月
Very low bit rate video compression using MPEG-4 sprite coding

K Jinzenji, H Watanabe, S Okada, N Kobayashi

ELECTRONICS AND COMMUNICATIONS IN JAPAN PART II-ELECTRONICS 86 ( 6 ) 83 - 93 2003年 [査読有り]

　概要を見る

The new MPEG-4 image coding standard that has recently been established offers higher image quality in low-bit-rate coding than conventional coding methods such as H.26X, MPEG-1, and MPEG-2, as well as coding for each object unit (a new function).
This paper proposes an algorithm for the automatic extraction of video objects consisting of a foreground object and a background sprite. It is based on "sprite coding," which can greatly reduce the number of code bits compared to conventional methods. The foreground and background video objects processed by the proposed method are subjected to MPEG-4 object coding and sprite coding, respectively, and are compared with MPEG-4 simple profile (normal) coding. In particular, the influence of the frame rate and the foreground object rate on the number of code bits was investigated. It was found that the proposed method can reproduce images with the same quality while using about one-fourth to one-half as many code bits as the normal coding method when the foreground rate is 10 to 15% of the whole image, regardless of the frame rate. In addition, the proposed method can attain more than double the frame rate of the normal coding method at a very low bit rate (e.g., 128 and 64 kbit/s) with a similar SNR. (C) 2003 Wiley Periodicals, Inc. Electron Comm Jpn Pt 2, 86(6): 83-93, 2003; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/ecjb.10158.

DOI
A novel content distribution architecture utilizing network distance prediction

N Shimizu, W Kameyama, H Watanabe

7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL, III, PROCEEDINGS Proc. Vol 13 - 17 2003年 [査読有り]

　概要を見る

Contents Delivery/Distribution Network (CDN) and Peer-to-Peer (P2P) network have been proposed for a large-scale contents distribution. We have been investigating what mechanisms are required for CDN, P2P, and both of them. It turns out that 1) optimal server selection and 2) content discovery are key role functions. In this paper, we propose a method for optimal server selection. The optimal server is defined to minimize end-to-end delay between a requesting host and the server. Here, end-to-end delay is regarded as network distance. Round trip time (RTT) can be used as a metric to predict end-to-end delay. However, it takes much cost to send a probe packet to a whole set of hosts in order to measure all RTT. Our proposed method send a probing packet to a set of servers called Landmarks, and predict which server is optimal based on these RTT. Simulation result shows the validity of the proposed method. We confirm that this approach can adapt to scalable architecture.
モバイルカメラを利用した地理情報提示方法に関する基礎検討

金田, 吉沢, 亀山, 渡辺

映像メディア処理シンポジウム IMPS2002 I-2.11 2002年11月
アニメーション映像における動き情報検出の検討

中神, 渡辺, 富永

画像符号化シンポジウム PCSJ2002 P-4.03 2002年11月
Wavelet変換符号化における量子化歪について

田邉, 加藤, 渡辺, 富永

画像符号化シンポジウム PCSJ2002 P-3.02 2002年11月

CiNii
MPEG-4 very low bitrate video coding by adaptively utilizing sprite

K.Jinzenji, S.Okada, N.Kobayashi, H.Watanabe

IEEE International Conference on Multimedia Expo (ICME) WedAmPO3: Compression II ( 3 ) 333 - 342 2002年08月 [査読有り]

　概要を見る

著者らは，MPEG-4 Main Profileにおけるスプライトという新しい符号化ツールに着目し，動画像におけるカメラ操作のみを反映した背景領域にスプライトを適用する方法を検討してきた．スプライトが効果的な場合は，従来符号化法に比べて劇的な符号量の削減をもたらすことを明らかにしてきた．一方で，スプライトがどのような画像にも効果があるとは限らないということも知られており，適応的な利用が望まれていた．本稿では，スプライトを適応的に符号化へとりこみ，MPEG-4 Main Profile準拠の符号化システムとして開発した．本符号化システムは，スプライトが効果があると推定されるショットを自動的に抽出するスプライトショット抽出法を取り入れ，符号化品質の向上を図った．また，ストリーミング配信への適用も考慮し，低遅延スプライトに適用する2層ビデオオブジェクトレート制御法を新たに考案した．これにより，コンスタントビットレートの符号化データの生成を可能とした．また，カメラの動きのあるスポーツ映像を用いて，本提案符号化システムの評価を行い，その優位性と適応条件について考察した．<br>

DOI CiNii
Dynamic ProgrammingとFeedback Algorithmによる閉曲線近似法

宮澤敏記, 中神央二, 渡辺裕, 富永英義

画像電子学会第30回年次大会 No.20 2002年06月
物体形状の推定と表面質感に関する一考察

巽英之, 渡辺裕, 大谷淳

画像電子学会第30回年次大会 No.19 2002年06月
二層構造によるアニメーション画像符号化の検討

中神央二, 宮澤敏記, 渡辺裕, 富永英義

画像電子学会第30回年次大会 No.15 2002年06月
コンテントオリエンティッド符号化実現のためシステム提案

清水直人, 宮澤敏記, 亀山渉, 渡辺裕, 富永英義

画像電子学会第30回年次大会 No.14 2002年06月
MPEG-2ビットストリームからのROI抽出とトランスコーダへの適用

田邉集, 泰泉寺久美, 渡辺裕, 富永英義

画像電子学会第30回年次大会 No.9 21 - 21 2002年06月

　概要を見る

様々な帯域が混在するネットワーク上でMPEG動画像配信を行なう際に、動画像のビットレースを伝送路の帯域に合わせて変換するトランスコーディングに関する研究が注目されている. しかし従来のトランスコーダは, ビットレート削減に伴う画質の劣化が全体に一様であり, 画像符号化方式として考えると効率的ではない. そこで本研究では, ユーザの興味領域の画質を保つ効率的なトランスコーディングの実現を目的とし, MPEG-2のビットストリームから前景領域を抽出し, トランスコーダ内で画質の制御を行う実験を行った.

DOI CiNii
JPEG2000トランスコーダにおけるROI自動抽出と評価

荻田健夫, 渡辺裕

画像電子学会第30回年次大会 No.10 2002年06月

CiNii
A Study on Extraction of Moving Objects Using Watershed Transform

S.Watanabe, K.Jinzenji, H.Watanabe

画像電子学会第30回年次大会 No.8 20 - 20 2002年06月

　概要を見る

本研究では、動画像シーケンスから画素単位で動物体を抽出することを目的としている。研究の背景として、像符号化の実現方法の一つである、MPEG4のスプライト符号化に着目した。スプライト符号化ではカメラモーションを含む動画像シーケンスから一枚のパノラマ背景画像と、前景オブジェクトの二層のオブジェクトを用いる。本研究では、背景差分法とウォーターシェッド変換をを用いて前景オブジェクトの画素単位での抽出を実現する。まず、背景差分法により前景候補領域を抽出し、ここから正確なオブジェクトの輪郭を抽出するためにウォーターシェッド変換を用いる。更に、従来法により生成されるマクロブロックで近似されたオブジェクトマスクを利用し、ウォーターシェッド変換により抽出された不要なエッジを削除する。

DOI CiNii
平滑化操作によるアニメーション画像符号化の検討

中神央二, 宮澤敏記, 渡辺裕, 富永英義

電子情報通信学会総合大会 D-11-54 2002年03月
平滑化操作によるアニメーション画像符号化の検討

中神央二, 宮澤敏記, 渡辺裕, 富永英義

情報処理学会オーディオビジュアル複合情報処理研究会 2002AVM36-11 2002年03月
デコーダダウンローダブルシステムにおけるフレキシブルデコーダ構成

清水直人, 宮澤敏記, 亀山渉, 渡辺裕, 富永英義

電子情報通信学会総合大会 D-11-119 2002年03月
デコーダダウンローダブルシステムにおけるスケジューリング方式の検討

宮澤敏記, 清水直人, 亀山渉, 渡辺裕, 富永英義

電子情報通信学会総合大会 D-11-120 2002年03月
スプライトを用いたMPEG-4超高圧縮符号化方式：VideoESPER

泰泉寺久美, 岡田重樹, 小林直樹, 渡辺裕, 米原紀子

情報処理学会オーディオビジュアル複合情報処理研究会 2002AVM36-8 ( 25 ) 43 - 51 2002年03月

　概要を見る

MPEG-4では従来符号化法にはない，強力な符号化ツールが具備されている．筆者らはスプライトという符号化ツールに着目し，動画像をあらかじめ前景背景に分離し，前景にオブジェクト符号化，背景をスプライト符号化するスプライトモード[2]の開発を行ってきた．しかし，すべての動画像にスプライトモードが適応できるとは限らなかった．筆者らは，スプライト符号化を用いた汎用的なMPEG-4動画像符号化システムである"VideoESPER (Efficient SPrite aided EncodeR)"を開発した．スプライトモードが優位なショットを自動的に抽出し，そのショットにはスプライトモードを適用し，そのほかのショットに対しては従来符号化法を踏襲するノーマルモードを適用するマルチモード符号化法を提案する．さらに，AVIデータ等を逐次的に処理するための逐次処理方法，グローバルな動きの算出法，前景領域抽出法，スプライト生成法に改良を行った．また，符号化においては，前景背景に自動的に符号量を配分するためのマルチビデオオブジェクト(MVO: Multi Video Object)レート制御法を提案し，符号化効率を上げるために，スプライトに幾何変換を行う中間解像度変換法，スプライト回転法を提案する．また，実際に符号化実験を行って，従来符号化方式に対するVideoESPERの優位性を示した．一方で，VideoESPERにおける限界についても述べる．MPEG-4, a new audio-visual coding standard has strong compression tools, such as sprite coding. We have focused on sprite coding and proposed "Sprite Mode". In sprite mode, video sequence is divided into foreground object and background object. Foreground object is compressed using MPEG-4 object coding., while background object is compressed using sprite coding. Sprite mode does not always adopt any video sequence such that images having still camera motion can be compressed in enough quality in low bit-rate. So, in this paper, we developed new MPEG-4 video encoder using sprite coding , VideoESPER , which means Video Efficient Sprite aided EncodeR. In Video ESPER, multi-mode coding is newly proposed. Video shot suitable for conventional coding method is coded by normal mode (conventional method), while video shot suitable for sprite coding is coded by sprite mode. These two modes are automatic selected in any video sequences. We improved the processing flow using stuck system in order to consequently process video such as AVI format. We improved global motion detection algorithm, foreground object and background sprite generation algorithm. Then we proposed the simple rate-control for sprite mode and geometric sprite transforming method for more efficient sprite coding. Then, we have some experiment on MPEG-4 video coding. The result shows effectiveness' of VideoESPER in low bit-rate coding. On the other hand, limitation of ViddeoESPER is also discussed.

CiNii
ROIの自動抽出に基づくJPEG2000トランスコーダの一検討

荻田健夫

電子情報通信学会総合大会 D-11-118 2002年03月

CiNii
MPEG-2 Bit Stream からの動領域抽出法の検討

田邉集, 泰泉寺久美, 渡辺裕, 富永英義

情報処理学会オーディオビジュアル複合情報処理研究会 2002AVM36-7 2002年03月
JPEG2000を用いた静止画中の物体の質感表現の検討

電子情報通信学会総合大会 D-11-172 2002年03月
Dynamic Programmingによるセル画アニメーション内均等色輪郭線符号化の検討

宮澤敏記, 中神央二, 亀山渉, 渡辺裕, 富永英義

情報処理学会オーディオビジュアル複合情報処理研究会 2002AVM36-12 ( 25 ) 71 - 76 2002年03月

　概要を見る

セル画アニメーション内には多数の均等色領域が存在する．この領域の輪郭線を有意点及び均等色を代表する色で近似することで圧縮率改善が図られる．そこで，本稿では輪郭線近似のための有意点をdynamic programmingを基に抽出し，輪郭を近似するアルゴリズムを提案する．Dynamic programmingを用いることで，誤差関数を基に，近似関数に最適な有意点を繰り返し抽出可能となる．Dynamicprogrammingは繰り返し演算に伴って，計算時間が大きくなることが問題である．そこで，本アルゴリズムの実行時間評価を行い，計算時間に影響を与える要素について検討を行った．一方，本アルゴリズムは，1)点数削減率を正確に制御できない，2)閾値によって輪郭線内に局所的に抽出点が発散してしまう問題がある．これらの問題を解決するためにFeedback algorithmを提案する．There are many homogeneous color regions in animation images. By approximating the regions by significant points and representative colors, coding efficiency can be improved. In this paper, we propose an approximation method for the regions based on dynamic programming. An approximation criterion used in the method is an error value defined by the distance between an original and an approximated line. If the error value is smaller than a threshold, a significant point that provides the minimum error value is extracted from the line. This process is iteratively executed until the error value exceeds the threshold. Since the the number of iteration tends to be large, we evaluate the processing time of significant elements, which is dominant for computational complexity. The remaining problems, in the proposed approximation, are 1) lack of point reduction ratio contorol and 2) excess extraction of significant points at some threshold. We also propose feedback algorithm in order to solve the problems.

CiNii
MPEG-4 very low bit-rate video compression by adaptively utilizing sprite to short sequences

Kumi Jinzenji, Shigeki Okada, Naoki Kobayashi, Hiroshi Watanahe

Proceedings - 2002 IEEE International Conference on Multimedia and Expo, ICME 2002 1 653 - 656 2002年 [査読有り]

　概要を見る

In MPEG-4, a video sequence can be divided into foreground object and background objects that are independently encoded. Using sprites can dramatically compress the overall bit rate, but not all video sequences can be so encoded. This paper introduces MPEG-4 multimode coding
it offers automatic coding mode decision, video object generation and high compression efficiency. The source video sequence is segmented and each segment is automatically categorized as either "normal", which is encoded using MPEG-4 simple profile, or as "sprite". Coding experiments show that if the bit rate is low, multimode coding offers higher coding efficiency than regular MPEG-4 in terms of frame rate and image quality.

DOI
Pixel-based extraction of moving objects for sprite coding

S Watanabe, K Jinzenji, H Watanabe

6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL III, PROCEEDINGS D-11-96 195 - 200 2002年 [査読有り]

　概要を見る

In this paper, a method to extract moving objects on a pixel base for MPEG-4 "sprite coding" is proposed. Sprite coding is a form of object coding: it uses a unified panoramic background image derived from a sequence having a camera motion, and a foreground object as video object planes (VOP's). The proposed algorithm utilizes background difference, which is the difference between the original image and the background image, and watershed transform to extract the foreground moving objects. We first apply background difference to generate a foreground candidate image. Then, watershed transform is applied to this candidate image to extract the contours of the foreground objects. Furthermore, we utilize the macroblock mask of the foreground objects, which is generated by the conventional method, to delete the unnecessary edges extracted by watershed transformation. Results given by our proposed method are more than twice as better than that of the conventional method.
Dynamic ProgrammingとFeedback Algorithmによる閉曲線近似法

宮澤敏記, 中神央二, 渡辺裕, 富永英義

画像電子学会第30回年次大会 No.20 ( 489 ) 484 - 488 2002年
A study on content-oriented coding scheme and decoder downloadable system

O Nakagami, N Shimizu, T Miyazawa, W Kameyama, H Watanabe, H Tominaga

6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL III, PROCEEDINGS Proc. Vol. III 111 - 116 2002年 [査読有り]
A study on two-layer coding for animation images

O Nakagami, T Miyazawa, H Watanabe, H Tominaga

IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS WedAmPO3: Compression II 621 - 624 2002年 [査読有り]

　概要を見る

A coding scheme specifically designed for animation images is proposed. Taking characteristics of animation images into account, lines and homogeneous color regions are extracted from animation images. Lines and contours of the homogeneous color regions are approximated by straightline and spline functions. We found that smoothing operations are effective to extract homogeneous regions from background images. Connected Filter is used for smoothing operation while keeping edges of the original image correctly. Precise approximation of the contours is also required. For this purpose, we propose to use Dynamic Programming with a feedback algorithm. As a result, an animation image can be represented by two layers; significant points as a base layer, and DCT blocks for an additional layer. For compensation scheme to the loss caused by smoothing operation, DCT is applied to the difference between the original images and lines and homogeneous color regions images. High-quality images can be obtained by adding this differential data to the approximated images.
MPEG-4 very low bit-rate video compression by adaptively utilizing sprite to short sequences

K Jinzenji, S Okada, N Kobayashi, H Watanabe

IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS WedAmPO3: Compression II 653 - 656 2002年 [査読有り]

　概要を見る

In MPEG-4, a video sequence can be divided into foreground object and background objects that are independently encoded. Using sprites can dramatically compress the overall bit rate but not all video sequences can be so encoded. This paper introduces MPEG-4 multi-mode coding; it offers automatic coding mode decision, video object generation and high compression efficiency. The source video sequence is segmented and each segment is automatically categorized as either "normal", which is encoded using MPEG-4 simple profile, or as "sprite". Coding experiments shows that if the bit rate is low, multi-mode coding offers more high coding efficiency than regular MPEG-4 in terms of frame rate and image quality.
A study on two-layer coding for animation images

O Nakagami, T Miyazawa, H Watanabe, H Tominaga

IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS WedAmPO3: Compression II 621 - 624 2002年

　概要を見る

A coding scheme specifically designed for animation images is proposed. Taking characteristics of animation images into account, lines and homogeneous color regions are extracted from animation images. Lines and contours of the homogeneous color regions are approximated by straightline and spline functions. We found that smoothing operations are effective to extract homogeneous regions from background images. Connected Filter is used for smoothing operation while keeping edges of the original image correctly. Precise approximation of the contours is also required. For this purpose, we propose to use Dynamic Programming with a feedback algorithm. As a result, an animation image can be represented by two layers; significant points as a base layer, and DCT blocks for an additional layer. For compensation scheme to the loss caused by smoothing operation, DCT is applied to the difference between the original images and lines and homogeneous color regions images. High-quality images can be obtained by adding this differential data to the approximated images.
A novel decoder-downloadable system for content-oriented coding

N Shimizu, T Miyazawa, W Kameyama, H Watanabe, H Tominaga

GLOBECOM'02: IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, VOLS 1-3, CONFERENCE RECORDS GEN-01-1 1638 - 1642 2002年 [査読有り]

　概要を見る

In this paper, a new system architecture called decoder-downloadable system is described. The purpose of this system is to provide a uniform platform for the multimedia world based on image compressions using characteristics of the contents. This system enables dynamic decoder downloading by negotiation between servers and clients for seamless and minimum delay playback. This negotiation scheme is implemented by Java and CORBA (Common Object Request Broker). Thus, servers and clients in this system are independent from OSes and hardware specifications. This system is suitable for TV broadcasting as well as Internet streaming.
We improve a decoder architecture in order to make the system more flexible. A decoder has some functions such as bitstream parsing, image compression algorithms (DCT, Wavelet, Motion estimation and so on). However, conventional decoders are monolithic software. Thus, it is impossible to share parts of decoders. The proposed method enables a module-based decoder and sharing some modules among some decoders. Consequently, the flexible decoder can make downloading time shorter by avoidance to download redundant parts of decoders. It also achieves scalabillity enabling this system to be used in some multimedia applications, such as a multimedia content search system, as well as a multimedia player.
Pull型サービスにおけるデコーダダウンローダブルシステムの構成及び評価

宮澤敏記, 清水直人, 亀山渉, 渡辺裕, 富永英義

画像符号化シンポジウム PCSJ2001 P-P2.06 2001年11月
箱で運んだ会議ドキュメント

渡辺裕

画像電子学会誌 30;5;pp.629-630 2001年09月
Sprite Coding in Object-based Video Coding Standard: MPEG-4

H.Watanabe, K.Jinzenji

World Multiconference on Systemics, Cybernetics and Informatics (SCI) 2001 Vol.XIII;pp.420-425 2001年07月 [査読有り]
Animation Image Coding

T.Miyazawa, W.Kameyama, H.Watanabe, T.Sakatani, H.Tominaga

World Multiconference on Systemics, Cybernetics and Informatics (SCI) 2001 Vol.XIII;pp.404-409 2001年07月 [査読有り]
MPEG-4スプライト符号化を用いた超低レート動画像圧縮

秦泉寺久美, 渡辺裕, 岡田重樹, 小林直樹

電子情報通信学会論文誌D-II J84-D-II;5;pp.758-768 ( 5 ) 758 - 768 2001年05月 [査読有り]

　概要を見る

新しい画像符号化標準のMPEG-4が標準化されつつある。これは、低いレートにおいてはH.26XやMPEG-1,2に代表される従来符号化法と比較してより高品質の画像を提供するものである。また、新しい機能であるオブジェクト単位の符号化を提供するものである。筆者らはインターネット等に適用できる超低ビットレート符号化方法の開発を行っている。従来法に比べて劇的な符号量削減が可能であるMPEG-4の符号化ツールである「スプライト符号化」に着目し、前景、背景の2層からなるビデオオブジェクトの自動抽出アルゴリズムを提案する。生成した前景、背景のビデオオブジェクトをそれぞれMPEG-4オブジェクト符号化並びにスプライト符号化に適用(スプライトモード)して、スプライトを用いないMPEG-4 Simple profileの符号化法(ノーマルモード)と比較検討を行った。特に、フレームレートや前景オブジェクトの割合を変化させ、符号量に与える影響を調査した。その結果、フレームレートに関係なく、前景比率が画面全体の10〜15%程度である場合にMPEG-4通常符号化方法の1/4〜1/2程度の符号量で同程度の画質を実現できることを確認した。また超低レート(128kbit/s、64kbit/s)においてフレームレート、客観画質の評価を行ったところ、同程度のSNRで倍以上のフレームレートを達成できることを確認した。

CiNii
An Approach to MPEG-4 Multi Mode Coding Using Sprite Coding

S.Okada, K, Jinzenji, H.Watanabe, N.Kobayashi

International Picture Coding Symposium PCS2001 pp.421-424 421 - 424 2001年04月 [査読有り]

CiNii
アニメーション画像符号化の基礎検討

宮澤敏記, 亀山渉, 渡辺裕, 阪谷徹, 富永英義

電子情報通信学会 D-11-16 66 - 66 2001年03月

CiNii
アニメーション画像符号化における輪郭線抽出と近似方式の検討

宮澤敏記, 亀山渉, 渡辺裕, 阪谷徹, 富永英義

情報処理学会AVM研究会 AVM32-12, pp. 65-70 2001年03月
Information Technology:Multimedia Standards-what's next?

Hiroshi Watanabe

ISO Bulletin 32-1 page4 2001年01月
スプライト符号化を利用したMPEG-4マルチモード符号化方式の研究

岡田重樹, 秦泉寺久美, 渡辺裕, 小林直樹

画像符号化シンポジウムPCSJ2000 P-P. 2. 05 2000年11月

CiNii
MPEG-4, JPEG2000符号化の現在と将来像

画像符号化シンポジウムPCSJ2000 パネリストセッション 2000年11月
Visual Communication in Mobile Era

MoMuC2000 パネリスト 2000年10月
SC29 and Multimedia

MoMuC2000 キーノートスピーカー 2000年10月
MPEG-4スプライト符号化を用いた超低レート動画像圧縮

秦泉寺久美, 岡田重樹, 渡辺裕, 小林直樹

画像電子学会研究会 0-03-02;pp.7-13 7 - 13 2000年09月

CiNii
参照フレーム変更によるビットストリームスケーリング(共著)

清水淳, 嵯峨田淳, 渡辺裕, 小林直樹

電子情報通信学会論文誌D-(]G0002[) J83-D-II,3,967-976 ( 3 ) 967 - 976 2000年03月

　概要を見る

インターネットでは, ユーザの接続環境により伝送レートや伝送経路の品質も様々である.このため, インターネット上で映像伝送サービスを提供するには, 符号化ビットストリームから, 実時間で任意の伝送レートに変換するビットストリームスケーリング技術が必要となる.従来, スケーリング方法として, DCT係数の符号量を削減滅する方法や, 他のフレームから参照されない符号化レームを間引く方法が提案されている.しかし, インターネット映像伝送サービスを提供するには, 従来方法のスケーリングの自由度では, 不十分である.そこで, 本論文では, 演算コストを抑えた参照フレームを間引くビットストリームスケーリング方法を提案する.提案方法は, 動きベクトル検索の代わりに, もとの動きベクトルから新たな動きベクトルを推定し, 動き補償予測の参照関係を変更する.これにより演算コストを抑えつつ, スケーリングの自由度が拡大できることを示す.また, 推定した動きベクトルと復号フレームから検出した動きベクトルを比較し, 動きベクトル推定方法の確度についても検証する.

CiNii
背景スプライトを用いた動画像圧縮に関する一考察

秦泉寺, 岡田重樹, 渡辺裕, 小林直樹

電子情報通信学会全国大会 D-11-52 52 - 52 2000年03月

CiNii
参照フレーム変更によるビットストリームスケーリング

清水淳, 嵯峨田淳, 渡辺裕, 小林直樹

電子情報通信学会論文誌D-II J83-D-II,3,pp.967-976 2000年03月

CiNii
スプライト符号化に適したショット判定方法

電子情報通信学会全国大会 D-11-53 2000年03月
スプライト符号化に適したショットの判定方法

岡田重樹, 秦泉寺久美, 渡辺裕, 小林直樹

情報処理学会オーディオビジュアル複合情報処理研究会 2000-AVM-28,28-11,pp.61-66 ( 24 ) 61 - 66 2000年03月

　概要を見る

MPEG-4 Main Profileでサポートされているスプライト符号化は，オブジェクト符号化方式の一種で，符号量を劇的に削減できる方式である。筆者らは，この符号化方式を利用し，インターネット等に適用できる高品質超低ビットレートを実現するアプリケーションの研究開発を行っている。しかし，スプライト符号化は全ての映像に適用できないという問題点がある。本稿では，スプライト符号化に適している映像を分析し，また，この判定を効率的に行う手法を提案する．Sprite coding is supported by the new video coding standard MPEG-4 Ver.1 Main Profile. It provides content-based functionality and low bit-rate video compression. We focused on this coding technique in order to develop high quality and very low bit-rate application that is effective in narrow-band transmission such as Internet. In this paper, we analyze that what kind of video sequence is suitable for sprite coding by checking a few parameters. This paper also proposes an automatic shot decision algorithm for sprite coding.

CiNii
Global Motion Estimation for Sprite Production and Application to Video Coding

JINZENJI K.

IEEE ISPACS'98 J83-D-II,2,535-544 328 - 332 2000年02月

CiNii
スプライト生成のためのグローバルモーション算出法と符号化への適用

秦泉寺久美, 渡辺裕, 小林直樹

電子情報通信学会論文誌D-II J83-D-II;2,pp.535-544 ( 2 ) 535 - 544 2000年02月

　概要を見る

スタティックスプライトを用いた符号化では, スタティックスプライトの生成精度が符号化効率, SNR, 主観評価に大きく影響する.動画像からスタティックスプライトを生成するにはアウトライヤ(外れデータ)の影響のないカメラモーションだけを反映したグローバルモーションを算出する必要がある.本論文では, 局所的動きベクトルとその微分値の特徴空間での分布の特性を利用し, グローバルモーションの複数候補による検定手法を用いた, アウトライヤの影響の少ないグローバルモーション算出法を提案する.また, 提案手法を用いて算出されたグローバルモーションをMPEG-4VMVer.11に適用して, (1)LME符号化, (2)GME符号化, (3)ダイナミックスプライト符号化, (4)スタティックスプライト符号化を行い, それぞれの符号量と画品質を評価した.その結果, スタティックスプライト符号化はLME符号化, GME符号化, ダイナミックスプライト符号化に比べて高圧縮率を達成できることがわかった.

CiNii
Automatic two-layer video object plane generation scheme and its application to MPEG-4 video coding

K Jinzenji, S Okada, H Watanabe, N Kobayashi

ISCAS 2000: IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS - PROCEEDINGS, VOL III pp.606-609 606 - 609 2000年 [査読有り]

　概要を見る

The new video coding standard MPEG-4 provides content-based functionality and low bit-rate video compression. We focused on the "sprite coding" supported MPEG-4 Version 1 Main profile in order to achieve "VHS quality video on 2B (128bps)" for narrow-band transmission such as the Internet. Automatic VOP (Video Object plane) generation technologies are being studied as one of the most important issues of MPEG-4 object coding. This paper proposes a two-layer VOP generation scheme with some core algorithms such as Gh IE (Global Motion Estimation). foreground moving object extraction, and background sprite generation. This paper also describes a shape information reduction method for foreground objects. Using this method, shape information is compressed by 90%. Experiments are conducted on VOP generation and video coding with MPEG-4. Coding efficiency is 3-4 times higher than that of typical existing video coding schemes at the same subjective image quality.
二層ビデオオブジェクト自動生成とMPEG-4符号化への適用

秦泉寺久美, 渡辺裕, 小林直樹

電子情報通信学会画像工学研究会 IE99-115,pp.49-54 ( 80 ) 49 - 54 1999年12月

　概要を見る

新しい画像符号化標準のMPEG-4が標準化されつつある.これは, 新しいファンクショナリティであるオブジェクト単位の符号化を提供するものである.著者らはインターネットアプリケーションなどに適用できる超低ビットレート符号化方法の研究を行っており, "VHS on 2B"(128kbpsでVHSビデオ並の品質)の符号化方式の開発を行っている.本稿では, MPEG-4 Version 1 Main profileにてサポートされている「スプライト符号化」に着目し, スプライト符号化の課題の一つであるビデオオブジェクトの自動抽出アルゴリズムを提案した.また, 提案アルゴリズムをMPEG-4符号化して従来符号化方法と比較検討を行った.同程度の画質を, 従来符号化方法の1/2以下の符号量で実現した.

DOI CiNii
スプライン/DCTハイブリッド符号化方式を用いたアニメーション映像における領域抽出の検討

映像情報メディア学会マルチメディア情報処理研究会 MIP99-76,pp.23-30 1999年11月
Next-generation multimedia coding standard: MPEG-4

H Watanabe, S Takamura, T Moriya

NTT REVIEW 11 ( 6 ) 73 - 74 1999年11月 [査読有り]

　概要を見る

PEG-LC is an international standard for multimedia coding that has been defined by ISO/IEC JTC 1/SC 29/WG 11. Conventional audio-visual coding is for compression of natural images, natural audio and speech, whereas, MPEG-LC is for compression of synthetic images such as computer graphics and synthetic audio as well as natural audio-visual sources. These sources are separately coded as different objects at the encoder so that they can be decoded according to user preference. This article describes MPEG-4 version one, for which the Final Draft International Standard (FDIS) voting phase has finished (1999).
演算量低減を考慮したグローバル動き・輝度変化補償動画像符号化

上倉一人, 渡辺裕, 小林直樹, 一之瀬進, 安田浩

電子情報通信学会論文誌B J82-B;9,pp.1676-1688 ( 9 ) 1676 - 1688 1999年09月

　概要を見る

画面全体に生じる動きや時間的輝度変化への対処法としてグローバル動き補償,グローバル輝度変化補償が提案されており,これらにより符号化効率の大幅な改善が期待できる.本論文では,特にグローバル輝度変化補償で問題であった膨大な演算量を低減するために,ブロック単位で行われる動き推定処理及び輝度変化パラメータ推定処理に簡易な手法を導入するとともに,その動き推定処理をグローバル動き補償,ローカル動き補償時の動き推定処理と共通化することにより,少ない演算量でこれらの機能を併せもつ符号化方式を提案する.実験で用いたブロックサイズ,動き探索範囲の場合,従来の各補償方法を単純に組み合わせる場合に比べ乗算回数が約1/400に低減される.また符号化効率の面では,グローバル動き補償と輝度変化補償を個別に適用した場合の効率改善効果を併せもつ効果が得られることを示す.

CiNii
アニメーション映像における領域抽出の一検討

古角

画像符号化シンポジウム(PCSJ '99) P.3.15,pp.65-66 65 - 66 1999年09月

CiNii
VOP自動生成とMPEG-4符号化への適用

画像符号化シンポジウム(PCSJ '99) P.4.03,pp.73-74 1999年09月
MPEG-2ビデオ符号化チップSuperENCにおける1-passVBR制御方式

画像符号化シンポジウム(PCSJ '99) P.1.02,pp.5-6 1999年09月
次世代マルチメディア符号化国際標準MPEG-4

NTT技術ジャーナル 11;8,pp.56-57 1999年08月
Automatic VOP Production Scheme for Very Low Bit Rate Video Coding

Picture Coding Symposium '99 pp.299-302 1999年04月
Automatic VOP Production Scheme for Very Low Bit Rate Video Coding

JINZENJI K.

Picture Coding Symposium '99 pp.299-302 299 - 302 1999年04月

CiNii
Global brightness-variation compensation for video coding

K Kamikura, H Watanabe, H Jozawa, H Kotera, SN Ichinose

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY 8 ( 8 ) 988 - 1000 1998年12月 [査読有り]

　概要を見る

In this paper, a global brightness-variation compensation (GBC) scheme is proposed to improve the coding efficiency for video scenes that contain global brightness variations caused by fade in/out, camera-iris adjustment, flicker, illumination change, and so on. In this scheme, a set of tno brightness-variation parameters, which represent multiplier and offset components of the brightness variation in the whole frame, is estimated. The brightness variation is then compensated using the parameters. We also propose a method to apply the GBC scheme to component signals for video coding. Furthermore, a block-by-block ON/OFF control method for GBC is introduced to improve coding performance even for scenes including local variations in brightness caused by camera Bashes, spotlights, and the like, Simulation results show that the proposed CBC scheme with the ON/OFF control method improves the peak signal-to-noise ratio (PSNR) by 2-4 dB when the coded scenes contain brightness variations.
27-6 フレーム間差分の累積を用いた動領域抽出法

秦泉寺久美, 渡辺裕, 小林直樹

映像情報メディア学会年次大会講演予稿集 1998 415 1998年

　概要を見る

本稿では, MPEG-4におけるVOPの自動生成を目的として, フレーム間差分の累積とMorphological filterを用いた動領域の抽出法を提案し, 抽出実験を行い良好な結果を得たので報告する.

DOI CiNii
動的演算量スケーラブルアルゴリズムによるソフトウェア画像符号化(共著)

電子情報通信学会論文誌D-(]G0002[) J80-D-II,2,444-458 1997年02月
Obtaining higher resolution in Asia: Advances in digital signal processing for HDTV

Hiroshi Watanabe, Hiroshi Yasuda

IEEE Signal Processing Magazine 14 ( 4 ) 52 - 64 1997年

DOI
64 kbit/s Video coding algorithm using adaptive gain/shape vector quantization

Hiroshi Watanabe, Yutaka Suzuki

Signal Processing: Image Communication 1 ( 2 ) 87 - 102 1989年

　概要を見る

A hybrid coding algorithm, consisting of motion compensated interframe prediction and adaptive gain/shape vector quantization, is proposed for low bit rate video coding. A video coding system using the proposed coding algorithm and its coding characteristics are also described. Motion compensated interframe prediction is an efficient technique for reducing temporal redundancy contained in moving pictures. In the proposed scheme, constant amplitude blocks ("gray level blocks") are also included in search vectors in order to increase coding efficiency for quick object movement or scene change. Adaptive gain/ shape vector quantization with a tree search codebook is employed to encode motion compensated interframe difference signals. It can respond to changes in source statistics because its codebook is independent of the gain components of the input vectors. The proposed coding algorithm has been evaluated by computer simulations and shown to be effective for low bit rate video transmission. A 64 kbit/s video coding system based on this coding algorithm has been implemented. The developed coding system can transmit multiplexed video, audio and digital data at the basic ISDN rate, and can be applied to videophone as well as videoconferencing. © 1989.

DOI
MPEG-2 Coding Control Technology

NTT R&D 46/6,589-594
MPEG2符号化制御技術(共著)

NTT R&D 46/6,589-594

▼全件表示

書籍等出版物

そこが知りたい最新技術オーディオ・ビデオ圧縮入門

亀山, 金子, 渡辺

(株)インプレスR 2007年04月
改訂版ディジタル放送教科書（上）（下）

亀山, 花村, 渡辺ほか

（株）インプレス 2004年10月 ISBN: 4844320270
Fundamentals of Digital Image Compression

1996年
ディジタル画像圧縮の基礎(共著)

日経BP社 1996年

共同研究・競争的資金等の研究課題

機械学習による人物運動解析の高精度化

日本学術振興会科学研究費助成事業

研究期間:

2020年04月

-

2023年03月

渡辺裕

　概要を見る

本研究では、映像のみから人物の姿勢情報を機械学習により推定し、推定された人物の骨格座標を用いて運動解析や動作解析を行い、動作認識や個人認識の識別率の向上を目指す。機械学習を用いた骨格情報推定アルゴリズムは、非接触なアプローチであることに大きな利点がある。したがって、これらのアルゴリズムはスポーツ運動分析などの分野に対して幅広く適用できると考えられる。スポーツ運動分析や介護姿勢推定の分野では、パターン認識に加えて、推定された骨格情報の位置、角度、同期性、一致度の解析が望まれる。これらの解析結果から、運動や動作のパフォーマンスを向上させる動きに関する知識を獲得する
大規模マンガデータベースのための自動要約生成の研究

日本学術振興会科学研究費助成事業

研究期間:

2017年04月

-

2020年03月

渡辺裕, 柳澤秀彰

　概要を見る

2019年度は、キャラクタ顔画像分類の改善およびマンガ要約手法について検討した。これまで得られた研究成果を基にマンガの要約手法を確立するため、画像情報を用いたマンガ要約生成手法を提案し、主観的評価による検証を行った。評価実験では本学の生徒13名を研究補助者として雇用し、要約評価アンケートを実施した。本年度の研究成果として以下の3点が挙げられる。(1) キャラクタ顔画像のクラスタリングの高精度化を目指して、自己教師あり学習手法であるDeepClusterを組み合わせたCNNの学習について検討した。しかし、DeepClusterを組み合わせると顔画像の分類精度が低下することが分かった。(2) キャラクタ顔画像クラスタリングの高性能化を目的として、密度ベースのクラスタリング手法であるOPTICSおよびHDBSCANの適用を検討した。DBSCANとOPTICSの比較では、それぞれAMIが最大値をとるようにパラメタを設定した場合において、DBSCANは67.1%、OPTICSは58.7%の精度を示した。(3) 画像情報を用いたマンガ要約の生成について、主観的評価により有効性の検証を行った。「ランダムにページを抽出した要約(要約A)」、「大きなコマを持つページとその前後ページを抜き出した要約(要約B)」、「キャラクタの初登場シーンに着目した要約(要約C)」、「ロールコミュニティモデルを適用した要約(要約E)」の4手法に対してアンケートによるスコア付与を行った。実験結果からは、総合的な評価では提案手法の優位性は確認できなかった。しかし、特定の作品に対しては評価の高い要約が得られた。また、要約生成の自動化を検討するため、要約Cおよび要約Eについて、キャラクタ検出とキャラクタ顔画像分類を用いて生成した要約(要約D・要約F)との比較を行った。実験では、要約の満足度と情報量に関して自動化によるスコアの低下が見られた一方で、あらすじの理解度に関してはほぼ同等のスコアが得られた
マンガ検索のための自動要約生成に関する研究

日本学術振興会科学研究費助成事業

研究期間:

2013年04月

-

2016年03月

渡辺裕, 石井大祐

　概要を見る

マンガの自動要約を実現するためには、マンガに含まれるメタデータの抽出が必要である。このうち最も重要なものは登場キャラクターである。本研究では、キャラクター検出の高精度化を進めた。その結果、HOG特徴量を改良したDeformable Part Model (DPM) が有効であることを示した。また、事前知識がない状態からクラスリングにより主要キャラクターを同定する手法について検討した。これにより、正例と負例をDPMに与えることが可能となった。さらに、ロールコミュニティーモデルというキャラクターの登場場所・回数を検査する手法により、特定ページの重要度が決定でき、自動要約が実現できることがわかった
マンガ検索のための自動要約生成に関する研究

科学研究費助成事業(早稲田大学) 科学研究費助成事業(基盤研究(C))

研究期間:

2013年

-

2015年

　概要を見る

研究目的：マンガエピソードの自動要約生成には、登場キャラクターリストの自動生成が必須である。キャラクターの抽出には、その顔画像抽出が最も有効である。したがって、自然画像からの顔画像抽出を目的として開発されてきたハール変換型(Haar-like)の特徴量を、線画像主体で描かれるマンガ画像に対しても動作するように修正・拡張を行い、抽出精度の向上を目指している。
研究方法：キャラクター抽出を目的とした特徴量として、対象をハール変換型(Haar-like)のみに限定せず、勾配ヒストグラム(HOG)を基盤としたDeformable Part Modelなどの最新の手法も線画像に適用してその効果を確認した。また判別にはサポートベクターマシーン(SVM)を用いることとした。
キャラクター抽出には、マンガのキャラクター顔画像を数多く集積したデータセットが必要となる。そこで研究に協力的なマンガ作家の好意により、いくつかのマンガを提供してもらい顔画像（正例）切出し及び不正解画像（負例）切出しを行い、データベース化に向けた作業に入った。
研究成果：1. 勾配ヒストグラム(HOG)特徴量を用いてキャラクター抽出を試みる場合、正例と負例画像の識別の際にキャラクターが含まれる背景画像も抽出性能に影響を与えることが分かった。背景がセリフ入りの吹き出し画像、オノマトペが含まれる画像、それ以外に3分類した場合、オノマトペが背景に存在する場合には、キャラクター抽出への影響は小さいことが分かった。（電子情報通信学会総合大会において発表）
2. 勾配ヒストグラム(HOG)を基盤としたDeformable Part Modelは、顔画像中のいくつかの部分の相対的位置関係を利用するため、ハール変換型(Haar-like)特徴量よりもマンガキャラクターの顔画像抽出精度が高いことが分かった。（IEVC2014に投稿中）
オブジェクトベースの映像符号化

研究期間:

2000年

-

2001年
アニメーション映像符号化

研究期間:

2000年

-

▼全件表示

Misc

スタティックスプライト符号化における符号化効率の一検討

秦泉寺久美, 渡辺裕, 小林直樹

電子情報通信学会ソサイエティ大会講演論文集 1998 143 1998年09月

CiNii
スタティックスプライト生成のためのグローバルモーション算出法と符号化への適用

秦泉寺久美, 渡辺裕, 石橋聡, 小林直樹

電子情報通信学会技術研究報告. PRMU, パターン認識・メディア理解 98 ( 206 ) 47 - 53 1998年07月

　概要を見る

スタティックスプライトを用いた符号化では, スタティックスプライトの生成精度が符号化効率, SNR, 主観評価に大きく影響する.動画像からスタティックスプライトを生成するにはアウトライヤの影響のないカメラモーションだけを反映したグローバルモーションを算出する必要がある.本稿では, アウトライヤの影響の少ないグローバルモーション算出法を提案する.また, 提案手法を用いて算出されたグローバルモーションを使用して, MPEG-4VMVer.11を用いて, (1)LME符号化, (2)GMC符号化, (3)ダイナミックスプライト符号化, (4)スタティックスプライト符号化を行い, それぞれの符号量と画品質を評価した.その結果, スタティックスプライト符号化はLME符号化, GMC符号化, ダイナミックスプライト符号化に比べて高圧縮率を達成できることが分かった.

CiNii
スタティックスプライト生成のためのグローバルモーション算出法と符号化への適用

秦泉寺久美, 渡辺裕, 石橋聡, 小林直樹

電子情報通信学会技術研究報告. IE, 画像工学 98 ( 204 ) 47 - 53 1998年07月

　概要を見る

スタティックスプライトを用いた符号化では, スタティックスプライトの生成精度が符号化効率, SNR, 主観評価に大きく影響する.動画像からスタティックスプライトを生成するにはアウトライヤの影響のないカメラモーションだけを反映したグローバルモーションを算出する必要がある.本稿では, アウトライヤの影響の少ないグローバルモーション算出法を提案する.また, 提案手法を用いて算出されたグローバルモーションを使用して, MPEG-4VMVer.11を用いて, (1)LMB符号化, (2)GMC符号化, (3)ダイナミックスプライト符号化, (4)スタティックスプライト符号化を行い, それぞれの符号量と画品質を評価した.その結果, スタティックスプライト符号化はLME符号化, GMC符号化, ダイナミックスプライト符号化に比べて高圧縮率を達成できることが分かった.

CiNii
スタティックスプライト生成のためのグローバルモーション算出法と符号化への適用

秦泉寺久美, 渡辺裕, 石橋聡, 小林直樹

電子情報通信学会技術研究報告. MVE, マルチメディア・仮想環境基礎 98 ( 208 ) 47 - 53 1998年07月

　概要を見る

スタティックスプライトを用いた符号化では, スタティックスプライトの生成精度が符号化効率, SNR, 主観評価に大きく影響する.動画像からスタティックスプライトを生成するにはアウトライヤの影響のないカメラモーションだけを反映したグローバルモーションを算出する必要がある.本稿では, アウトライヤの影響の少ないグローバルモーション算出法を提案する.また, 提案手法を用いて算出されたグローバルモーションを使用して, MPEG-4VMVer.11を用いて, (1)LME符号化, (2)GMC符号化, (3)ダイナミックスプライト符号化, (4)スタティックスプライト符号化を行い, それぞれの符号量と画品質を評価した.その結果, スタティックスプライト符号化はLME符号化, GMC符号化, ダイナミックスプライト符号化に比べて高圧縮率を達成できることが分かった.

CiNii

産業財産権

画像符号化方法、画像符号化装置および画像符号化プログラム

5561611

渡辺裕, 石川孝明

特許権
画像符号化方法、画像符号化装置および画像符号化プログラム

5626728

石川孝明, 渡辺裕

特許権
画像符号化装置、画像符号化方法および画像符号化プログラム

5035926

渡辺裕, 石川孝明

特許権
画像符号化装置、画像符号化方法および画像符号化プログラム

5076150

渡辺裕, 石川孝明

特許権

現在担当している科目

卒業論文Ａ（秋学期）

基幹理工学部

2026年秋学期
卒業論文Ａ　18前再　【前年度成績S評価者用】

基幹理工学部

2026年春学期
卒業論文Ｂ　18前再　【前年度成績S評価者用】

基幹理工学部

2026年秋学期
卒業論文Ｂ（春学期）　18前再

基幹理工学部

2026年春学期
卒業論文Ａ（秋学期）　18前再

基幹理工学部

2026年秋学期
卒業論文Ｂ　18前再

基幹理工学部

2026年秋学期
卒業論文Ａ　18前再

基幹理工学部

2026年春学期
情報理工学実験Ａ　【前年度成績S評価者用】

基幹理工学部

2026年秋学期
情報理工学実験Ａ

基幹理工学部

2026年秋学期
情報理工学実験Ｂ【前年度成績S評価者用】

基幹理工学部

2026年春学期
卒業論文Ａ　（集中）

基幹理工学部

2026年集中（春・秋学期）
情報理工学実験Ｂ

基幹理工学部

2026年春学期
プロジェクト研究Ｂ

基幹理工学部

2026年秋学期
プロジェクト研究Ａ

基幹理工学部

2026年春学期
マルチメディア工学Ｂ

基幹理工学部

2026年秋学期
卒業論文Ｂ（春学期）

基幹理工学部

2026年春学期
卒業論文Ｂ

基幹理工学部

2026年秋学期
卒業論文Ａ

基幹理工学部

2026年春学期
卒業論文Ａ　（集中）

基幹理工学部

2026年集中（春・秋学期）
卒業論文Ａ（秋学期）

基幹理工学部

2026年秋学期
卒業論文Ａ

基幹理工学部

2026年春学期
卒業論文Ｂ　18前再　【前年度成績S評価者用】

基幹理工学部

2026年秋学期
情報通信実験Ａ

基幹理工学部

2026年秋学期
卒業論文Ｂ（春学期）　18前再

基幹理工学部

2026年春学期
マルチメディア工学Ｂ【前年度成績S評価者用】

基幹理工学部

2026年秋学期
マルチメディア工学Ｂ

基幹理工学部

2026年秋学期
情報通信実験Ａ　【前年度成績S評価者用】

基幹理工学部

2026年秋学期
プロジェクト研究Ａ

基幹理工学部

2026年春学期
卒業論文Ｂ　18前再

基幹理工学部

2026年秋学期
卒業論文Ａ　18前再　【前年度成績S評価者用】

基幹理工学部

2026年春学期
卒業論文Ｂ

基幹理工学部

2026年秋学期
卒業論文Ａ（秋学期）　18前再

基幹理工学部

2026年秋学期
卒業論文Ａ　18前再

基幹理工学部

2026年春学期
プロジェクト研究Ｂ

基幹理工学部

2026年秋学期
卒業論文Ｂ（春学期）

基幹理工学部

2026年春学期
情報通信実験Ｂ【前年度成績S評価者用】

基幹理工学部

2026年春学期
情報通信実験Ｂ

基幹理工学部

2026年春学期
Multimedia Systems B

基幹理工学部

2026年秋学期
Project Research Spring

基幹理工学部

2026年春学期
Introduction to Computers and Networks

基幹理工学部

2026年春学期
Computer Science and Communications Engineering Laboratory B

基幹理工学部

2026年春学期
Image Processing

基幹理工学部

2026年春学期
Graduation Thesis A (Spring) [S Grade]

基幹理工学部

2026年春学期
Graduation Thesis A　(Spring)[S Grade]【For students enrolled before 2022】

基幹理工学部

2026年春学期
Graduation Thesis A (Fall)

基幹理工学部

2026年秋学期
Graduation Thesis A (Fall) [S Grade]

基幹理工学部

2026年秋学期
Graduation Thesis A (Spring)

基幹理工学部

2026年春学期
Graduation Thesis A　(Fall)[S Grade]【For students enrolled before 2022】

基幹理工学部

2026年秋学期
Graduation Thesis A　(Spring)【For students enrolled before 2022】

基幹理工学部

2026年春学期
Graduation Thesis B (Spring) [S Grade]

基幹理工学部

2026年春学期
Graduation Thesis A　(Fall)【For students enrolled before 2022】

基幹理工学部

2026年秋学期
Graduation Thesis B (Fall) [S Grade]

基幹理工学部

2026年秋学期
Graduation Thesis B (Fall)

基幹理工学部

2026年秋学期
Graduation Thesis B (Spring)

基幹理工学部

2026年春学期
Computer Science and Communications Engineering Laboratory A [S Grade]

基幹理工学部

2026年秋学期
Computer Science and Communications Engineering Laboratory A

基幹理工学部

2026年秋学期
Project Research Fall

基幹理工学部

2026年秋学期
修士論文（情報・通信）

大学院基幹理工学研究科

2026年通年
Master's Thesis (Department of Computer Science and Communications Engineering)

大学院基幹理工学研究科

2026年通年
Special Laboratory A in Computer Science and Communications Engineering

大学院基幹理工学研究科

2026年春学期
Multimedia and Standards

大学院基幹理工学研究科

2026年秋学期
Special Laboratory B in Computer Science and Communications Engineering

大学院基幹理工学研究科

2026年秋学期
Research on Audiovisual Information Processing

大学院基幹理工学研究科

2026年通年
オーディオビジュアル情報処理演習C

大学院基幹理工学研究科

2026年春学期
オーディオビジュアル情報処理演習B

大学院基幹理工学研究科

2026年秋学期
オーディオビジュアル情報処理演習A

大学院基幹理工学研究科

2026年春学期
オーディオビジュアル情報処理演習D

大学院基幹理工学研究科

2026年秋学期
オーディオビジュアル情報処理研究

大学院基幹理工学研究科

2026年通年
情報理工・情報通信特別実験B

大学院基幹理工学研究科

2026年秋学期
情報理工・情報通信特別実験A

大学院基幹理工学研究科

2026年春学期
マルチメディアシステムと国際標準

大学院基幹理工学研究科

2026年秋学期
情報理工・情報通信特別演習Ｂ

大学院基幹理工学研究科

2026年秋学期
情報理工・情報通信特別演習Ａ

大学院基幹理工学研究科

2026年春学期
オーディオビジュアル情報処理研究

大学院基幹理工学研究科

2026年通年
Seminar on Audiovisual Information Processing D

大学院基幹理工学研究科

2026年秋学期
Seminar on Audiovisual Information Processing C

大学院基幹理工学研究科

2026年春学期
Seminar on Audiovisual Information Processing B

大学院基幹理工学研究科

2026年秋学期
Seminar on Audiovisual Information Processing A

大学院基幹理工学研究科

2026年春学期

▼全件表示

他学部・他研究科等兼任情報

理工学術院大学院基幹理工学研究科
附属機関・学校グローバル・エデュケーション・センター

学内研究所・附属機関兼任歴

2024年

-

2026年

理工学術院総合研究所兼任研究員

特定課題制度（学内資金）

機械認識用画像符号化と拡散モデルを統合した 3D 動画像生成基盤

2025年

　概要を見る

映像から対象物の3次元形状を復元する技術は、コンピュータビジョン分野における主要な研究課題であり、3次元再構成と呼ばれる。再構成結果はメッシュやガウシアンが出力される場合もあるが、最も多いケースは点群としての表現である。点群から 3D モデルを生成する需要が高く、点群の表面再構成は重要な課題となっている。近年は深層学習を用いた手法が多数提案され、特に空間中の各点に距離などのスカラー値を割り当てる implicit neural representation が大きな影響力を持っている。しかし、これらは物体の内外の定義に依存するため、エッジや開境界をもつ非密閉形状の表現が難しい。これに対し、表面までの最短距離を符号なしで表すUnsigned Distance Function (UDF) は内外判定を必要とせず、密閉・非密閉の双方に対応できるため注目されている。LoSF-UDF は点群を小さな局所領域に分割して高精度な再構成を実現したが、局所領域の大きさが固定であるため、曲率の大きい部分では複数の面や層が混在しやすく、局所的な点密度も不足し、幾何構造を忠実に捉えにくいという問題があった。そこで本研究では、幾何学的複雑さに応じて局所領域サイズを適応的に変更する表面再構成法を提案する。複雑さは曲率で評価し、それに応じてパッチ半径を調整する。また、UDF のクエリ点は基本的に 1283 の一様格子上に配置し、高曲率領域のみ 2563 に局所高密度化する。さらにリサンプリングも曲率に基づいて行う。これにより、複雑領域での層の混在を抑え、平滑領域では十分な情報を確保しつつ、平均計算量を削減しながら安定した UDF 推定と高品質な表面再構成を実現した。これらを、InternationalConference on Image Electronics and Visual Computing (IEVC 2026) において発表した。
敵対的生成ネットワークを用いた映像・音楽生成の高精度化

2020年 Ahmad Yunnis Mousssa

　概要を見る

Ahmad Yunis Moussa and Hiroshi Watanabe: “Audio Translation with Conditional Generative Adversarial Networks,” IEEE The 2nd International Conference on Artificial Intelligence in Information and Communication (ICAIIC 2020), 9B-1, Feb. 2020を研究基盤として、深層学習のひとつである敵対的生成ネットワークを用いて音響信号を合成する際の音質改善を図った。
機械学習による人物運動解析の高精度化

2019年 Khin Sabai Htwe, 稲田健太郎, 大澤遼平, 山川敦也

　概要を見る

本研究では，映像のみから人物の姿勢情報を機械学習により推定し，推定された人物の部位座標ベクトルを目的に応じて処理することにより，動作認識や個人認識の識別率を向上させることに成功した．体操競技において倒立状態の姿勢推定を可能にした．ダンスのような集団行動における動作タイミングの一致度を推定する手法について提案した．さらに，手話解析への適用を行い，映像上で手の重なりがない条件下での認識率を改善した．また，野球における投球動作解析のように同一シーンの整合が必要となる場合に，推定された部位座標を基に特徴ベクトルを生成することで，時系列整合アルゴリズムを用いて同一シーンの自動抽出を行えることを示した．
マンガ要約生成の自動化に関する研究

2016年柳澤秀彰

　概要を見る

本研究では，Faster R-CNN(Regions with CNN features)を使用したマンガコンテンツの検出およびコマ内容の認識を提案した．コマやフキダシ，キャラクターといったオブジェクトは作品毎の形状変化が大きいため，既定の画像特徴量を用いた検出は困難である．従って，畳み込みニューラルネットワーク(CNN: Convolutional Neural Network)によって生成される特徴量のマンガへの適用について検討した．また，マンガコンテンツに固有な物体の配置に対応するため，複数の1クラス検出器の組み合わせによる検出手法を提案した．実験結果では，未知の5作品のマンガ画像に対する検出率とコマ構造認識率の観点から提案手法を評価した．その結果，52.8 - 84.9%の精度でコマ構造の認識が可能であることが分かった．
顔画像認識に基づいた動物の品種識別

2016年渡部宏樹

　概要を見る

本研究では，動物の顔画像に基づく品種識別性能向上のために，DCGANを用いたCNNのデータオーギュメンテーション手法を提案した．実験では，猫および犬の顔画像を対象とし，提案手法の性能を品種識別の正解率により評価した．その結果，DCGANを用いた類似画像追加と従来の画像変換によるデータオーギュメンテーション手法の組み合わせにより，猫の場合で0.8%，犬の場合で1.3%の正解率の向上を確認した．さらに，生成画像の品質が提案手法に及ぼす影響についての調査を行った．実験の結果，品質の良い画像を加える手法の方がより高い性能を発揮することが分かった．
超高精細ビデオ符号化のための動きベクトル検出の研究

2013年

　概要を見る

本研究では，動き推定誤差データの曲面近似に基づく画素補間が不要な小数画素精度動き推定手法を提案した．実験結果では，提案手法の性能をPSNRとビットレートの観点から評価した．その結果，MPEG-4 AVC (Advanced Video Coding) 及びMPEG-4 HEVC (High Efficiency Video Coding) のいずれの符号化方式と組み合わせても，R-D性能の点から劣化がほとんどなく且つ演算量が14-46%削減されることが分かった．まず，整数画素精度の動き推定マッチング誤差を，縦横画素位置を変数とする2次元データとして取扱い誤差曲面を生成した．次に，その曲面に対する数学的モデルとして，9個あるいは6個のパラメタからなる放物線関数(パラボリック形式)を用いた．予備実験を行い，実験から得られた誤差曲面とモデル関数とを照合した．その結果，曲面のある小領域では，中心位置の誤差演算時において45度方向に位置する画素の誤差からの寄与が非常に少ないことが分かった．したがって，中心位置とその縦横４画素からなる誤差に与える５個の係数（パラメタ）からなるモデルにより，十分誤差を近似できることが分かった．さらにより簡単な縦横分離形式でも十分な精度で近似できることが分かった．この１次元放物線を決定するパラメタの定義域についても検討した．また，放物線と比べてより自由度の高いいくつかのスプライン曲線の導入について検討した．スプライン曲線には，Cardinal-Spline, Hermite-Spline, Bézier-Spline, Non-Uniform Rational B-Spline (NURBS) などがある．このうち，3点の制御点で描ける2次形式のBézier-Splineによる誤差表面近似に基づく少数画素精度動き推定手法を提案した．実験の結果，整数画素精度の動き推定にBézier-Splineに基づく提案手法を組み合わせた方式は，より精度の細かなベクトルを選ぶ傾向にあることがわかった．Bézier-Splineの曲線を規定する係数の導出において、しきい値により場合分けすると，より性能が向上することが分かった．そこで提案方式をMPEG-HEVCと組み合わせ，その性能評価を行った．HEVCはAVCと比較して，符号化画像が同一の品質であれば，符号化レートが50%に削減できる超高圧縮符号化方式であり，2013年1月に仕様が凍結された最新の方式である．AVCは現在では幅広く普及しており，Blu-Ray Diskやハンディカム用のAVCHDで用いられている高圧縮符号化方式である．AVCでは動き補償のためのブロックサイズは主に16x16画素であったが，HEVCでは動き補償のためのブロックサイズが64x64画素にまで拡大可能となっている．したがってBézier-Splineの曲線を決定するしきい値を調節することとした．AVC，HEVCいずれの方式と組み合わせても，提案方式はR-Dの観点から劣化はなく，演算量が14-46%に低減できることが分かった．また処理時間をCPU time (msec) で評価した場合でも，13-36%に低減できることが分かった．したがって本提案方式は，8k画像 (水平画素数8000) のように超高精細に向かいつつある今後のテレビジョン画像を対象とする符号化器や，モバイル端末のようにCPU能力が低い符号化器において，演算処理量の低減に大きく貢献する技術である．

▼全件表示