研究者詳細 - 青木　義満

　

教職員略歴

　

研究分野・研究テーマ

　研究分野

　

研究活動

　

教育活動

　担当授業科目

　

社会活動

2026/01/23 更新

青木　義満（アオキ　ヨシミツ）

Aoki, Yoshimitsu

写真a

所属（所属キャンパス）: 理工学部電気情報工学科（矢上）

職名: 教授

HP

http://www.aoki-medialab.jp/

特記事項: 教授

外部リンク

このページの先頭へ▲

総合紹介【表示／非表示】

・1999年04月-2001年03月早稲田大学理工学部　応用物理学科助手　橋本周司教授の研究室において、顔画像認識・合成、工業用精密画像計測、　ヒューマノイドロボットの視覚システムに関する研究に従事．・2002年04月-2005年03月芝浦工業大学工学部情報工学科専任講師（青木研究室発足）　2005年04月-2008年3月芝浦工業大学工学部情報工学科　准教授　顔形状・動作の３次元画像解析技術の医学・歯学応用　衛星画像他リモートセンシングデータの統合活用に関する研究　道路交通画像システム，高精度画像計測システムに関する研究等に従事．　※芝浦工業大学にて、７年間で約９０名の学生の研究指導を担当・2008年04月-現在　慶應義塾大学理工学部電子工学科　准教授　人物を対象とした画像計測・認識技術、及び応用システムに関する研究．　応用先として，セキュリティ，マーケティング，医療・福祉，美容，インターフェース，エンターテイメント，自動車，等を視野に入れ，幅広い産業応用を目指す．　人の認知機構や感性を考慮したメディア理解技術とその応用，新しい視覚センサ，ロバスト画像特徴量に関する研究等に従事．・2013年2月-現在　株式会社イデアクエスト　取締役兼任　慶應理工発画像センシング技術の医療分野での実用化を目指している．

このページの先頭へ▲

経歴【表示／非表示】

1999年04月

-

2002年03月

早稲田大学, 理工学部　, 助手
2002年04月

-

2005年03月

芝浦工業大学　, 工学部　情報工学科, 専任講師
2005年04月

-

2008年03月

芝浦工業大学, 工学部　情報工学科, 助教授（2007より准教授）
2008年04月

-

2017年03月

慶應義塾大学, 理工学部, 准教授
2013年02月

-

2017年03月

株式会社イデアクエスト, 取締役

全件表示 >>

このページの先頭へ▲

学歴【表示／非表示】

1996年03月

早稲田大学, 理工学部, 応用物理学科

大学, 卒業
1998年03月

早稲田大学, 理工学研究科, 物理学及応用物理学専攻

大学院, 修了, 修士
2001年02月

早稲田大学, 理工学研究科, 物理学及応用物理学専攻

大学院, 修了, 博士

このページの先頭へ▲

学位【表示／非表示】

博士（工学）, 早稲田大学, 課程, 2001年02月

このページの先頭へ▲

研究分野【表示／非表示】

ものづくり技術（機械・電気電子・化学工学） / 計測工学（Measurement Engineering）
情報通信 / データベース（メディア情報学・データベース）
情報通信 / 知覚情報処理（知覚情報処理・知能ロボティクス）
ライフサイエンス / 医用システム（Medical Systems）

このページの先頭へ▲

著書【表示／非表示】

画像センシングのしくみと開発がしっかりわかる教科書

青木義満，輿水大和他, 技術評論社, 2023年06月, ページ数： 239
顔の百科事典

丸善出版, 2015年09月

担当範囲: 7 章コンピュータと顔 ─顔の情報学─

　概要を見る

顔を見ない日はないというくらい、「顔」は私達にとってあたり前の存在ですが、私達は一体どれほど「顔」のことを知っているのでしょうか。そのような「顔」を総合的に研究するのが「顔学」です。顔学には、動物学や人類学をはじめ、解剖学、生理学、歯学、心理学、社会学の文化的な対象として扱われるだけでなく、演劇や美術などの芸術学、コンピュータの分野では、情報学、さらに、美容学、人相学など、実に多様な学問分野と関係しています。本書では、私達と切り離すことのできない「顔」の、歴史的・文化的・社会的・科学的側面を中項目の事典としてまとめられていることにより、多様な分野を横断する知識にも容易にアクセスが可能になっています。日本顔学会創立20周年記念出版として、「顔学」について体系化を行った、初めての百科事典です。
三次元画像センシングの新展開

青木義満, NTS, 2015年05月

担当範囲: 第5章1節　色情報とレンジデータのフュージョンによる高分解能三次元レンジセンサの開発
電気学会125年史

青木義満，秦　清治, 電気学会, 2013年05月
電気学会125年史

青木義満, 電気学会, 2013年05月

全件表示 >>

このページの先頭へ▲

論文【表示／非表示】

A Comprehensive Analysis of a Social Intelligence Dataset and Response Tendencies Between Large Language Models (LLMs) and Humans

Mori E., Qiu Y., Kataoka H., Aoki Y.

Sensors 25 （ 2 ） 2025年01月

　概要を見る

In recent years, advancements in the interaction and collaboration between humans and have garnered significant attention. Social intelligence plays a crucial role in facilitating natural interactions and seamless communication between humans and Artificial Intelligence (AI). To assess AI’s ability to understand human interactions and the components necessary for such comprehension, datasets like Social-IQ have been developed. However, these datasets often rely on a simplistic question-and-answer format and lack justifications for the provided answers. Furthermore, existing methods typically produce direct answers by selecting from predefined choices without generating intermediate outputs, which hampers interpretability and reliability. To address these limitations, we conducted a comprehensive evaluation of AI methods on a video-based Question Answering (QA) benchmark focused on human interactions, leveraging additional annotations related to human responses. Our analysis highlights significant differences between human and AI response patterns and underscores critical shortcomings in current benchmarks. We anticipate that these findings will guide the creation of more advanced datasets and represent an important step toward achieving natural communication between humans and AI.
- Access to Document (DOI)
DynamicVLN: Incorporating Dynamics into Vision-and-Language Navigation Scenarios

Sun Y., Qiu Y., Aoki Y.

Sensors 25 （ 2 ） 2025年01月

　概要を見る

Traditional Vision-and-Language Navigation (VLN) tasks require an agent to navigate static environments using natural language instructions. However, real-world road conditions such as vehicle movements, traffic signal fluctuations, pedestrian activity, and weather variations are dynamic and continually changing. These factors significantly impact an agent’s decision-making ability, underscoring the limitations of current VLN models, which do not accurately reflect the complexities of real-world navigation. To bridge this gap, we propose a novel task called Dynamic Vision-and-Language Navigation (DynamicVLN), incorporating various dynamic scenarios to enhance the agent’s decision-making abilities and adaptability. By redefining the VLN task, we emphasize that a robust and generalizable agent should not rely solely on predefined instructions but must also demonstrate reasoning skills and adaptability to unforeseen events. Specifically, we have designed ten scenarios that simulate the challenges of dynamic navigation and developed a dedicated dataset of 11,261 instances using the CARLA simulator (ver.0.9.13) and large language model to provide realistic training conditions. Additionally, we introduce a baseline model that integrates advanced perception and decision-making modules, enabling effective navigation and interpretation of the complexities of dynamic road conditions. This model showcases the ability to follow natural language instructions while dynamically adapting to environmental cues. Our approach establishes a benchmark for developing agents capable of functioning in real-world, dynamic environments and extending beyond the limitations of static VLN tasks to more practical and versatile applications.
- Access to Document (DOI)
Data Collection-Free Masked Video Modeling

Ishikawa Y., Kondo M., Aoki Y.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 15069 LNCS 37 - 56 2025年

ISSN 03029743

　概要を見る

Pre-training video transformers generally requires a large amount of data, presenting significant challenges in terms of data collection costs and concerns related to privacy, licensing, and inherent biases. Synthesizing data is one of the promising ways to solve these issues, yet pre-training solely on synthetic data has its own challenges. In this paper, we introduce an effective self-supervised learning framework for videos that leverages readily available and less costly static images. Specifically, we define the Pseudo Motion Generator (PMG) module that recursively applies image transformations to generate pseudo-motion videos from images. These pseudo-motion videos are then leveraged in masked video modeling. Our approach is applicable to synthetic images as well, thus entirely freeing video pre-training from data collection costs and other concerns in real data. Through experiments in action recognition tasks, we demonstrate that this framework allows effective learning of spatio-temporal features through pseudo-motion videos, significantly improving over existing methods which also use static images and partially outperforming those using both real and synthetic videos. These results uncover fragments of what video transformers learn through masked video modeling.
- Access to Document (DOI)
Unsupervised Metric Learning for Expressing Color and Shape Information to Uncover Abstract Connections within Image Datasets

Obikane S., Tagawa H., Aoki Y.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 15321 LNCS 15 - 30 2025年

ISSN 03029743

　概要を見る

In this research, we propose a novel approach using unsupervised metric learning tailored to datasets characterized by complex similarities and connections, such as those found in paintings and makeup, which are challenging to express linguistically. These datasets often present the difficulty of adequately analyzing data points due to the intricate interplay of defining elements, a limitation of traditional labeling methods. Additionally, the high degree of specialization required makes annotation significantly costly. Unsupervised metric learning emerges as a powerful method for extracting more cost-effective features and for the comprehensive analysis of these datasets. Expanding upon previous research that utilized style transfer models, our study further explores feature design, specifically focusing on extracting detailed information about critical aspects of similarity assessment, such as color and shape. Our model adeptly incorporates visual information, unveiling the hidden abstract connections within datasets. We validated our approach using a dataset of Ukiyo-e, a genre of Japanese painting, and achieved accuracy comparable to supervised learning models. This research opens up new possibilities for the analysis of complex image datasets with abstract relational depth, fostering a deeper understanding of the data.
- Access to Document (DOI)
Rethinking Image Super-Resolution from Training Data Perspectives

Ohtani G., Tadokoro R., Yamada R., Asano Y.M., Laina I., Rupprecht C., Inoue N., Yokota R., Kataoka H., Aoki Y.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 15075 LNCS 19 - 36 2025年

ISSN 03029743

　概要を見る

In this work, we investigate the understudied effect of the training data used for image super-resolution (SR). Most commonly, novel SR methods are developed and benchmarked on common training datasets such as DIV2K and DF2K. However, we investigate and rethink the training data from the perspectives of diversity and quality, thereby addressing the question of “How important is SR training for SR models?”. To this end, we propose an automated image evaluation pipeline. With this, we stratify existing high-resolution image datasets and larger-scale image datasets such as ImageNet and PASS to compare their performances. We find that datasets with (i) low compression artifacts, (ii) high within-image diversity as judged by the number of different objects, and (iii) a large number of images from ImageNet or PASS all positively affect SR performance. We hope that the proposed simple-yet-effective dataset curation pipeline will inform the construction of SR datasets in the future and yield overall better models. Code is available at: https://github.com/gohtanii/DiverSeg-dataset.
- Access to Document (DOI)

全件表示 >>

このページの先頭へ▲

KOARA（リポジトリ）収録論文等【表示／非表示】

機械学習による乱流ビッグデータの特徴抽出手法の構築

青木, 義満

科学研究費補助金研究成果報告書 2020年
目的指向運動における乳幼児の視線制御と微細運動 : 11ヶ月児と18ヶ月児の比較

青木, 義満

慶応義塾大学大学院社会学研究科紀要 : 社会学心理学教育学 : 人間と社会の探究（慶應義塾大学大学院社会学研究科）（ 82 ） 17 - 35 2016年

ISSN 0912456X
産学連携のカタチ

青木, 義満

新版窮理図解（慶應義塾大学理工学部）（ 15 ） 2014年01月
The way industry-and-academia collaboration is conducted

青木, 義満

New Kyurizukai （Faculty of Science and Technology, Keio University）（ 15 ） 2014年01月

このページの先頭へ▲

総説・解説等【表示／非表示】

密集領域での動作を理解するためのハイブリッド型映像解析

大内一成，小林大祐，中州俊信，青木義満

東芝レビュー（東芝） 72 （ 4 ） 30 - 34 2017年09月

機関テクニカルレポート，技術報告書，プレプリント等, 共著
画像センシング技術によるチームスポーツ映像からのプレー解析

林　昌希，青木　義満

映像情報メディア学会誌（映像情報メディア学会） 70 （ 5 ） 710 - 714 2016年09月

記事・総説・解説・論説等（学術雑誌）, 共著
人物行動認識・理解のための画像センシング技術と応用

青木義満

非破壊検査（日本非破壊検査協会） 65 （ 6 ） 254 - 260 2016年06月

記事・総説・解説・論説等（学術雑誌）, 単著
パターン計測技術の深化と広がる産業応用 -総論-

青木義満

計測と制御（計測自動制御学会） 53 （ 7 ） 555 - 556 2014年07月

記事・総説・解説・論説等（学術雑誌）, 単著

このページの先頭へ▲

研究発表【表示／非表示】

自由な表現と被写体の質感を維持するメイク生成モデルの開発

帯金駿, 田川晴菜, 中川雄介, 中村理恵, 青木義満

[国内会議] 第27回日本顔学会大会（フォーラム顔学2022）,

2022年09月
,
口頭発表（一般）
不確実性を考慮したセマンティックマップの生成

竹中悠，森巧磨，谷口恭弘，青木義満

[国内会議] 第27回知能メカトロニクスワークショップ,

2022年09月
,
口頭発表（一般）
重要パッチ選択に基づく効率的動画認識

鈴木智之, 青木義満

[国内会議] 第25回画像の認識・理解シンポジウム（MIRU2022）,

2022年07月
,
ポスター発表
音響信号を用いた人物の3次元姿勢推定

川島穣, 柴田優斗, 五十川麻理子, 入江豪, 木村昭悟, 青木義満

[国内会議] 第25回画像の認識・理解シンポジウム（MIRU2022）,

2022年07月
,
口頭発表（一般）
完全合成画像での学習による文書画像の影除去

松尾祐飛，青木義満

[国内会議] 第28回画像センシングシンポジウム（SSII2022）,

2022年06月
,
ポスター発表

全件表示 >>

このページの先頭へ▲

知的財産権等【表示／非表示】

画像処理装置，画像処理プログラムおよび画像処理方法

出願日： 2019-105297 2019年06月

共同
危険度推定装置，危険度推定方法及び危険度推定用コンピュータプログラム

出願日：特願2015-005241 2015年01月

発行日：特許第6418574号 2018年10月

特許権, 共同

このページの先頭へ▲

受賞【表示／非表示】

HCGシンポジウム2018 特集テーマセッション賞

秋月秀一(慶大)・大木美加・バティストブロー・鈴木健嗣(筑波大)・青木義満(慶大), 2018年12月, 電子情報通信学会ヒューマンコミュニケーショングループ, 床面プロジェクションに伴う動的な環境変化に対応する人物追跡技術

受賞区分：国内学会・会議・シンポジウム等の賞
HCGシンポジウム2018 優秀インタラクティブ発表賞

秋月秀一(慶大)・大木美加・バティストブロー・鈴木健嗣(筑波大)・青木義満(慶大), 2018年12月, 電子情報通信学会ヒューマンコミュニケーショングループ, 床面プロジェクションに伴う動的な環境変化に対応する人物追跡技術

受賞区分：国内学会・会議・シンポジウム等の賞
精密工学会沼田記念論文賞

加藤直樹，箱崎浩平，里雄二，古山純子，田靡雅基，青木ヨシミツ, 2018年03月, 精密工学会, 畳み込みニューラルネットワークによる距離学習を用いた動画像人物再同定

受賞区分：国内学会・会議・シンポジウム等の賞
IWAIT2018 Best Paper Award

Ryunosuke Kurose, Masaki Hayashi, Yoshimitsu Aoki, 2018年01月, IWAIT2018

受賞区分：国内外の国際的学術賞
IES-KCIC2017 Best Paper Award

Siti Nor Khuzaimah Amit, Yoshimitsu Aoki, 2017年09月, IEEE Indonesia Section, Disaster Detection from Aerial Imagery with Convolutional Neural Network

受賞区分：国内外の国際的学術賞

全件表示 >>

このページの先頭へ▲

担当授業科目【表示／非表示】

電気情報工学セミナーⅡ

2025年度
電気情報工学セミナーⅠ

2025年度
電気情報工学輪講

2025年度
電気情報工学実験第２

2025年度
総合デザイン工学課題研究

2025年度

全件表示 >>

このページの先頭へ▲

社会活動【表示／非表示】

画像情報教育振興協会

2013年07月

-

2015年03月
独立行政法人交通安全環境研究所

2009年12月

-

2012年03月

このページの先頭へ▲

所属学協会【表示／非表示】

International Symposium on Optomechatronic Technologies 2013,

2013年04月

-

2013年11月
International Workshop on Advanced Image Technology 2013(IWAIT2013),

2013年01月

-

2013年09月
11th International Conference on Quality Control by Artificial Vision(QCAV2013),

2012年12月

-

2013年05月
3rd International Conference on 3D Body Scanning Technologies,

2012年06月

-

2012年10月
計測自動制御学会パターン計測部会,

2012年04月

-

継続中

全件表示 >>

このページの先頭へ▲

委員歴【表示／非表示】

2017年04月

-

継続中

NEDO技術委員, NEDO
2016年07月

-

2016年11月

Optics & Photonics Japan 2016　推進委員, 日本光学会
2016年07月

-

2016年12月

Program committee member, International Workshop on Human Tracking and Behavior Analysis 2016
2015年09月

-

2016年08月

第22回画像センシングシンポジウム　実行委員長, 画像センシング技術研究会
2014年09月

-

2015年08月

第21回画像センシングシンポジウム　実行委員長, 画像センシング技術研究会

全件表示 >>

このページの先頭へ▲