Loading...
Research Focus

Video Coding for Machines

We develops compression technologies optimized not for human perception, but for machine vision models (detection, segmentation, classification, etc.) — and for frameworks that serve both simultaneously.

FCM: Feature Coding for Machines (MPEG-FCM)

Rather than compressing pixels, FCM compresses intermediate feature maps extracted by neural networks. In a split inference pipeline, an edge device runs the front-end layers of a vision model, compresses the resulting features, and transmits them to a server that completes the task.

Our team has been a core contributor to the MPEG-FCM international standard (MPEG-AI) since its inception, attending every MPEG plenary and submitting proposals that have been adopted across ten successive Feature Coding Test Model (FCTM) versions.

Hybrid Vision Coding (Human & Machine)

A single bitstream that serves both human viewers and machine vision pipelines. Rather than compressing twice (simulcast), a unified codec delivers visual quality for human consumption while preserving task-critical information for AI models. We are coordinating a joint MPEG WG2 contribution with international partners to drive this toward a new international standard.

Video Coding for Machines
Research Timeline

Research Highlights

MPEG-FCM: Feature Coding for Machines — International Standard
2024–Present

We are core contributors to ISO/IEC 23088-2 (MPEG Feature Coding for Machines), the international standard for compressing neural network feature maps in split inference pipelines.

Since the standard’s inception, our proposals have been adopted across 10 FCTM versions — including L-MSFCv2, lightFCTM, PWD, and NN Inner Codec — advancing the test model from v1.0 to the current v9.0. We attend every MPEG plenary meeting, contributing technical proposals, cross-check evaluations, and editor-level contributions to the normative standard text.

Hybrid Vision Coding: A Unified Bitstream for Humans and Machines
2025–Present

We are pioneering hybrid vision coding — a compression framework that delivers both human-viewable reconstruction and machine task performance from a single bitstream, without the overhead of simulcast (compressing twice for two pipelines).

Two interface paradigms are under investigation:

Feature-Input (FCM extension): Machine vision uses decoded features directly; a lightweight human decoder reconstructs a viewable image from the same features.

Image-Input: Shared information is disentangled and recombined to serve both vision and reconstruction paths with high efficiency.

We are coordinating a joint contribution to MPEG WG2 (October 2026) with international partners (KHU, ZJU, SFU) to formally define use cases, requirements, and common test conditions — and to propose an official Exploration Experiment, the first formal step toward a new international standard for hybrid vision coding.

Publications

Related Papers & Contributions

TitleVenueYear
[Container] [FCM] Description on CE 3149th ISO/IEC JTC 1/SC 29 MPEG2025
[Container] [FCM] FCTM Algorithm Description149th ISO/IEC JTC 1/SC 29 MPEG2025
[Container] [FCM] FCTM Algorithm Description151th ISO/IEC JTC 1/SC 29 MPEG2025
[Container] CE4 description149th ISO/IEC JTC 1/SC 29 MPEG2025
[Container] FE1 description149th ISO/IEC JTC 1/SC 29 MPEG2025
[Container] FE2 description149th ISO/IEC JTC 1/SC 29 MPEG2025
[Container] Preliminary WD for FCM149th ISO/IEC JTC 1/SC 29 MPEG2025
[Container] Training Description149th ISO/IEC JTC 1/SC 29 MPEG2025
[Container][FCM] CE4 description150th ISO/IEC JTC 1/SC 29 MPEG2025
[Container][FCM] FE1 description150th ISO/IEC JTC 1/SC 29 MPEG2025
[Container][FCM] FE2 description150th ISO/IEC JTC 1/SC 29 MPEG2025
[Container][FCM] Training Description150th ISO/IEC JTC 1/SC 29 MPEG2025
[Container][FCM] Training Description151th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM] Applying update-flag (m73365) on CE4.3 TCFC (m73580)151th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM] CE 3.1: Anchor Generation150th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM] CE 3.1: Anchor Generation151th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM] CE 3.1: Results on L-MSFC-v2 extension for inter coding149th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM] CE1.1.4: LightFCTM149th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM] CE3 summary report150th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM] CE3 summary report151th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM] CE4 summary report149th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM] CE4 summary report151th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM] Clarifications on the feature unpacking process in both PWD and reference software149th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM] Clarifications regarding the current PWD149th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM] Crosscheck Result for CE 1.1.3149th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM] Crosscheck Result for m71200149th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM] Editorial comments on preliminary WD150th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM] FCTM anchor candidate experiment149th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM] FE1 summary report149th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM] FE2 summary report149th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM] FE2 summary report151th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM] Proposal on CTTC and Anchor for NN-based Coding Mode149th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM][CE4] Summary report150th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM][CTTC/FE2] Removing Kimono and Cactus sequence as mandatory test conditions151th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM][FE1] Performance Evaluation of FCTMv5 Trained with the FE1.1 Training Code149th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM][FE1] Results for training reproducibility experiment from KHU151th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM][FE1] Summary report150th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM][FE2] Crosscheck for FE2 Study3 m72212150th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM][FE2] Crosscheck result for Study1 remote inference151th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM][FE2] Crosscheck result for Study2 remote and split inference151th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM][FE2] Partial results for Study1 split-inference anchor for YOLOX-DarkNet53 with L13 and L37 split point151th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM][FE2] Preliminary Anchor Results for FE2 Study 1 and 2149th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM][FE2] Summary report150th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM][PWD] Issues and solutions for signalling of refinement and channel removal parameters150th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM][PWD] Removing restoration bypass flag and improving description for default topology150th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM][PWD] Software implementation and BD-rate results for m72608 (Signalling improvement and bug fix for reduced feature inverse normalization)151th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM][PWD] Software implementation for automatic composition for default topology151th ISO/IEC JTC 1/SC 29 MPEG2025
[FCM][PWD]Signalling improvement and bug fix for reduced feature inverse normalization150th ISO/IEC JTC 1/SC 29 MPEG2025
FCTM 6.0의 신경망 기반 특징맵 변환 기술 분석방송공학회논문지2025DOI
[Container] [FCM] Description on CE 3148th ISO/IEC JTC 1/SC 29 MPEG2024
[Container] [FCM] FCTM Algorithm Description148th ISO/IEC JTC 1/SC 29 MPEG2024
[Container][FCM] CE4 description148th ISO/IEC JTC 1/SC 29 MPEG2024
[Container][FCM] Description on FE3148th ISO/IEC JTC 1/SC 29 MPEG2024
[Container][FCM] FE1 description148th ISO/IEC JTC 1/SC 29 MPEG2024
[Container][FCM] FE2 description148th ISO/IEC JTC 1/SC 29 MPEG2024
[Container][FCM] PWD for FCM148th ISO/IEC JTC 1/SC 29 MPEG2024
[Container][FCM] Training Description148th ISO/IEC JTC 1/SC 29 MPEG2024
[FCM] CE 1.1.8. L-MSFC-v2 with fine-tuning145th ISO/IEC JTC 1/SC 29 MPEG2024
[FCM] CE 3.1: Results on L-MSFC-v2 extension for inter coding148th ISO/IEC JTC 1/SC 29 MPEG2024
[FCM] CE 3.2.2 L-MSFC-v2145th ISO/IEC JTC 1/SC 29 MPEG2024
[FCM] CE3 summary report148th ISO/IEC JTC 1/SC 29 MPEG2024
[FCM] Crosscheck of m69985 (CE 3.2: Machine Saliency Compression Based on Temporal Single Input for Multiple Output Architecture)148th ISO/IEC JTC 1/SC 29 MPEG2024
[FCM] Crosscheck of m70057 (CE4 related: Non-linear feature transform)148th ISO/IEC JTC 1/SC 29 MPEG2024
[FCM] FE3: Summary report148th ISO/IEC JTC 1/SC 29 MPEG2024
[FCM] KMAC/pixel calculation for FCTM148th ISO/IEC JTC 1/SC 29 MPEG2024
FCM 을 위한 정규화된 융합 특징맵 부호화2024년 한국방송·미디어공학회 하계학술대회2024
MPEG FCM 테스트 모델에 대한 시간적 재 표본화 적용 및 성능 분석방송공학회논문지2024DOI
기계비전을 위한 특징맵 압축 표준화 요구사항 및 실험조건2024년 한국방송·미디어공학회 하계학술대회2024
다중 해상도 특징맵의 효율적 부호화 방안2024년 한국방송·미디어공학회 하계학술대회2024
단일 구조로 다중 비전 모델을 지원하기 위한 FCM 인터페이스 단일화2024년 한국방송·미디어공학회 하계학술대회2024
[FCVCM] E2E approach : Extension of L-MSFC-v2 Intra (m65200) for inter frame coding144th ISO/IEC JTC 1/SC 29 MPEG2023
[FCVCM] Hybrid codec approach : Combination of L-MSFC-v2 Intra (m65200) with VVC144th ISO/IEC JTC 1/SC 29 MPEG2023
[FCVCM] Inter-Layer Feature Resizing for FCVCM143th ISO/IEC JTC 1/SC 29 MPEG2023
[FCVCM] L-MSFC: End-to-End Learnable Multi-Scale Feature Compression143th ISO/IEC JTC 1/SC 29 MPEG2023
[FCVCM] Pareto-fronting with Multiple Inter-Layer Feature Resizing Modes143th ISO/IEC JTC 1/SC 29 MPEG2023
[FCVCM] Response to FCVCM Call for Proposal from Kyung Hee University and ETRI144th ISO/IEC JTC 1/SC 29 MPEG2023
[VCM track 1] [Crosscheck] Crosscheck report on m59576140th ISO/IEC JTC 1/SC 29 MPEG2023
[VCM Track 1] Crosscheck report on m60799138th ISO/IEC JTC 1/SC 29 MPEG2023
A Super-Resolution-Based Feature Map Compression for Machine-Oriented Video CodingAccess2023DOI
End-to-End Learnable Multi-Scale Feature Copression for VCMTransactions on Circuits and Systems for Video Technology2023DOI
MEDO: Minimizing Effective Distortions Only for Machine-Oriented Visual Feature CompressionIEEE International Conference on Visual Communications and Image Processing2023
VVC와 특징맵 융합/재구성 신경망을 이용한 다중 스케일 특징맵 압축 기법2023년 한국방송·미디어공학회 하계학술대회2023
다중 작업 지원을 위한 배치 병합 학습 기반의 특징맵 압축 방법2023년 대한전자공학회 하계학술대회 논문집2023
블록 기반 특징맵 크기 조정을 이용한 DNN 특징맵 압축한국방송·미디어공학회 2022 하계학술대회2022
An Analysis on the Properties of Features against Various Distortions in Deep Neural Networks방송공학회논문지2021DOI
VCM Anchor 성능 평가 및 분석제30회 신호처리 합동학술대회2020
VCM 구조 분석 및 VVC 기반 Feature 부호화 성능 분석제30회 신호처리 합동학술대회2020
Back to About