Spatiotemporal Deformable Part Models for Action Detection

Source

Evernote/Papers/Spatiotemporal Deformable Part Models for Action Detection.md

Summary

이 논문은 2D 이미지에서 성공적인 Deformable Part Models(DPM)을 3D 시공간 볼륨으로 확장하여 비디오 액션 감지에 적용한다. 각 액션을 시공간 패턴으로 취급하여 가장 판별력 있는 3D 서브볼륨을 부분(parts)으로 자동 선택하고, 이들 간의 시공간 관계를 학습한다. 이를 통해 클래스 내 변이에 적응하고 배경 잡음에 강인하며, 여러 비디오 데이터셋에서 액션 분류 및 위치 파악 성능을 입증했다.

Key Points

2D DPM을 3D 시공간 영역으로 일반화하여 비디오 액션 감지 적용
액션별 가장 특징적인 3D 서브볼륨을 부분으로 자동 선택 및 관계 학습
클래스 내 변이 적응 및 배경 잡음에 대한 강인성 확보
다양한 비디오 데이터셋에서 분류 및 위치 파악 성능 검증

Fast, Accurate Detection of 100,000 Object Classes on a Single Machine (Technical Supplement)
Understanding Indoor Scenes using 3D Geometric Phrases
Dynamic Time Warping for Music Conducting Gestures Evaluation
드론을 위한 엣지 기반 실시간 비디오 분석
Egocentric Field-of-View Localization Using First-Person Point-of-View Devices
3DNN: Viewpoint Invariant 3D Geometry Matching for Scene Understanding
다중 특징 분석 및 시맨틱 컨텍스트 학습을 통한 이미지 분류
적응형 온톨로지 규칙을 이용한 보상-처벌 기반 개념 탐지
Weakly Supervised Learning of Object Segmentations from Web-Scale Video
실내 장면의 의미적 및 기하학적 상호작용 학습을 위한 판별 모델
PRIME: 단입자 Cryo-EM을 위한 확률적 초기 3D 모델 생성
인간 동작 분석을 위한 특이값 분해(SVD) 기반 지식 획득 방법
Recursive Sparse Spatiotemporal Coding
다중 모드 특징 표현 및 시간 피라미드 매칭을 통한 콘텐츠 기반 복제 탐지
Content vs. Context: Video Landmark Retrieval
지오태그 이미지로부터 장면 위치 식별 (Identification of scene locations from geotagged images)
개인 사진 컬렉션에서의 공동 노이즈 레벨 추정
A Hamming Embedding Kernel with Informative Bag-of-Visual Words for Video Semantic Indexing
움직이며 소리를 내는 객체의 식별 및 분할을 위한 다중모달 분석
GPSView: 경치 중심의 운전 경로 계획 시스템
Efficient Closed-Form Solution to Generalized Boundary Detection
Coordinated Multi-Device Presentations: Ambient-Audio Identification
스테가노그래피를 활용한 카메라
3D 객체 검색을 위한 시맨틱 시그니처 학습
Robust and accurate mobile visual localization and its applications
Attribute-Augmented Semantic Hierarchy (A2SH) for CBIR
MS의 상황 인지형 자동 무음 모드 기술
A Top-Down Approach for Video Summarization
PD-Link: 카메라와 모바일 디바이스 연결 프레임워크
관성 항법 시스템(INavigation Systems) 소개
고양이 사진 메타데이터를 통한 위치 추적 프로젝트
Online Estimation of Evolving Human Visual Interest
Intel Research: Context Awareness - Social Proximity Detection
시간적 이미지 시퀀스를 위한 최적화된 만화 스토리텔링 시스템
Video Snippets
Smooth Nonnegative Matrix Factorization for Unsupervised Audiovisual Document Structuring
Sparse Hashing (SH) for Fast Multimedia Search
K-RBMs를 이용한 다중 비선형 부분공간 학습
Query-Adaptive Image Search With Hash Codes
화자 추적 기반 영상 자막 배치 (Speaker-Following Video Subtitles)
이마트 ‘세일 네비게이션’ (지능형 카트)
언어 독립적 시간 표현 판별적 파싱 (Language-Independent Discriminative Parsing of Temporal Expressions)
Supervised Robust Discrete Multimodal Hashing (SRDMH)
스마트폰 기반 실시간 TV 채널 인식 기술 (IRTR)
3D 센싱 기술, 모바일로 향하다
대규모 다중 라벨 전파를 위한 효율적인 희소 그래프 구성
Cross-Media Tag Transfer (CMTT): 이미지에서 비디오로 태그 지식 이전
Cross-Domain Feature Learning in Multimedia
Fast Near-Duplicate Image Detection Using Uniform Randomized Trees
정보 기하학을 통한 순수 고차 단어 연관성 마이닝
웹캠의 지리적 통합 및 보정 (Web-accessible geographic integration and calibration of webcams)
Discriminative Segment Annotation in Weakly Labeled Video
Transfer Joint Embedding for Cross-Domain NER
Structured Streaming Skeleton (SSS): 온라인 인간 제스처 인식용 새로운 특징 추출 방법
Interactive Image Tagging을 위한 인간 라벨링 최적화
실내 이동 객체를 위한 거리 기반 조인 (Distance-Aware Join for Indoor Moving Objects)
확산형 실내 광무선 통신 링크의 심볼간 간섭 완화를 위한 분류기 비교 연구
Script-to-Movie: 스크립트 기반 자동 영화 생성 프레임워크
Continuous Birdsong Recognition Using Gaussian Mixture Modeling of Image Shape Features
다중 Kinect 기반 실시간 3D 재구성
토론토대 연구진의 실시간 HDR 비디오 기술 (2013)
iVector-based Acoustic Data Selection
Point Representation for Local Optimization: Towards Multi-Dimensional Gray Codes
차등 데이터를 기반으로 렌더링 효율을 고려한 3D 트리의 압축
DOM 구조 지식 기반 모델을 이용한 반구조화 웹 레코드 강건한 탐지
Smartphones Based Crowdsourcing for Indoor Localization
MOWL: 웹 기반 멀티미디어 애플리케이션을 위한 온톨로지 표현 언어
TRECVID 기반 콘텐츠 기반 비디오 복사 탐지 벤치마킹
인간 행동 인식용 온톨로지 조사
Latent Mixture of Discriminative Experts (LMDE)
All Smiles: 얼굴 표정 분석을 통한 자동 사진 보정
A survey on ear biometrics
Robust Localization From Incomplete Local Information
이미지 주석 및 검색을 위한 Feature-Word-Topic 모델

AncomWiki

탐색기

Spatiotemporal Deformable Part Models for Action Detection

Spatiotemporal Deformable Part Models for Action Detection

Source

Summary

Key Points

그래프 뷰

목차

백링크

AncomWiki

탐색기

Spatiotemporal Deformable Part Models for Action Detection

Spatiotemporal Deformable Part Models for Action Detection

Source

Summary

Key Points

Related

그래프 뷰

목차

백링크