1 Star 0 Fork 11

TYLove516 / PaddleVideo

forked from PaddlePaddle / PaddleVideo 
加入 Gitee
与超过 1200万 开发者一起发现、参与优秀开源项目,私有仓库也完全免费 :)
免费加入
克隆/下载
README_en.md 10.32 KB
一键复制 编辑 原始数据 按行查看 历史
huangjun12 提交于 2022-05-25 07:18 . init commit

简体中文 | English

PaddleVideo

Update:

  • add skeleton-base action recognition model CTR-GCN.
  • add lite action recognition model MoViNet.
  • add temporal segment model MS-TCN, ASRF.

​ 💖 Welcome to scan the code and join the group discussion 💖

  • Scan the QR code below with your Wechat and reply "video", you can access to official technical exchange group. Look forward to your participation.

Introduction

python version paddle version

PaddleVideo is a toolset for video tasks prepared for the industry and academia. This repository provides examples and best practice guildelines for exploring deep learning algorithm in the scene of video area.


Model and Applications

Model zoo

Action recognition method
PP-TSM (PP series) PP-TSN (PP series) PP-TimeSformer (PP series) TSN (2D’) TSM (2D')
SlowFast (3D’) TimeSformer (Transformer') VideoSwin (Transformer’) AttentionLSTM (RNN') MoViNet (Lite‘)
Skeleton based action recognition
ST-GCN (Custom’) AGCN (Adaptive') CTR-GCN (GCN‘)
Sequence action detection method
BMN (One-stage')
temporal segment
MS-TCN ASRF
Spatio-temporal motion detection method
SlowFast+Fast R-CNN
Multimodal
ActBERT (Learning') T2VLAD (Retrieval')
Video target segmentation
CFBI (Semi') MA-Net (Supervised')
Monocular depth estimation
ADDS (Unsupervised‘)

Dataset

Action Recognition
Kinetics-400 (Homepage) (CVPR'2017) UCF101 (Homepage) (CRCV-IR-12-01) ActivityNet (Homepage) (CVPR'2015) YouTube-8M (Homepage) (CVPR'2017)
Action Localization
ActivityNet (Homepage) (CVPR'2015)
Spatio-Temporal Action Detection
AVA (Homepage) (CVPR'2018)
Skeleton-based Action Recognition
NTURGB+D (Homepage) (IEEE CS'2016) FSD (Homepage)
Depth Estimation
Oxford-RobotCar (Homepage) (IJRR'2017)
Text-Video Retrieval
MSR-VTT (Homepage) (CVPR'2016)
Text-Video Pretrained Model
HowTo100M (Homepage) (ICCV'2019)

Applications

Applications Descriptions
FootballAction Football action detection solution
BasketballAction Basketball action detection solution
TableTennis Table tennis action recognition solution
FigureSkating Figure skating action recognition solution
VideoTag 3000-category large-scale video classification solution
MultimodalVideoTag Multimodal video classification solution
VideoQualityAssessment Video quality assessment solution
PP-Care 3DMRI medical image recognition solution
EIVideo Interactive video segmentation tool
Anti-UAV UAV detection solution
AbnormalActionDetection Abnormal action detection solution
PP-Human Action recognition solution for pedestrian analysis scene

Documentation tutorial

Competition

License

PaddleVideo is released under the Apache 2.0 license.

Thanks

Python
1
https://gitee.com/xhd0115/PaddleVideo.git
git@gitee.com:xhd0115/PaddleVideo.git
xhd0115
PaddleVideo
PaddleVideo
master

搜索帮助