TimeBalance CVPR 2023 - Detailed Analysis & Overview

TimeBalance [CVPR 2023]

[CVPR 2023] Text-Visual Prompting for Efficient 2D Temporal Video Grounding

In this paper, we study the problem of temporal video grounding (TVG), which aims to predict the starting/ending time points of ...

[CVPR 2023] How Can Objects Help Action Recognition?

Presentation for

CVPR 2023 - TempSAL - Uncovering Temporal Information for Deep Saliency Prediction

Video of our paper titled: "TempSAL - Uncovering Temporal Information for Deep Saliency Prediction" Project page ...

[CVPR-2023] Advancing Visual Grounding with Scene Knowledge: Benchmark and Method

[CVPR 2023 Highlight] QPGesture Presentation Video

QPGesture: Quantization-Based and Phase-Guided Motion Matching for Natural Speech-Driven Gesture Generation.

CVPR 2023 - Use Your Head: Improving Long-Tail Video Recognition

Project page: https://tobyperrett.github.io/lmr/ Code/models/benchmarks: https://github.com/tobyperrett/lmr-release Paper: ...

[CVPR 2023] Run, Don’t Walk: Chasing Higher FLOPS for Faster Neural Networks

Official video for the

[CVPR 2023] Robust Test-time Adaptation in Dynamic Scenarios

A presentation of our paper "Robust Test-time Adaptation in Dynamic Scenarios", which has been accepted by

[CVPR 2023] TBP-Former Presentation Video

TBP-Former: Learning Temporal Bird's-Eye-View Pyramid for Joint Perception and Prediction in Vision-Centric Autonomous ...

[CVPR 2023] Glocal Energy-based Learning for Few-Shot Open-Set Recognition

Supplementary Video for

CVPR 2023 - Video Test-Time Adaptation for Action Recognition

Project page: https://wlin-at.github.io/vitta arXiv: https://arxiv.org/abs/2211.15393 Home page: https://wlin-at.github.io/ Abstract: ...

(CVPR 2023 Highlight) Learning Video Representations from Large Language Models

Tl;dr: We propose a new approach to video-language representation learning by leveraging pre-trained large language models ...