Image & Video Understanding

We research and innovate in image and video understanding, including object recognition, augmented reality, visual aesthetics, and creativity. Our work directly impacts many of our flagship products and platforms by powering core technologies such as video highlights, labeling and summarization, and search by visual content.

Publications

Paper
|
April 8, 2024
CVPR 2024 - Generative Models for Computer Vision

Salient Object-Aware Background Generation Using Text-Guided Diffusion Models

Paper
|
December 22, 2023
WACV 2023

Unifying Margin-Based Softmax Losses in Face Recognition

Paper
|
July 5, 2023
AdKDD 2023

Staging E-Commerce Products for Online Advertising Using Retrieval Assisted Image Generation

Paper
|
July 22, 2022
IEEE International Conference on Image Processing (ICIP) 2022

Temporally Precise Action Spotting in Soccer Videos Using Dense Detection Anchors

Dataset
|
December 10, 2021

Challenging Fashion Queries

Paper
|
August 11, 2021
ACM Multimedia 2021 (Industrial Track)

Distantly Supervised Semantic Text Detection and Recognition for Broadcast Sports Videos Understanding

Paper
|
July 22, 2021
ICCV 2021

UnitedFace: A Unified Perspective on Margin Softmax Losses for Face Recognition

Paper
|
May 17, 2021
KDD 2021

VisualTextRank: Unsupervised Graph-Based Content Extraction for Automating Ad Text to Image Search

Paper
|
September 4, 2019
NeurIPS

Image Captioning: Transforming Objects into Words

Paper
|
January 1, 2019
CVPR

Toward Realistic Image Compositing with Adversarial Learning

Paper
|
January 1, 2019
ICNC 2019

Domain-Specific Image Classification Using Ensemble Learning Utilizing Open-Domain Knowledge