
VO MinhDuc  ヴォ ミンデュク

Researcher Number 40939906
Affiliation (based on the past Project Information) 2022 – 2024: The University of Tokyo, Graduate School of Information Science and Technology, Project Assistant Professor
Review Section/Research Field
Principal Investigator
Basic Section 61030:Intelligent informatics-related
Keywords
Principal Investigator
image captioning / vision - language / object recognition / dataset / multimodal LLMs / Long-tail data / Conditional GANs / Dataset / Story evaluation / Bias mitigation / External knowledge / GANs / Novel object captioning / Vision and language
  • Research Projects (2 results)
  • Research Products (25 results)
  • Co-Researchers (1 person)
  •  Unifying Object Detection and Image Captioning using Vision-Language Knowledge Base for Open-World Comprehension (Principal Investigator)

    • Principal Investigator
      VO MinhDuc
    • Project Period (FY)
      2024
    • Research Category
      Grant-in-Aid for Early-Career Scientists
    • Review Section
      Basic Section 61030:Intelligent informatics-related
    • Research Institution
      The University of Tokyo
  •  Vision and language cross-modal for training conditional GANs with long-tail data (Principal Investigator)

    • Principal Investigator
      VO MinhDuc
    • Project Period (FY)
      2022 – 2023
    • Research Category
      Grant-in-Aid for Early-Career Scientists
    • Review Section
      Basic Section 61030:Intelligent informatics-related
    • Research Institution
      The University of Tokyo


  • [Journal Article] Soft Curriculum for Learning Conditional GANs with Noisy-Labeled and Uncurated Unlabeled Data (2024)

    • Author(s)
      Katsumata Kai, Vo Duc Minh, Harada Tatsuya, Nakayama Hideki
    • Journal Title

      2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

      Volume: - Pages: 5311-5320

    • DOI

      10.1109/wacv57701.2024.00524

    • Peer Reviewed / Open Access / Int'l Joint Research
    • Data Source
      KAKENHI-PROJECT-23KJ0381, KAKENHI-PROJECT-22K17947, KAKENHI-PLANNED-22H05015, KAKENHI-PROJECT-23K28139
  • [Journal Article] NEMO: Can Multimodal LLMs Identify Attribute-Modified Objects? (2024)

    • Author(s)
      JiaXuan Li, Junwen Mo, Duc Minh Vo, Akihiro Sugimoto, Hideki Nakayama
    • Journal Title

      arXiv

      Volume: arXiv:2411.17794 Pages: 1-8

    • Data Source
      KAKENHI-PROJECT-24K20830
  • [Journal Article] Persistent Test-time Adaptation in Recurring Testing Scenarios (2024)

    • Author(s)
      Trung-Hieu Hoang, Duc Minh Vo, Minh N. Do
    • Journal Title

      The Thirty-eighth Annual Conference on Neural Information Processing Systems, NeurIPS 2024

      Volume: 37 Pages: 123402-123442

    • Peer Reviewed / Int'l Joint Research
    • Data Source
      KAKENHI-PROJECT-24K20830
  • [Journal Article] EVCap: Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension (2024)

    • Author(s)
      Li Jiaxuan, Vo Duc Minh, Sugimoto Akihiro, Nakayama Hideki
    • Journal Title

      2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

      Volume: 1

    • Peer Reviewed / Int'l Joint Research
    • Data Source
      KAKENHI-PROJECT-22K17947
  • [Journal Article] Label Augmentation as Inter-class Data Augmentation for Conditional Image Synthesis with Imbalanced Data (2024)

    • Author(s)
      Katsumata Kai, Vo Duc Minh, Nakayama Hideki
    • Journal Title

      2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

      Volume: - Pages: 4932-4941

    • DOI

      10.1109/wacv57701.2024.00487

    • Peer Reviewed / Open Access / Int'l Joint Research
    • Data Source
      KAKENHI-PROJECT-23KJ0381, KAKENHI-PROJECT-22K17947, KAKENHI-PLANNED-22H05015, KAKENHI-PROJECT-23K28139
  • [Journal Article] Revisiting Latent Space of GAN Inversion for Robust Real Image Editing (2024)

    • Author(s)
      Katsumata Kai, Vo Duc Minh, Liu Bei, Nakayama Hideki
    • Journal Title

      2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

      Volume: - Pages: 5301-5310

    • DOI

      10.1109/wacv57701.2024.00523

    • Peer Reviewed / Open Access / Int'l Joint Research
    • Data Source
      KAKENHI-PROJECT-23KJ0381, KAKENHI-PROJECT-22K17947, KAKENHI-PROJECT-23K28139, KAKENHI-PROJECT-22H00540
  • [Journal Article] A Compact Dynamic 3D Gaussian Representation for Real-Time Dynamic View Synthesis (2024)

    • Author(s)
      Kai Katsumata, Duc Minh Vo, Hideki Nakayama
    • Journal Title

      Computer Vision - ECCV 2024, The 18th European Conference on Computer Vision. Lecture Notes in Computer Science, Springer

      Volume: 15144 Pages: 394-412

    • DOI

      10.1007/978-3-031-73016-0_23

    • ISBN
      9783031730153, 9783031730160
    • Peer Reviewed / Open Access / Int'l Joint Research
    • Data Source
      KAKENHI-ORGANIZER-22H05012, KAKENHI-PLANNED-22H05015, KAKENHI-PROJECT-24K20830, KAKENHI-PROJECT-23KJ0381, KAKENHI-PROJECT-23K28139
  • [Journal Article] Questioning, Answering, and Captioning for Zero-Shot Detailed Image Caption (2024)

    • Author(s)
      Duc-Tuan Luu, Duc Minh Vo, Viet-Tuan Le
    • Journal Title

      Workshop on Large Vision - Language Model Learning and Applications, ACCV 2024

      Volume: 1 Pages: 242-259

    • Peer Reviewed / Int'l Joint Research
    • Data Source
      KAKENHI-PROJECT-24K20830
  • [Journal Article] Indirect Adversarial Losses via an Intermediate Distribution for Training GANs (2023)

    • Author(s)
      Rui Yang, Duc Minh Vo, Hideki Nakayama
    • Journal Title

      Proceedings of the 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

      Volume: - Pages: 4641-4650

    • DOI

      10.1109/wacv56688.2023.00463

    • Peer Reviewed / Open Access / Int'l Joint Research
    • Data Source
      KAKENHI-PLANNED-22H05015, KAKENHI-PROJECT-22H00540, KAKENHI-PROJECT-19H04166, KAKENHI-PROJECT-22K17947
  • [Journal Article] Partition-and-Debias: Agnostic Biases Mitigation via A Mixture of Biases-Specific Experts (2023)

    • Author(s)
      Li Jiaxuan, Vo Duc Minh, Nakayama Hideki
    • Journal Title

      2023 IEEE/CVF International Conference on Computer Vision (ICCV)

      Volume: 1 Pages: 4901-4911

    • DOI

      10.1109/iccv51070.2023.00454

    • Peer Reviewed / Open Access / Int'l Joint Research
    • Data Source
      KAKENHI-PROJECT-22K17947, KAKENHI-PROJECT-22H00540, KAKENHI-PROJECT-23K28139
  • [Journal Article] A-CAP: Anticipation Captioning with Commonsense Knowledge (2023)

    • Author(s)
      Vo Duc Minh, Luong Quoc-An, Sugimoto Akihiro, Nakayama Hideki
    • Journal Title

      2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

      Volume: 1 Pages: 10824-10833

    • DOI

      10.1109/cvpr52729.2023.01042

    • Peer Reviewed / Open Access / Int'l Joint Research
    • Data Source
      KAKENHI-PROJECT-22K17947, KAKENHI-ORGANIZER-22H05012, KAKENHI-PLANNED-22H05015
  • [Journal Article] NOC-REK: Novel Object Captioning with Retrieved Vocabulary from External Knowledge (2022)

    • Author(s)
      Duc Minh Vo, Hong Chen, Akihiro Sugimoto, Hideki Nakayama
    • Journal Title

      2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

      Volume: - Pages: 17979-17987

    • DOI

      10.1109/cvpr52688.2022.01747

    • Peer Reviewed / Open Access / Int'l Joint Research
    • Data Source
      KAKENHI-PROJECT-22K17947, KAKENHI-PROJECT-19H04166
  • [Journal Article] Stochastically Flipping Labels of Discriminator’s Outputs for Training Generative Adversarial Networks (2022)

    • Author(s)
      Rui Yang, Duc Minh Vo, Hideki Nakayama
    • Journal Title

      IEEE Access

      Volume: 10 Pages: 103644-103654

    • DOI

      10.1109/access.2022.3210130

    • Peer Reviewed / Open Access / Int'l Joint Research
    • Data Source
      KAKENHI-PROJECT-22K17947, KAKENHI-PROJECT-22H00540, KAKENHI-ORGANIZER-22H05012
  • [Journal Article] StoryER: Automatic Story Evaluation via Ranking, Rating and Reasoning (2022)

    • Author(s)
      Hong Chen, Duc Vo, Hiroya Takamura, Yusuke Miyao, Hideki Nakayama
    • Journal Title

      Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing

      Volume: - Pages: 1739-1753

    • Peer Reviewed / Open Access / Int'l Joint Research
    • Data Source
      KAKENHI-PROJECT-22K17947
  • [Presentation] Soft Curriculum for Learning Conditional GANs with Noisy-Labeled and Uncurated Unlabeled Data (2024)

    • Author(s)
      Katsumata Kai, Vo Duc Minh, Harada Tatsuya, Nakayama Hideki
    • Organizer
      2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)
    • Int'l Joint Research
    • Data Source
      KAKENHI-PROJECT-22K17947
  • [Presentation] Questioning, Answering, and Captioning for Zero-Shot Detailed Image Caption (2024)

    • Author(s)
      Duc-Tuan Luu, Duc Minh Vo, Viet-Tuan Le
    • Organizer
      Workshop on Large Vision - Language Model Learning and Applications, ACCV 2024
    • Int'l Joint Research
    • Data Source
      KAKENHI-PROJECT-24K20830
  • [Presentation] A Compact Dynamic 3D Gaussian Representation for Real-Time Dynamic View Synthesis (2024)

    • Author(s)
      Katsumata Kai, Vo Duc Minh, Nakayama Hideki
    • Organizer
      2024 European Conference on Computer Vision
    • Int'l Joint Research
    • Data Source
      KAKENHI-PROJECT-24K20830
  • [Presentation] Label Augmentation as Inter-class Data Augmentation for Conditional Image Synthesis with Imbalanced Data (2024)

    • Author(s)
      Katsumata Kai, Vo Duc Minh, Nakayama Hideki
    • Organizer
      2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)
    • Int'l Joint Research
    • Data Source
      KAKENHI-PROJECT-22K17947
  • [Presentation] Persistent Test-time Adaptation in Recurring Testing Scenarios (2024)

    • Author(s)
      Trung-Hieu Hoang, Duc Minh Vo, Minh N. Do
    • Organizer
      The Thirty-eighth Annual Conference on Neural Information Processing Systems, NeurIPS 2024
    • Int'l Joint Research
    • Data Source
      KAKENHI-PROJECT-24K20830
  • [Presentation] Revisiting Latent Space of GAN Inversion for Robust Real Image Editing (2024)

    • Author(s)
      Katsumata Kai, Vo Duc Minh, Liu Bei, Nakayama Hideki
    • Organizer
      2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)
    • Int'l Joint Research
    • Data Source
      KAKENHI-PROJECT-22K17947
  • [Presentation] A-CAP: Anticipation Captioning with Commonsense Knowledge (2023)

    • Author(s)
      Vo Duc Minh, Luong Quoc-An, Sugimoto Akihiro, Nakayama Hideki
    • Organizer
      2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
    • Int'l Joint Research
    • Data Source
      KAKENHI-PROJECT-22K17947
  • [Presentation] Indirect Adversarial Losses via an Intermediate Distribution for Training GANs (2023)

    • Author(s)
      Yang Rui, Vo Duc Minh, Nakayama Hideki
    • Organizer
      2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)
    • Int'l Joint Research
    • Data Source
      KAKENHI-PROJECT-22K17947
  • [Presentation] Partition-and-Debias: Agnostic Biases Mitigation via A Mixture of Biases-Specific Experts (2023)

    • Author(s)
      Li Jiaxuan, Vo Duc Minh, Nakayama Hideki
    • Organizer
      2023 IEEE/CVF International Conference on Computer Vision (ICCV)
    • Int'l Joint Research
    • Data Source
      KAKENHI-PROJECT-22K17947
  • [Presentation] NOC-REK: Novel Object Captioning with Retrieved Vocabulary from External Knowledge (2022)

    • Author(s)
      Duc Minh Vo, Hong Chen, Akihiro Sugimoto, Hideki Nakayama
    • Organizer
      2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
    • Int'l Joint Research
    • Data Source
      KAKENHI-PROJECT-22K17947
  • [Presentation] StoryER: Automatic Story Evaluation via Ranking, Rating and Reasoning (2022)

    • Author(s)
      Hong Chen, Duc Vo, Hiroya Takamura, Yusuke Miyao, Hideki Nakayama
    • Organizer
      2022 Conference on Empirical Methods in Natural Language Processing
    • Int'l Joint Research
    • Data Source
      KAKENHI-PROJECT-22K17947
  • 1.  Nakayama Hideki (中山 英樹)
    # of Collaborated Projects: 0 results
    # of Collaborated Products: 2 results
