Toru Tamaki

Paper introduction: MotionMatcher: Cinematic Motion Customization of Text-to-Video Diffusion Models via Motion Feature Matching
Paper introduction: DiffusionRet: Generative Text-Video Retrieval with Diffusion Model
Paper introduction: HiLoRA: Adaptive Hierarchical LoRA Routing for Training-Free Domain Generalization
Paper introduction: InternVideo2: Scaling Foundation Models for Multimodal Video Understanding
Paper introduction: "Locality-Aware Zero-Shot Human-Object Interaction Detection", "Disentangled Pre-training for Human-Object Interaction Detection", "Discovering Syntactic Interaction Clues for Human-Object Interaction Detection"
Paper introduction: Simultaneous Detection and Interaction Reasoning for Object-Centric Action Recognition
Paper introduction: "Reflexion: language agents with verbal reinforcement learning", "MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding"
Paper introduction: "MM-Tracker: Motion Mamba for UAV-platform Multiple Object Tracking", "MambaTrack: A Simple Baseline for Multiple Object Tracking with State Space Model"
Paper introduction: "Keyframe-oriented Vision Token Pruning: Enhancing Efficiency of Large Vision Language Models on Long-Form Video Processing", KVTP, METEOR, STTM
Paper introduction: "RAt: Injecting Implicit Bias for Text-To-Image Prompt Refinement Models", "Measuring What Matters: Evaluating Ensemble LLMs with Label Refinement in Inductive Coding", "Dynamic Label Name Refinement for Few-Shot Dialogue Intent Classification"
Paper introduction: SoccerNet-v2: A Dataset and Benchmarks for Holistic Understanding of Broadcast Soccer Videos
Paper introduction: LoRA in LoRA: Towards Parameter-Efficient Architecture Expansion for Continual Visual Instruction Tuning
Paper introduction: Unboxed: Geometrically and Temporally Consistent Video Outpainting
Paper introduction: OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?
Paper introduction: HOTR: End-to-End Human-Object Interaction Detection With Transformers, Human-Object Interaction Detection via Disentangled Transformer, QPIC