Home
Categories
EXPLORE
True Crime
Comedy
Business
Society & Culture
Health & Fitness
Sports
Technology
About Us
Contact Us
Copyright
© 2024 PodJoint
00:00 / 00:00
Podjoint Logo
US
Sign in

or

Don't have an account?
Sign up
Forgot password
https://is1-ssl.mzstatic.com/image/thumb/Podcasts126/v4/4a/9c/ef/4a9ceff8-5c1a-e15c-62d9-6360c52cd38a/mza_2283181023971434852.jpg/600x600bb.jpg
TechcraftingAI Computer Vision
Brad Edwards
315 episodes
5 days ago
TechcraftingAI Computer Vision brings you summaries of the latest arXiv research daily. Research is read by your virtual host, Sage. The podcast is produced by Brad Edwards, an AI Engineer from Vancouver, BC, and a graduate student of computer science studying AI at the University of York. Thank you to arXiv for use of its open access interoperability.
Show more...
Technology
RSS
All content for TechcraftingAI Computer Vision is the property of Brad Edwards and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
TechcraftingAI Computer Vision brings you summaries of the latest arXiv research daily. Research is read by your virtual host, Sage. The podcast is produced by Brad Edwards, an AI Engineer from Vancouver, BC, and a graduate student of computer science studying AI at the University of York. Thank you to arXiv for use of its open access interoperability.
Show more...
Technology
https://d3t3ozftmdmh3i.cloudfront.net/staging/podcast_uploaded_nologo/39305030/39305030-1703089970889-aab16cf4a6955.jpg
Ep. 245 - Part 2 - June 11, 2024
TechcraftingAI Computer Vision
36 minutes 57 seconds
1 year ago
Ep. 245 - Part 2 - June 11, 2024

ArXiv Computer Vision research for Tuesday, June 11, 2024.


00:21: NeRSP: Neural 3D Reconstruction for Reflective Objects with Sparse Polarized Images

01:27: Beyond Bare Queries: Open-Vocabulary Object Retrieval with 3D Scene Graph

03:14: T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text

04:45: Benchmarking and Boosting Radiology Report Generation for 3D High-Resolution Medical Images

06:23: FaceGPT: Self-supervised Learning to Chat about 3D Human Faces

07:52: RecMoDiffuse: Recurrent Flow Diffusion for Human Motion Generation

09:15: VoxNeuS: Enhancing Voxel-Based Neural Surface Reconstruction via Gradient Interpolation

10:51: RAD: A Comprehensive Dataset for Benchmarking the Robustness of Image Anomaly Detection

12:05: RGB-Sonar Tracking Benchmark and Spatial Cross-Attention Transformer Tracker

13:52: MeMSVD: Long-Range Temporal Structure Capturing Using Incremental SVD

15:15: Can Foundation Models Reliably Identify Spatial Hazards? A Case Study on Curb Segmentation

16:56: MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance

18:20: Open-World Human-Object Interaction Detection via Multi-modal Prompts

20:03: Which Country Is This? Automatic Country Ranking of Street View Photos

20:44: Needle In A Multimodal Haystack

22:10: Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models

23:24: Towards Realistic Data Generation for Real-World Super-Resolution

24:37: Unsupervised Object Detection with Theoretical Guarantees

25:43: Embedded Graph Convolutional Networks for Real-Time Event Data Processing on SoC FPGAs

27:45: A Framework for Efficient Model Evaluation through Stratification, Sampling, and Estimation

29:01: Cinematic Gaussians: Real-Time HDR Radiance Fields with Depth of Field

30:24: Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach

32:09: Global-Regularized Neighborhood Regression for Efficient Zero-Shot Texture Anomaly Detection

33:52: Deep Implicit Optimization for Robust and Flexible Image Registration

35:28: Visual Representation Learning with Stochastic Frame Prediction

TechcraftingAI Computer Vision
TechcraftingAI Computer Vision brings you summaries of the latest arXiv research daily. Research is read by your virtual host, Sage. The podcast is produced by Brad Edwards, an AI Engineer from Vancouver, BC, and a graduate student of computer science studying AI at the University of York. Thank you to arXiv for use of its open access interoperability.