Ep. 240 - Part 1 - June 6, 2024

TechcraftingAI Computer Vision

Brad Edwards

315 episodes

8 hours ago

TechcraftingAI Computer Vision brings you summaries of the latest arXiv research daily. Research is read by your virtual host, Sage. The podcast is produced by Brad Edwards, an AI Engineer from Vancouver, BC, and a graduate student of computer science studying AI at the University of York. Thank you to arXiv for use of its open access interoperability.

Technology

All content for TechcraftingAI Computer Vision is the property of Brad Edwards and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.

49 minutes 14 seconds

1 year ago

ArXiv Computer Vision research for Thursday, June 06, 2024.

00:20: ReDistill: Residual Encoded Distillation for Peak Memory Reduction

01:58: Instance Segmentation and Teeth Classification in Panoramic X-rays

03:34: Enhanced Semantic Segmentation Pipeline for WeatherProof Dataset Challenge

04:44: Amortized Equation Discovery in Hybrid Dynamical Systems

05:57: Monocular Localization with Semantics Map for Autonomous Vehicles

07:22: From operculum and body tail movements to different coupling of physical activity and respiratory frequency in farmed gilthead sea bream and European sea bass. Insights on aquaculture biosensing

09:36: Semantic Similarity Score for Measuring Visual Similarity at Semantic Level

11:32: LLplace: The 3D Indoor Scene Layout Generation and Editing via Large Language Model

13:12: Polyp and Surgical Instrument Segmentation with Double Encoder-Decoder Networks

13:52: C^2RV: Cross-Regional and Cross-View Learning for Sparse-View CBCT Reconstruction

15:19: Data-Centric Label Smoothing for Explainable Glaucoma Screening from Eye Fundus Images

16:39: Exploring the Zero-Shot Capabilities of Vision-Language Models for Improving Gaze Following

18:03: Frequency-based Matcher for Long-tailed Semantic Segmentation

19:28: LDM-RSIC: Exploring Distortion Prior with Latent Diffusion Models for Remote Sensing Image Compression

21:18: LNQ Challenge 2023: Learning Mediastinal Lymph Node Segmentation with a Probabilistic Lymph Node Atlas

22:45: 3rd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation

23:30: Jailbreak Vision Language Models via Bi-Modal Adversarial Prompt

25:10: Zero-Painter: Training-Free Layout Control for Text-to-Image Synthesis

26:03: Shaping History: Advanced Machine Learning Techniques for the Analysis and Dating of Cuneiform Tablets over Three Millennia

28:01: Semmeldetector: Application of Machine Learning in Commercial Bakeries

29:08: Class-Aware Cartilage Segmentation for Autonomous US-CT Registration in Robotic Intercostal Ultrasound Imaging

30:45: How Far Can We Compress Instant-NGP-Based NeRF?

32:11: UrbanSARFloods: Sentinel-1 SLC-Based Benchmark Dataset for Urban and Open-Area Flood Mapping

34:01: Global Parameterization-based Texture Space Optimization

34:52: LenslessFace: An End-to-End Optimized Lensless System for Privacy-Preserving Face Verification

36:22: The 3D-PC: a benchmark for visual perspective taking in humans and machines

38:29: Improving Physics-Augmented Continuum Neural Radiance Field-Based Geometry-Agnostic System Identification with Lagrangian Particle Optimization

40:08: Sparse Multi-baseline SAR Cross-modal 3D Reconstruction of Vehicle Targets

41:50: A Voxel-based Approach for Simulating Microbial Decomposition in Soil: Comparison with LBM and Improvement of Morphological Models

43:25: Encoding Semantic Priors into the Weights of Implicit Neural Representation

45:04: Diffusion-based image inpainting with internal learning

45:58: CDMamba: Remote Sensing Image Change Detection with Mamba

47:36: Matching Anything by Segmenting Anything