M.S. in Artificial Intelligence ('26) · University of Michigan–Dearborn
MS AI student at the University of Michigan–Dearborn (graduating May 2026), researching contrastive representation learning for audio deepfake detection at the ISSF Lab under Prof. Hafiz Malik. My thesis is a controlled study of how similarity function choice and negative scaling affect OOD generalization in SSL speech models for deepfake detection tasks achieving 0.25% EER on ASVspoof 2019 and 8.3% on In-the-Wild. My background spans computer vision (segmentation, behavioral cloning, Siamese networks), signal processing (GMRT radio data, star navigation), and a published IEEE paper on optical authentication (ViKey, IEEE MASS 2025). I'm interested in what signals encode about the physical world and how to build models that learn that structure robustly.
Open to full-time roles in audio ML, speech AI, and representation learning starting May 2026.
Research
Controlled study of cosine vs. geodesic similarity and cross-batch negative scaling (|Q|=2048) over frozen XLS-R-300M. Stage 1: SupCon projection head. Stage 2: linear BCE classifier. Ablations: temperature sweep, queue-size sweep, UMAP geometry. Evaluated across FakeXpose and FamousFigures OOD corpora.
From-scratch replication of a two-stage SSL deepfake detection framework using Wav2Vec2 and WavLM backbone variants. Stage 1 learns compressed style/linguistic representations; Stage 2 trains a classifier with waveform augmentations (noise, reverb, SpecAugment). Built per-speaker embedding diagnostics and UMAP/t-SNE visualizations to surface entity-level failure modes invisible in aggregate EER.
Proposed the first passive visible light tag for door access control, exploiting polarized birefringence in layered transparent tapes to generate 3D position-dependent color patterns as unclonable keys. Conducted a systematic case study of tag shape, tape layers, and stacking sequences, and designed a real-time CV pipeline using per-channel SIFT, FLANN matching, RANSAC geometric verification, and temporal consistency checks achieving 90.5% accuracy at 0.5m and <100ms latency without any deep learning.
Projects
Tkinter interface for SAM (ViT-H/L/B). Point, box, and text prompts with real-time mask visualization on consumer hardware.
Lat/lon estimation via star-catalog matching. Template, SIFT/ORB homography, and grid-based strategies combined for rotation/scale robustness.
Experience
Research Assistant — Speech ML / Audio Anti-Spoofing
Advisor: Prof. Hafiz Malik
Research Assistant — Computer Vision & ML Systems
Advisor: Prof. Xiao Zhang
Research Assistant — Signal Processing + ML
Advisor: Prof. Narendranath Muthumani
Publications
Jaskirat Sudan, Fatima Qasem, Hasky E Fynn, Fatima Mohammed, Ashwin Sarvadey, Tian Xie, Ang Li, Xiao Zhang
Low-cost, privacy-preserving door access control via visible-light backscatter. Polarized birefringence tags create 3D, position-dependent color patterns as unclonable optical keys. The <$0.20 prototype achieves ~80% auth accuracy at 0.5m while eliminating cloning, replay, and privacy attack surfaces.
Hasky Fynn, Jaskirat Sudan, Fatima Qasem, Fatima Mohammed, Xiao Zhang
Companion demo paper presenting the live ViKey system, the first visible light backscatter-based door access control system using polarized birefringence to generate 3D position-dependent color patterns as keys, enabling robust and contactless authentication.
Safdar Sardar Khan*, Jaskirat Singh Sudan*, Anuj Pathak*, Rakesh Pandit, Pinky Rane, Ashish Kumar Kumawat (* equal contribution)
Systematic survey of 25+ EEG denoising methods across statistical, ICA-based, wavelet-based, and DL-based approaches for BCI applications. Hybrid ICA + wavelet pipelines consistently outperform single-method approaches for preserving neural signal quality in real-time pipelines.
Blog
How radio telescopes see what eyes can't, revealing pulsars, black holes, and interstellar gas to map the invisible universe.
How an early-warning algorithm and crucial human judgment prevented a Cold War false alarm from escalating into catastrophe.