End-to-end system for detecting deepfake speech using multi-modal fusion of Wav2Vec2, Mel-CNN, and MFCC-BiLSTM models. deepfake_audio_project/ ├── data/ │ ├── raw/real/ # Original real audio (.wav) │ ...