WORK EXPERIENCE
Software Engineer
Apurba Technologies Ltd 12/2024 - Present Dhaka, Bangladesh- Developed scalable RESTful APIs using FastAPI and Node.js while containerizing them in Docker and deploying them to AWS.
- Optimized query performance, reducing retrieval and data creation times by 5% by implementing indexing strategies.
- Led the migration of monolith applications to a service-oriented architecture, improving system modularity and deployment efficiency.
- Built a permissioned blockchain network from scratch.
- Authored unit, integration, load, and stress tests.
- Collaborated with companies and government agencies to implement, research, and complete project deliverables.
Lecturer (Contractual)
United International University 06/2024 - 10/2024 Dhaka, Bangladesh- Taught core CS concepts to students.
- Took class tests and term exams to monitor students' performance and provide informative feedback.
- Worked with the Dept. of CSE to audit students' marks for BAETE/IEB accreditation.
- Courses conducted: Object-Oriented Programming Lab, Data Structures and Algorithms 1 Lab, and Database Management Systems Theory.
Undergraduate Assistant
United International University 09/2023 - 05/2024 Dhaka, Bangladesh- Managed to facilitate course content and materials with the lab faculty, checked and evaluated assessments, and judged lab projects.
- Created the very first course materials for the country's first-ever undergraduate degree in Data Science.
- Courses supervised: Programming for Data Science, and Object-Oriented Programming for Data Science.
PROFESSIONAL SKILLS


EDUCATION
B.Sc in Computer Science & Engineering (Major: Data Science)
United International University
(2020 - 2024)
CGPA: 3.77/4.00, Major CGPA: 4.00/4.00
Thesis Title: AVLoS: Audio-Visual Long Text Scene Summarization
Thesis Supervisor: Prof. Dr. Swakkhar Shatabda
RESEARCH EXPERIENCE
An audio video-based multi-modal fusion approach for speech emotion recognition (View Project Page)
Co-supervised by: Md. Rayhan Ahmed, Dr. Salekul Islam, Dr. Swakkhar Shatabda, and Dr. A.K.M Muzahidul Islam.
Under Review at Knowledge-Based Systems (Elsevier) | [codebase]
We present an approach to classify human emotions by fusing audio and visual inputs. Our approach sets new state-of-the-art results while keeping the architecture very simple. Additionally, we present a new frame filtering strategy to overcome the problem of spatiotemporal redundancy.
AVLoS: Audio-Visual Long Text Scene Summarization
Supervised by: Dr. Swakkhar Shatabda.
Thesis Work. In Progress.
We are experimenting on fine-grained long text scene summarization from videos using video and audio inputs. Most approaches use queries to guide text generation, we are experimenting to generate text without these queries.
MIMIC: Multimodal Islamophobic Meme Identification and Classification
Supervised by: Nahid Hossain.
Accepted at MusIML Workshop - NeurIPS 2024. | [codebase] [arxiv]
Anti-Muslim hate speech has emerged within memes, characterized by context-dependent and rhetorical messages using text and images that seemingly mimic humor but convey Islamophobic sentiments. This work presents a novel dataset and proposes a classifier based on the Vision-and-Language Transformer (ViLT) specifically tailored to identify anti-Muslim hate within memes by integrating both visual and textual representations.
BhaShammo: IPA Transcription of Bengali Regional Dialect using Dialect Guided Tokens
Co-supervised by: Dr. Swakkhar Shatabda and Dr. Farig Sadeque.
In Progress. | [codebase] [arxiv]
We present an approach to transcribe regional Bengali text to IPA by introducing the Dialect Guided Tokens (DGT) technique on a new dataset spanning six districts of Bangladesh. We provide the model with information on the regional dialect of the input text before generating the IPA transcription. This is the first time this problem has been solved.
E-MedViTR: Enhanced Vision Transformers with Registers for Biomedical Image Classification
Supervised by: Dr. Dewan Md. Farid.
Published at ICEEICT 2024. | [paper]
We investigated the effectiveness of the ViT with registers (aka DINO v2) in classifying medical pathology images. Normally, it doesn't perform up to the mark as SOTA models do, but with an extension and data augmentations, it performs relatively close to SOTA models.
PUBLICATIONS
Journal Papers (Under Review)
S M Jishanul Islam, Sahid Hossain Mustakim, Musfirat Hossain, Mysun Mashira, Nur Islam Shourav, MD. Rayhan Ahmed, Salekul Islam, Swakkhar Shatabda, and A.K.M. Muzahidul Islam. 2024. An audio video-based multi-modal fusion approach for speech emotion recognition. Knowledge-Based Systems (Elsevier).
Conference Papers
Sadia Ahmmed, Taimur Rahman, S M Jishanul Islam, Al-Momen Reyad, Sonjoy Dey, James Anthony Purification, and Md. Dewan Farid. 2024. E-MedViTR - Enhanced Vision Transformers with Registers for Biomedical Image Classification. 6th International Conference on Electrical Engineering and Information & Communication Technology (ICEEICT 2024) [Scopus Indexed]. DOI: 10.1109/ICEEICT62016.2024.10534573
Workshop Papers
S M Jishanul Islam, Sahid Hossain Mustakim, Sadia Ahmmed, Md. Faiyaz Abdullah Sayeedi, Swapnil Khandoker, Syed Tasdid Azam Dhrubo, and Nahid Hossain. 2024. MIMIC: Multimodal Islamophobic Meme Identification and Classification. 3rd Muslims in ML Workshop - Neural Information Processing Systems 2024 (NeurIPS 2024). DOI: 10.48550/arXiv.2412.00681
PROJECTS
NurtureAid
Developed a real-time AI-based cross-platform mobile application that simplifies care through caretakers. Led the app's backend and frontend development as a full-stack engineer.
Tech Stack: React Native, Node.js, Flask, MongoDB, Firebase, LLaMA-Index, PyTorch.
Acknowledgements: Champion of the UIU CSE Project Show, Software Engineering Laboratory; Champion of the Hult Prize OnCampus round in UIU; Selected for the Hult Prize Summit in Boston.
Quest Aid
Developed an AI-driven web application to manage the ECAs of students, clubs, universities, and organizations under the same platform. Led the app's backend, frontend, and AI development as a full-stack engineer.
Tech Stack: React.js, Spring Boot, Flask, MySQL, Langchain.
iamspecial.com
Developed a web app built to tackle the lack of misinformation, accessibility and guidance for people with special needs. Led the app's backend and frontend development as a full-stack engineer.
Tech Stack: HTML, CSS, JavaScript, PHP, MySQL.
Acknowledgements: Champion of the UIU CSE Project Show, Database Management Systems Laboratory.
ECMS-Desktop
Developed an ECA Management System for uniting the ECAs of students, clubs, universities, and organizations under the same platform. Led the app's backend and frontend development as a full-stack engineer.
Tech Stack: Java, JavaFX, MySQL.
Acknowledgements: Champion of the UIU CSE Project Show, Advanced Object-Oriented Programming Laboratory.
ACHIEVEMENTS
Winner, Harvard Health Systems Innovation Lab Hackathon 2025 Dhaka Hub April 2025
Academic Excellence Scholarship 2020 β 2024
Champion, Bhashamul: Bengali Regional Text to IPA National NLP Datathon March 2024
Champion, Hult Prize OnCampus Round March 2024
Top 15 Finalist, Hult Prize Summit in Boston, USA June 2024
Finalist, ICT Innovation Grant (for NurtureAid) December 2023
Gold Award (Champion), International Blockchain Olympiad IBCOL 2023 November 2023
Champion, Software Engineering Lab September 2023
Silver Award (1st Runner Up), Bangladesh Blockchain Olympiad BCOLBD 2023 July 2023
1st Runner Up, Intra-University Deep Learning Sprint January 2023
Champion, Database Management Systems Lab May 2022
Champion, Advanced Object Oriented Programming Lab January 2022
The Daily Star Award 2017
Academia High Achievers Award 2017