Research
My research focuses on software engineering, security, and applied machine learning. Currently, I am working on the security of automated code-generation techniques. See my recent publications for more details.
Peer Reviewed Conference Publications
-
Nishat Raihan, Mohammed Latif Siddiq, Joanna C.S. Santos and Marcos Zampieri.
Large Language Models in Computer Science Education: A Systematic Literature Review.
56th ACM Technical Symposium on Computer Science Education (SIGCSE TS 2025), March, 2025.[Acceptance Rate: 33%]
[Preprint,
]
×
BibTex
@inproceedings{nishat2025llmeducation, author={Raihan, Nishat and Siddiq, Mohammed Latif and Santos, Joanna C. S. and Zampieri, Marcos}, title={Large Language Models in Computer Science Education: A Systematic Literature Review}, booktitle = {Proceedings of the 56th ACM Technical Symposium on Computer Science Education (SIGCSE TS '25)}, numpages = {7}, location = {Pittsburgh, Pennsylvania, United States}, series = {SIGCSE TS '25'} }
-
Mohammed Latif Siddiq, Simantika Dristi, Joy Saha, Joanna C. S. Santos.
The Fault in our Stars: Quality Assessment of Code Generation Benchmarks.
24th IEEE International Conference on Source Code Analysis and Manipulation (SCAM 2024), October, 2024.[
Preprint,
]
×
BibTex
@inproceedings{siddiq2024benchmarks, author={Siddiq, Mohammed Latif and Dristi, Simantika Bhattacharjee and Saha, Joy and Santos, Joanna C. S.}, booktitle={24th IEEE International Conference on Source Code Analysis and Manipulation (SCAM)}, title={The Fault in our Stars: Quality Assessment of Code Generation Benchmarks}, doi = {10.1109/SCAM63643.2024.00028}, year={2024}, }
-
Mohammed Latif Siddiq, Beatrice Casey, Joanna C. S. Santos.
FRANC: A Lightweight Framework for High-Quality Code Generation.
24th IEEE International Conference on Source Code Analysis and Manipulation (SCAM 2024), October, 2024.
[Preprint,
]
×
BibTex
@inproceedings{siddiq2024franc, author={Siddiq, Mohammed Latif and Casey, Beatrice and Santos, Joanna C. S.}, booktitle={24th IEEE International Conference on Source Code Analysis and Manipulation (SCAM)}, title={FRANC: A Lightweight Framework for High-Quality Code Generation}, doi = {10.1109/SCAM63643.2024.00020}, year={2024}, }
-
Mohammed Latif Siddiq, Joanna C. S. Santos, Ridwanul Hasan Tanvir, Noshin Ulfat, Fahmid Al Rifat, and Vinicius Carvalho Lopes.
Using Large Language Models to Generate JUnit Tests: An Empirical Study.
28th International Conference on Evaluation and Assessment in Software Engineering (EASE), June, 2024.
[Acceptance Rate: 21%] [Preprint,
]
×
BibTex
@inproceedings{siddiq2024junit, author={Siddiq, Mohammed Latif and Santos, Joanna C. S. and Tanvir, Ridwanul Hasan and Ulfat, Noshin and Rifat, Fahmid Al and Lopes, Vinicius Carvalho}, title={Using Large Language Models to Generate JUnit Tests: An Empirical Study}, booktitle = {Proceedings of the 28th International Conference on Evaluation and Assessment in Software Engineering}, pages = {313–322}, numpages = {10}, keywords = {junit, large language models, test generation, test smells, unit testing}, location = {Salerno, Italy}, series = {EASE '24} }
-
Mohammed Latif Siddiq, Jiahao Zhang, and Joanna C. S. Santos.
Understanding Regular Expression Denial of Service (ReDoS): Insights from LLM-Generated Regexes and Developer Forums.
32nd IEEE/ACM International Conference on Program Comprehension (ICPC 2024), April, 2024.
[Acceptance Rate: 29%] [Preprint,
]
×
BibTex
@inproceedings{siddiq2024regex, author={Siddiq, Mohammed Latif and Zhang, Jiahao and Santos, Joanna C. S.}, booktitle={32nd IEEE/ACM International Conference on Program Comprehension (ICPC 2024)}, title={Understanding Regular Expression Denial of Service (ReDoS): Insights from LLM-Generated Regexes and Developer Forums}, year={2024}, doi={10.1145/3643916.3644424} }
-
Mohammed Latif Siddiq, Shafayat Hossain Majumder, Maisha Rahman Mim, Sourov Jajodia, and Joanna C. S. Santos.
An Empirical Study of Code Smells in Transformer-based Code Generation Techniques.
22nd IEEE International Working Conference on Source Code Analysis and Manipulation (SCAM 2022), October, 2022.
[Acceptance Rate: 38.64%] [Preprint, Presentation, ]
×
BibTex
@inproceedings{siddiq2022empirical, author={Siddiq, Mohammed Latif and Majumder, Shafayat H. and Mim, Maisha R. and Jajodia, Sourov and Santos, Joanna C. S.}, booktitle={2022 IEEE 22nd International Working Conference on Source Code Analysis and Manipulation (SCAM)}, title={An Empirical Study of Code Smells in Transformer-based Code Generation Techniques}, year={2022}, volume={}, number={}, pages={71-82}, doi={10.1109/SCAM55253.2022.00014} }
-
Mohammed Latif Siddiq, Md. Rezwanur Rahman Jahin, Mohammad Rafid Ul Islam, Rifat Shahriyar, and Anindya Iqbal.
SQLIFIX: Learning Based Approach to Fix SQL Injection Vulnerabilities in Source Code.
28th IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER 2021), March, 2021.
[Acceptance Rate: 25%] [Preprint, Presentation, ]
×
BibTex
@inproceedings{siddiq2021sqlifix, title={SQLIFIX: Learning based approach to fix SQL injection vulnerabilities in source code}, author={Siddiq, Mohammed Latif and Jahin, Md Rezwanur Rahman and Islam, Mohammad Rafid Ul and Shahriyar, Rifat and Iqbal, Anindya}, booktitle={2021 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER)}, pages={354--364}, year={2021}, organization={IEEE} }
Peer Reviewed Workshop and Special Track Publications
-
Mohammed Latif Siddiq.
Advancing Secure and Standard Source Code Generation Techniques.
47th International Conference on Software Engineering, Doctoral Symposium (ICSE DS 2025), April, 2025.
[Preprint, ]
×
BibTex
@inproceedings{siddiq2025Advancing, author={Siddiq, Mohammed Latif}, title={Advancing Secure and Standard Source Code Generation Techniques}, booktitle = {Proceedings of the 47th International Conference on Software Engineering, Doctoral Symposium (ICSE DS '25)}, numpages = {4}, location = {Ottawa, Canada}, }
-
Mushfiqur Rahman and Mohammed Latif Siddiq.
Code Comment Classification with Data Augmentation and Transformer-Based Models.
4th International Workshop on NL-based Software Engineering (NLBSE 2025), April, 2025.
[Preprint, ]
×
BibTex
@inproceedings{codecomment25mushfiqur, author={Rahman, Mushfiqur and Siddiq, Mohammed Latif}, title={Code Comment Classification with Data Augmentation and Transformer-Based Models}, booktitle = {Proceedings of the 4th International Workshop on NL-based Software Engineering (NLBSE 2025)}, numpages = {4}, location = {Ottawa, Canada}, }
-
Mohammed Latif Siddiq, Joanna C. S. Santos, Sajith Devareddy, and Anna Muller.
SALLM: Security Assessment of Generated Code.
6th International Workshop on Automated and verifiable Software sYstem Development (ASYDE) Co-located with ASE 2024, October, 2024.
[Preprint, ]
×
BibTex
@inproceedings{siddiq2024sallm, author={Siddiq, Mohammed Latif and Santos, Joanna C. S. and Devareddy, Sajith and Muller, Anna}, title={SALLM: Security Assessment of Generated Code}, booktitle = {Proceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering Workshops (ASEW '24)}, numpages = {12}, location = {Sacramento, CA, USA}, doi = {10.1145/3691621.3694934} series = {ASEW '24} }
-
Mohammed Latif Siddiq, Lindsay Roney, Jiahao Zhang, and Joanna C. S. Santos.
Quality Assessment of ChatGPT Generated Code and their Use by Developers.
21st International Conference on Mining Software Repositories, Mining Challenge Track (MSR 2024), April, 2024.[Acceptance Rate: 45%]
[Preprint, ]
×
BibTex
@inproceedings{siddiq2024devgpt, author={Siddiq, Mohammed Latif and Roney, Lindsay and Zhang, Jiahao and Santos, Joanna C. S.}, booktitle={Proceedings of the 21st International Conference on Mining Software Repositories, Mining Challenge Track (MSR 2024)}, title={Quality Assessment of ChatGPT Generated Code and their Use by Developers}, year={2024} }
-
Mohammed Latif Siddiq, Jiahao Zhang, Lindsay Roney, and Joanna C. S. Santos.
Re(gEx|DoS)Eval: Evaluating Generated Regular Expressions and their Proneness to DoS Attacks.
46th International Conference on Software Engineering - New Ideas and Emerging Results Track (ICSE-NIER 2024), April, 2024.[Acceptance Rate: 23%]
[Preprint, ]
×
BibTex
@inproceedings{siddiq2024regexeval, author={Siddiq, Mohammed Latif and Zhang, Jiahao and Roney, Lindsay and Santos, Joanna C. S.}, booktitle={Proceedings of the 46th International Conference on Software Engineering, NIER Track (ICSE-NIER '24)}, title={Re(gEx|DoS)Eval: Evaluating Generated Regular Expressions and their Proneness to DoS Attacks}, year={2024} }
-
Mohammed Latif Siddiq, Abdus Samee, Sk Ruhul Azgor, Md. Asif Haider, Shehabul Islam Sawraz, and Joanna Cecilia da Silva Santos.
Zero-shot Prompting for Code Complexity Prediction Using GitHub Copilot.
2nd Intl. Workshop on Natural Language-based Software Engineering Co-located with ICSE 2022, May 2023.
[Preprint, ]
×
BibTex
@inproceedings{siddiq2023zero, author={Siddiq, Mohammed Latif and Samee, Abdus and Azgor, Sk Ruhul and Haider, Md. Asif and Sawraz, Shehabul Islam and Santos, Joanna C. S.}, booktitle={2023 IEEE/ACM 2nd International Workshop on Natural Language-Based Software Engineering (NLBSE)}, title={Zero-shot Prompting for Code Complexity Prediction Using GitHub Copilot}, year={2023}, volume={}, number={}, pages={56-59}, doi={10.1109/NLBSE59153.2023.00018} }
-
Mohammed Latif Siddiq, and Joanna Cecilia da Silva Santos.
SecurityEval Dataset: Mining Vulnerability Examples to Evaluate Machine Learning-Based Code Generation Techniques.
1st Intl. Workshop on Mining Software Repositories Applications for Privacy and Security Co-located with ESEC/FSE 2022, Nov 2022.
[Preprint, ]
×
BibTex
@inproceedings{siddiq2022seceval, author = {Siddiq, Mohammed Latif and Santos, Joanna C. S.}, title = {SecurityEval Dataset: Mining Vulnerability Examples to Evaluate Machine Learning-Based Code Generation Techniques}, year = {2022}, isbn = {9781450394574}, publisher = {Association for Computing Machinery}, address = {New York, NY, USA}, url = {https://doi.org/10.1145/3549035.3561184}, doi = {10.1145/3549035.3561184}, booktitle = {Proceedings of the 1st International Workshop on Mining Software Repositories Applications for Privacy and Security}, pages = {29–33}, numpages = {5}, keywords = {dataset, common weakness enumeration, security, code generation}, location = {Singapore, Singapore}, series = {MSR4P&S 2022} }
-
Mohammed Latif Siddiq, and Joanna Cecilia da Silva Santos.
BERT-Based GitHub Issue Report Classification.
1st Intl. Workshop on Natural Language-based Software Engineering Co-located with ICSE 2022, May 2022.
[Preprint, Presentation, ]
×
BibTex
@inproceedings{siddiq2022bert, author = {M. Siddiq and J. C. S. Santos}, booktitle = {2022 IEEE/ACM 1st International Workshop on Natural Language-Based Software Engineering (NLBSE)}, title = {BERT-Based GitHub Issue Report Classification}, year = {2022}, volume = {1}, issn = {978-1-4503-9343-0}, pages = {33-36}, keywords = {software maintenance;conferences;computer bugs;machine learning;natural language processing;labeling;task analysis}, doi = {10.1145/3528588.3528660}, url = {https://doi.ieeecomputersociety.org/}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, month = {may} }
arXiv Publications
- Rishov Paul, Md. Mohib Hossain, Mohammed Latif Siddiq, Masum Hasan, Anindya Iqbal, Joanna C. S. Santos. Enhancing Automated Program Repair through Fine-tuning and Prompt Engineering. ArXiv Preprint, July, 2023. [Preprint]
- Waqar Hassan Khan, Md Al Imran, Ahmed Nafis Fuad, Mohammed Latif Siddiq, ABM Islam. Shashthosheba: Dissecting Perception of Bangladeshi People towards Telemedicine Apps through the Lens of Features of the Apps. ArXiv Preprint, May, 2022. [Preprint]