cv

Basics

Name Mirza Farhan Bin Tarek
Label Ph.D. Student
Email mfarhan@udel.edu
Url https://mirzafarhan7.github.io
Summary I am a dedicated and passionate individual pursuing a Ph.D. in Computer Science at the University of Delaware. My research interests lie at the intersection of machine learning, healthcare, and fairness. Currently, my research focuses on harnessing large language models (LLMs) to generate fair and unbiased clinical data, such as clinical vignettes and notes. I thrive on challenges and am constantly motivated to expand my knowledge and skill set.

Work

  • 2023.08 - Present
    Instructor (On Contract)
    University of Delaware
    As the instructor of CISC181- Introduction to Computer Science II, I am responsible for giving lectures to students, grading the assessments, and designing and updating the curriculum.
  • 2022.08 - Present
    Graduate Teaching Assistant
    University of Delaware
    I have served as the Infrastructure Teaching Assistant (TA), responsible for assisting the Academic Advisor with the development of a web-based application designed to streamline the process of sending offer letters to a diverse group of TAs while efficiently managing their office hours scheduling in addition to other functionalities. It involved leveraging a tech stack that includes HTML, CSS, JavaScript, FastAPI, Jinja2, and Python to create a user-friendly and automated system that enhances the TA onboarding experience and ensures seamless communication with teaching staff.

Education

  • 2022.08 - Present

    Delaware, USA

    Ph.D.
    University of Delaware
    Computer Science
    • CISC 621 ALGORITHM DESIGN AND ANALYSIS
    • CISC 637 DATABASE SYSTEMS
    • CISC 655 COMM SKILLS FOR CS RESEARCH
    • CISC 662 COMPUTER SYSTEMS: ARCHITECTURE
    • CISC 681 ARTIFICIAL INTELLIGENCE
    • CISC 684 MACHINE LEARNING
    • CISC 849 ADV TPCS IN COMPUTER APPLICATION: Research in Accessible Computing
    • CISC 889 ADVANCED DEEP LEARNING
  • 2017 - 2018

    Staffordshire, UK

    Individual Modules (Erasmus+ KA1 Mobility Program)
    Staffordshire University
    Computer Science
  • 2014 - 2018

    Dhaka, Bangladesh

    Bachelor of Science (BSc)
    United International University, Dhaka, Bangladesh
    Computer Science and Engineering

Awards

Certificates

Web development using ASP.NET
CDIP (Centre for Development of IT Professionals), United International University, Dhaka, Bangladesh 2017

Publications

  • 2023
    Improving fairness in AI models on electronic health records: The case for federated learning methods
    Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency
    Developing AI tools that preserve fairness is of critical importance, specifically in high-stakes applications such as those in healthcare. However, health AI models' overall prediction performance is often prioritized over the possible biases such models could have. In this study, we show one possible approach to mitigate bias concerns by having healthcare institutions collaborate through a federated learning paradigm (FL; which is a popular choice in healthcare settings). While FL methods with an emphasis on fairness have been previously proposed, their underlying model and local implementation techniques, as well as their possible applications to the healthcare domain remain widely underinvestigated. Therefore, we propose a comprehensive FL approach with adversarial debiasing and a fair aggregation method, suitable to various fairness metrics, in the healthcare domain where electronic health records are used. Not only our approach explicitly mitigates bias as part of the optimization process, but an FL-based paradigm would also implicitly help with addressing data imbalance and increasing the data size, offering a practical solution for healthcare applications. We empirically demonstrate our method's superior performance on multiple experiments simulating large-scale real-world scenarios and compare it to several baselines. Our method has achieved promising fairness performance with the lowest impact on overall discrimination performance (accuracy).
  • 2018
    Spatio-temporal analysis of large air pollution data
    2018 10th International Conference on Electrical and Computer Engineering (ICECE), IEEE
    Air pollution is one of the most dangerous environmental threats in our planet. Although it is severe in highly populated and industrialized cities of developing countries, it is a major concern for developed countries as well. In the developed world, air quality data is gathered from a large number of air pollution monitoring stations. However, the volume of data is very high and it is not possible to analyze the data efficiently in real-time using the conventional methods. Hence, large scale data mining techniques can help in analyzing those data more efficiently and dynamically. In this paper, a method for mining large amount of air pollution data is proposed for finding air pollution hot spots and time of pollution using clustering methods and time-series analysis. The results, after using the method to the air pollution data of PM 2 . 5 , PM 10 and ozone in the United Kingdom from 2015-17, has shown that the pollution due to particulate matters was higher in winter season and ozone pollution had downward trend except some areas.

Projects

Skills

Programming Languages
Python
Java
C
Database
MySQL
ORACLE
Web Dev and Software Engineering
HTML
CSS
FASTAPI
JINJA2
Machine Learning
ML libraries like scikit-learn, PyTorch
Misc.
LATEX typesetting and publishing
Academic Research

References

Dr. Rahmatollah Beheshti, Assistant Professor, Department of Computer & Information Sciences, University of Delaware