Adib Sakhawat
Portfolio · 2026

Adib
Sakhawat

Software Engineering student at IUT, NLP researcher, backend engineer, and academic builder from Bangladesh — working at the intersection of language, law, and low-resource AI.

Natural Language Processing Large Language Models Legal AI Low-Resource Languages
IUT · SWE Adib Sakhawat — NLP researcher and Software Engineering student at IUT
Location
Dhaka, Bangladesh
Status
Final year · SWE
CGPA
3.90 / 4.00
GitHub
@sakhadib

01 — About

Research, engineering, and the open web.

I am a final-year Software Engineering student at the Islamic University of Technology (IUT), deeply focused on Natural Language Processing, Computational Mathematics, Legal AI, and Large Language Models. My work combines research, engineering, and open-source development with the goal of building impactful systems and publishing internationally recognized research.

I actively work on research-oriented datasets, multilingual NLP systems, information-extraction pipelines, and analytical AI platforms — with a bias toward low-resource languages and culturally grounded benchmarks.

Current Roles
  • Vice President — IUT Computer Society
  • Vice Chair — IEEE IUT Computer Chapter
  • Vice President — IUT Al-Fazari Interstellar Society
  • Final Year Student — Software Engineering, Dept. of CSE, IUT

02 — Research

Areas of inquiry.

Topics I actively read, build, and write papers on.

Natural Language Processing
Large Language Models
Legal AI & Computational Law
Low-Resource Language Tech
Information Extraction
ML Interpretability
Computational Mathematics
AI for Social Good
Crime Intelligence Systems
Multilingual Benchmarking

03 — Publications

Papers & preprints.

Selected work spanning figurative language, political alignment in LLMs, evaluation geometry, and game-theoretic benchmarks for dialogue.

P/01 LREC 2026 · Palma, Spain

When Words Don't Mean What They Say: Figurative Understanding in Bengali Idioms

A culturally grounded dataset of 10,361 Bengali idioms annotated with a 19-field schema capturing semantic, syntactic, cultural, and religious dimensions. Benchmarks 30 multilingual / instruction-tuned LLMs — none exceed 50% accuracy versus 83.4% human.

arXiv:2602.12921
P/02 LLM Alignment Audit

Political Alignment in Large Language Models

Audits 26 contemporary LLMs using three political psychometric inventories (Political Compass, SapplyValues, 8Values) plus a large news-bias labeling task. Shows 96.3% of models cluster in the Libertarian-Left quadrant — model identity, not prompt wording, explains most variance.

arXiv:2601.06194
P/03 Evaluation Framework

Coordinates of Capability: A Unified MTMM-Geometric Framework for LLM Evaluation

A SoK paper merging multitrait–multimethod (MTMM) analysis with geometric representations to organize and interpret LLM evaluation results across many benchmarks. Treats models as points in a capability space and argues for moving from leaderboard fragments to structured, geometry-aware capability maps.

arXiv:2605.08522
P/04 Dialogue Benchmark

AIDG: Asymmetry Between Information Extraction & Containment in Multi-Turn Dialogue

A game-theoretic benchmark with two tasks — social-deduction style AIDG-I and a structured "20 Questions" style AIDG-II. Across 439 games with six frontier models, finds a pronounced asymmetry: containment is far stronger than deduction (~350 ELO defensive advantage).

arXiv:2602.17443
P/05 Persuasion & Resistance

AREG: Adversarial Resource Extraction Game for LLMs

A multi-turn zero-sum negotiation game over financial resources to jointly evaluate persuasion and resistance in LLMs. Shows the two abilities are only weakly correlated; models defend better than they attack.

arXiv:2602.16639

Also: technical articles published in Python in Plain English on Medium.


04 — Projects

Things I've built.

Selected open-source projects across NLP, automation, graphics, and tooling.

PJ/01

AIManim

Turns plain-language math prompts into short Manim animations. Plans the explanation, generates scene code, repairs failed code, and stitches the final video. Supports OpenRouter, OpenAI, Gemini.

Repository
PJ/02

MathRanker

A web platform for mathematical enthusiasts: contests, problem-solving, community engagement. Built with Laravel, Filament, Livewire — featuring a Sergent-to-General rating system and collaborative forums.

Repository
PJ/03

Geon

A zero-dependency JS library that renders 2D mathematical graphics from a natural-language-like DSL into SVG in the browser — declarative geometry, plotting, intersection solving, smart labeling.

PJ/04

ZotecoRD

A Discord bot bridging Zotero with Discord and Google Sheets. Extracts color-coded PDF annotations and routes them to categorized channels — yellow → methods, green → contributions, blue → results, purple → claims, red → limitations.

Repository
PJ/05

Crime Data Scraper

AI-powered scraper extracting crime news from 40 verified sources across 6 continents. Uses spaCy NER and crime classification, SHA-256 + MD5 dedup, and daily GitHub Actions runs. Outputs an 18-column structured CSV.

Repository
PJ/06

CNN Comparison on Hand Gesture

A comparative study of 15 pretrained CNN architectures for character-level static hand-gesture recognition (37 classes, 1,500 images each). Includes Grad-CAM, LIME, t-SNE, and full evaluation suites.

Repository
PJ/07

Contexto

A .NET 6+ CLI that analyzes directory structures and generates XML reports about a codebase. Produces folders.xml, files.xml, stat.xml, and a combined complete.xml report — with intelligent exclusion of build folders.

Repository
PJ/08

MathVoyage (vmath)

A Java toolkit covering algebra, trigonometry, matrices, vectors, number systems, combinatorics, coordinate geometry, and bitwise ops — with JavaDoc and a JAR release.

Repository

05 — Datasets

Curated datasets.

Public datasets I've built and released on Kaggle to support research in low-resource NLP, legal AI, and game simulation.

DS/01 10,361 entries

Bangla Bagdhara

The largest computational resource for Bengali idioms — 19-field schema, 37,129 semantic tag assignments, sentiment & cultural metadata. Accompanies the LREC 2026 paper on figurative understanding.

DS/02 1,484+ acts

Bangladesh Legal Acts Dataset

Bangladesh's legal framework (1799–2025) — 35,633 sections, 14,523 footnotes, government & legal-system context. Multilingual: English, Bengali, mixed. CC-BY 4.0.

DS/03 100,000 tournaments

Snake & Ladder Game Simulation

100,000 simulated tournaments across 16 psychologically-inspired AI agents — group stage, knockouts, and full replay sequences. ~2.3GB CSV for behavioral modeling and strategy analysis.

DS/04 7,117 headlines

Crime Headline Binary Classification

A perfectly balanced dataset (3,562 crime / 3,562 non-crime) for binary classification — useful for logistic regression, decision trees, and deep-learning baselines.

DS/05 2004 – 2024

Bangladesh Math Olympiad Question Set

Two decades of BdMO papers, organized by year, level (Primary → Higher Secondary), and round (Divisional, Regional, National, Selection).


06 — Engineering

Stack & practice.

What I Build
Backend systems NLP pipelines Dataset construction Web scrapers AI evaluation systems Laravel applications Python automation Analytical platforms
Technologies
Python Laravel / PHP JavaScript / TypeScript C# SQL Jupyter Git & GitHub Linux / WSL2
GitHub
104+
Public Repos
4+
Years on GitHub
250+
Peak Commits / Quarter
12
Languages Used
Most active languages
  1. JavaScript01
  2. Blade02
  3. C#03
  4. Python04
  5. Java05
  6. PHP06
Top repositories by stars
  1. IUT_SWE_QuestionBank01
  2. Ground-News-Scraper02
  3. AiManim03
  4. auto-indent04
  5. MathRanker05
  6. vmath06

07 — Experience

Work & leadership.

  1. Oct 2025 — Mar 2026
    Software Engineering Intern
    Madestic Software Solutions Inc.
  2. 2025 — Present
    Vice Chair
    IUT IEEE Computer Society
  3. 2025 — Present
    Vice President
    IUT Computer Society
  4. 2025 — Present
    Vice President
    IUT Al-Fazari Interstellar Society
  5. 2024 — 2025
    Assistant Director, Technical Affairs
    IUT Computer Society
  6. 2024 — 2025
    Joint Secretary
    IUT Al-Fazari Interstellar Society
  7. 2023 — 2024
    Junior AI/ML Executive
    IUT Computer Society
  8. 2023 — 2024
    Junior Executive, Administration
    IUT Al-Fazari Interstellar Society
  9. 2020 — 2022
    Editor
    Bigganil — A Bangla Science Magazine

08 — Academics

Education & results.

A full record of my undergraduate journey at IUT, alongside earlier academic history. Detailed semester results are tucked behind expandable panels to keep this page calm.

Undergraduate
B.Sc. in Software Engineering
Islamic University of Technology (IUT)
3.90 / 4.00 · CGPA
Higher Secondary
HSC, Science
New Govt Degree College, Rajshahi
5.00 GPA · A+ in all subjects
Secondary
SSC, Science
Govt. Laboratory High School, Rajshahi
5.00 GPA · A+ in all subjects
S/01
First Semester
2021 — 2022
SGPA / CGPA
3.86 · 3.86
CourseCreditGrade
CSE 4104 Engineering Drawing Lab0.75A
CSE 4107 Structured Programming I3A
CSE 4108 Structured Programming I Lab1.5A+
Hum 4142 Arabic I1A+
Hum 4145 Islamiat2A+
Hum 4147 Technology, Environment and Society3A
Math 4141 Geometry and Differential Calculus4A+
Phy 4143 Physics II3A−
Phy 4144 Physics II Lab0.75A+
SWE 4101 Introduction to Software Engineering3A+
S/02
Second Semester
2021 — 2022
SGPA / CGPA
3.93 · 3.89
CourseCreditGrade
CSE 4203 Discrete Mathematics3A+
CSE 4205 Digital Logic Design3A+
CSE 4206 Digital Logic Design Lab0.75A+
Hum 4242 Arabic II1A+
Hum 4247 Accounting3A+
Hum 4249 Business Psychology and Communications3A−
Math 4241 Integral Calculus and Differential Equations4A+
SWE 4201 Object Oriented Concepts I3A+
SWE 4202 Object Oriented Concepts I Lab1.5A+
S/03
Third Semester
2022 — 2023
SGPA / CGPA
3.97 · 3.92
CourseCreditGrade
CSE 4303 Data Structures3A+
CSE 4304 Data Structures Lab1.5A+
CSE 4305 Computer Organization and Architecture3A+
CSE 4307 Database Management Systems3A
CSE 4308 Database Management Systems Lab1A+
CSE 4309 Theory of Computing3A+
Math 4341 Linear Algebra3A+
SWE 4301 Object Oriented Concepts II3A+
CSE 4302 Object Oriented Concepts II Lab1.5A+
SWE 4304 Software Project Lab I1.5A+
S/04
Fourth Semester
2022 — 2023
SGPA / CGPA
3.88 · 3.91
CourseCreditGrade
CSE 4403 Algorithms3A+
CSE 4404 Algorithms Lab1A+
CSE 4409 Database Management Systems II2A
CSE 4410 Database Management Systems II Lab1.5A+
CSE 4411 Data Communication and Networking3A+
CSE 4412 Data Communication and Networking Lab1A+
Hum 4441 Engineering Ethics3B+
Math 4441 Probability and Statistics3A+
SWE 4401 Software Requirements and Specifications3A+
CSE 4402 Software Requirements and Specifications Lab1A+
SWE 4404 Software Project Lab II1.5A+
S/05
Fifth Semester
2023 — 2024
SGPA / CGPA
3.90 · 3.91
CourseCreditGrade
CSE 4501 Operating Systems3A
CSE 4502 Operating Systems Lab1A+
CSE 4553 Machine Learning3A
CSE 4554 Machine Learning Lab0.75A+
Math 4543 Numerical Methods3A
Math 4544 Numerical Methods Lab0.75A+
SWE 4501 Design Patterns2A+
SWE 4502 Design Patterns Lab1A+
SWE 4503 Software Security3A+
SWE 4504 Software Security Lab0.75A+
SWE 4506 Design Project I1.5A+
SWE 4537 Server Programming3A+
SWE 4538 Server Programming Lab0.75A+
S/06
Sixth Semester
2023 — 2024
SGPA / CGPA
3.83 · 3.90
CourseCreditGrade
CSE 4617 Artificial Intelligence3A−
CSE 4618 Artificial Intelligence Lab0.75A+
CSE 4621 S Microprocessor and Interfacing3A
CSE 4622 S Microprocessor and Interfacing Lab0.75A+
Math 4643 Probability and Statistics II3A+
SWE 4601 Software Design and Architectures3A
SWE 4602 Software Design and Architectures Lab0.75A
SWE 4603 Software Testing and Quality Assurance3A
SWE 4604 Software Testing and Quality Assurance Lab1A+
SWE 4606 Design Project II1.5A+
SWE 4637 Web and Mobile Application Development3A+
SWE 4638 Web and Mobile Application Development Lab0.75A+
S/07
Seventh Semester
2024 — 2025
SGPA / CGPA
3.93 · 3.90
CourseCreditGrade
CSE 4714 Technical Report Writing0.75A+
HUM 4747 Legal Issues and Cyber Law3A+
SWE 4700 Project / Thesis1.5A+
SWE 4701 Software Metrics and Process3A+
SWE 4739 Embedded Software Development3A−
SWE 4740 Embedded Software Development Lab0.75A+
SWE 4790 Internship9A+

09 — Beyond

Outside the terminal.

Astronomy & space science

Stars, cosmology, observational astronomy — and the societies that gather around them.

Science fiction

Sci-fi and science-themed media — narrative as a way to think about systems.

Ancient Bengal history

Civilizations, manuscripts, and the long arc of cultural memory in the subcontinent.

Research discussions

Long-form academic conversations — the kind that shape future papers.


10 — Vision

What I'm building toward.

To build impactful AI systems, contribute meaningful research to the NLP community, and develop technologies that help low-resource languages and human ecosystems gain stronger representation in modern AI.


11 — Contact

Let's talk.

Open to research collaboration, internships, and conversations on NLP, legal AI, and low-resource language technology.

sakhadib@gmail.com