Theo Abbruscato

Research & Data Analyst | OSINT & Language Technology

About

I am an interdisciplinary analyst earning degrees in linguistics, international relations (intelligence, security, and diplomacy), and a certificate in symbolic, cognitive, and linguistic systems, seeking opportunities in research analysis, OSINT, and language / data driven roles.I have experience conducting qualitative and quantitative research using Python (Jupyter, Pandas, SudachiPy), SQL, and other programming languages such as Java and C#. My work focuses on analyzing language, information, and open-source data to identify patterns, extract insights, and support evidence-based analysis.I am a native English speaker with working proficiency in Spanish, reading proficiency in French, Italian, and Portuguese, and actively developing proficiency in German and Mandarin Chinese. Additionally, I have 3 years of experience in post-secondary ESL education.


My Projects

Come back soon! :)

  • Tokenization Challenges in Japanese: Morphology, Orthography, and NLP Segmentation - Undergraduate presenter at Arizona State University's Graduate Linguistics, Applied Linguistics, and TESOL Symposium (2026).

  • Tools used: Python, SudachiPy, MeCab (UniDic / IPADIC), spaCy, GiNZA, Hugging Face tokenizer, Pandas, Jupyter, & Matplotlib.


Let's get in contact: