Theo Abbruscato

East Asia Analyst | OSINT & CJK Information Environments | Linguistics + NLP

About

I’m an East Asia–focused analyst specializing in CJK information environments, OSINT, and language-driven analysis. My work focuses on how political narratives are constructed and disseminated across Chinese, Japanese, and Korean media ecosystems.I combine training in International Relations (intelligence, security, diplomacy) and Linguistics with applied experience in Python-based text analysis (spaCy, SudachiPy, Pandas) and NLP workflows. My research includes a presentation on Japanese tokenization at Arizona State University’s Graduate Linguistics Symposium.I am currently developing projects analyzing multilingual political discourse and information operations across East Asian contexts, with a focus on identifying patterns in narrative framing, keyword usage, and cross-lingual differences.I am currently seeking opportunities in:
• OSINT & information environment analysis
• East Asia geopolitical or risk analysis
• CJK‑focused LLM evaluation / linguistic QA
• Tech‑policy, AI governance, or language‑driven research


Education

Arizona State University - Bachelor's of Arts in English (Linguistics) - May 2027Arizona State University - Bachelor's of Arts in International Relations (Intelligence, Security, and Diplomacy) - May 2027Arizona State University - Certificate in Symbolic, Cognitive, and Linguistic Systems - May 2027Mesa Community College - Associate of Arts in English (Creative Writing) - December 2024Mesa Community College - Certificate in Language Studies - Decemeber 2024


Projects

Tokenization Challenges in Japanese: Morphology, Orthography, and NLP Segmentation (Feb 2026)
Tools: spaCy, Jupyter, Python, Pandas, Matplotlib
• Undergraduate presenter at Arizona State University's Graduate Linguistics, Applied Linguistics, and TESOL Symposium
• Analyzed segmentation challenges in Japanese across different tokenization approaches
• Compared morphological vs rule-based tokenization outputs
Cross-Lingual Narrative Analysis: Chinese vs English Media (US-China Relations) (Upcoming)
Tools: Python, Pandas, spaCy / jieba, basic visualization
• Collected and analyzed news articles from Chinese and English-language sources
• Compared keyword frequency, framing patterns, and narrative emphasis
• Identified differences in how the same geopolitical issue is presented across languages
• Produced a short OSINT-style analytical brief
Political Discourse Keyword Analysis (Jan 2026)
Tools: Python, Regex, spaCy, Jupyter, GitHub


Experience

War and Culture in Central Europe: Empire or Liberal Democracy? - Study abroad experience in Bucharest, Brașov, and Constanța, Romania (August 2026)Writing, English, and ESL Tutor Mesa Community College (October 2022 - Present)
• Worked with multilingual students across proficiency levels to analyze and improve written communication
• Identified recurring linguistic patterns and errors across ESL populations
• Provided structured feedback on argumentation, clarity, and language use
• Developed strong skills in explaining complex language concepts clearly and efficiently
Data Analyst Apprentice Global Career Accelerator (Coding for Data) (January 2026 - May 2026)
• Completed training in Python-based data analysis workflows
• Worked with structured datasets using Pandas
• Built foundational skills in data cleaning, analysis, and visualization
Writing Mentor (Internship) Arizona State University (December 2025 - May 2026)
• Supported students in course-embedded writing instruction
• Assisted with research, structure, and clarity in academic writing
• Reinforced analytical thinking and communication skills
Private ESL Tutor Freelance (August 2024 - May 2025)


Skills

Technical (Learning & Applied):
Python (text processing, tokenization, data analysis)
SQL (querying, joins, analytics)
NLP tools: SudachiPy, spaCy, MeCab, Hugging Face tokenizers
Jupyter, Pandas, Regex, Streamlit
AI/LLM evaluation concepts
Git/GitHub
Research and Analysis
OSINT & information environment analysis
CJK media monitoring
Linguistic analysis (morphology, syntax, discourse)
Political discourse analysis
Corpus analysis
Research design & literature review
Languages
English: Native
Spanish: Limited working proficiency
Mandarin: Developing proficiency
French, Italian, Br-Portuguese: Reading proficiency
Japanese, Korean: Light familiarity
Communication
3.5 years tutoring (writing, ESL)
Academic writing
Cross‑cultural communication
Presentation & public speaking


I’m open to research, OSINT, risk, and CJK‑focused tech rolesLet's get in contact: