NLP @ SCU

classroom

SCU-CSEN346

Activity Feed

AI & ML interests

NLP, AI, Math, Stats, Multimodal, AI4Good, Mentorship, Education

Organization Card

Community About org cards

CSEN 346 – Natural Language Processing (Santa Clara University)

Welcome to the official Hugging Face organization for CSEN 346: Natural Language Processing at Santa Clara University.

This space hosts the final course projects developed by students, including:

NLP/AI models
Datasets
Research papers
Evaluation benchmarks
Demo applications

The goal is to encourage open, reproducible, and responsible NLP research while giving students experience contributing to the research community.

Course Information

Course: CSEN 346 – Natural Language Processing
Institution: Santa Clara University
Instructor: Prof. Oana Ignat
Focus areas:

Large Language Models
Generative AI
Multimodal
Mutliagents
Multilingual
Responsible and Inclusive AI
Human-centered Evaluation
Applied NLP systems

Project Goals

Student projects aim to:

Apply NLP methods to real-world problems
Build reproducible research
Practice research communication
Contribute open resources (models/datasets)
Develop responsible AI systems

Projects typically include:

Problem motivation
Dataset description
Model architecture
Training setup
Evaluation methodology
Error analysis
Future work

Repository Structure

Each project should include:

Required

Project README
Research paper (ACL-style)
Model or dataset
Evaluation results

Documentation Standards

Projects should clearly describe:

Task

What problem does the project solve?

Data

Source
Size
Languages
Limitations

Model

Architecture
Parameters
Training setup

Evaluation

Metrics
Baselines
Results

Ethical Considerations

Bias risks
Limitations
Appropriate use

Responsible AI Guidelines

All projects must follow responsible AI practices:

No private or sensitive data
No harmful applications
Clear documentation of limitations
Proper dataset citations
Transparency in evaluation

Academic Integrity

All work must follow SCU academic integrity policies.

Students must:

Cite all external resources
Clearly indicate use of AI models
Document prompting or fine-tuning approaches
Clearly state team member contributions

Acknowledgment

This course emphasizes that:

Research is most valuable when it is shared, reproducible, and benefits others.

By publishing projects here, students contribute to the broader NLP community while building their research portfolio.

License

Unless otherwise specified, student projects are released for academic and research purposes only.

Contact

Course organization maintained by:

Dr. Oana Ignat
Assistant Professor
Computer Science and Engineering
Santa Clara University

models 0

None public yet

datasets 4