2025/12/06 –, Main Stream 言語: English
Can data engineering be feminist? Absolutely. As data engineers and Python developers, the decisions we make about what data to collect, how we transform it, and who gets access carry ethical weight. This talk explores how we can apply feminist principles like transparency, consent, and equity to build more ethical data pipelines. Through real-world examples of harm caused by opaque and biased data systems, we’ll examine how even routine ETL work can reinforce injustice. We’ll then outline practical approaches for designing pipelines that center care and accountability covering techniques like schema minimization, anonymization in Python, and documenting data provenance. Attendees will leave with a deeper understanding of how to challenge extractive data practices and use their technical skills to push back against surveillance capitalism and systemic bias.
This talk begins by grounding the audience in the concept of feminist data engineering, drawing from data feminism, intersectional theory, and real-world cases where unethical data practices have caused harm such as biased recommendation systems or invasive tracking tools. We’ll examine how traditional data engineering workflows often prioritize efficiency, scale, and surveillance over consent, context, and care. From there, the talk transitions into a practical framework for ethical data design, introducing tools and techniques that Python engineers can use today. We’ll explore how to implement schema minimization, anonymization, and consent-aware collection in Python using libraries like Pandas and PyArrow. The talk also covers how to build transparency into ETL pipelines through audit logs and documentation practices. Throughout, I’ll include examples from my own experience working with legacy systems and modern pipelines to illustrate how ethical decisions show up in technical details. We’ll close by reflecting on how to advocate for ethical practices within teams and organizations, especially when trade-offs or pushback arise. This session is both a call to awareness and an invitation to action—for engineers who want to build systems that not only work, but do good.
Yashasvi Misra is a software engineer at Pure Storage, mentor, and advocate for ethical tech practices. She builds scalable data pipelines, with a focus on modernizing legacy systems and embedding transparency into data workflows. Outside of engineering, Yashasvi serves as Chair of the NumFOCUS Code of Conduct Working Group and actively mentors women in tech through PyLadies India and Women Techmakers. She’s passionate about creating inclusive communities, ethical engineering, and empowering others to lead with empathy in their code.
