data science history / data science @ NYTchris wiggins
talk delivered 2015-07-29 at ICERM workshop on "mathematics in data science"
workshop: https://icerm.brown.edu/topical_workshops/tw15-6-mds/
references: http://bit.ly/icerm
data science history / data science @ NYTchris wiggins
talk delivered 2015-07-29 at ICERM workshop on "mathematics in data science"
workshop: https://icerm.brown.edu/topical_workshops/tw15-6-mds/
references: http://bit.ly/icerm
"data: past, present, and future" day 1 lecture 2020-01-20chris wiggins
What should our future statisticians, senators, and CEOs know about the history and ethics of data? How might understanding that history provide tools and resources to future citizens navigating a future shaped by data empowered algorithms? We've developed a course that introduces students, without prerequisites, to a historical view of our present condition, in which data-empowered algorithms shape our personal, professional, and political realities. The course attempts to integrate critical data studies with functional engagement with data (in Python via Jupyter notebooks), and interleaves an applied view of ethics throughout. The intellectual arc traces from the 18th century to present day, beginning with examples of contemporary technological advances, disquieting ethical debates, and financial success powered by panoptic persuasion architectures.
Data! Action! Data journalism issues to watch in the next 10 yearsPaul Bradshaw
Keynote at the Nordic data journalism conference #NODA16 - an outline of issues facing data journalism which journalists and academics need to focus on in the next decade.
Váš život je jako zahrada, Zlepšete si svůj život sami, Naše přednášky v Městské knihovně v Praze, bydlení: Proměňte domov v úrodnou zahradu, TAO - pět fází proměny (pět elementů), byznys: Maget na zákazníky, Energie dní na týden od 19.9., Nová kniha: Poznej své pravé já
iBeggar.com is a website devoted to helping people present with personal plea for funds or help through Cyberbegging. iBeggar is a way to ask for monetary help for many diverse reasons whether it be for personal needs or to crowd fund an idea or business start up. The best thing about cyber begging is that no one knows who you are and there is no embarrassment allied with standing up in front of strangers. At iBeggar.com, you can not only beg but also find begging tips for better outcomes.
A partir del 2006, Wave es el estudio más grande sobre el impacto de los medios sociales en el mercado global de hoy en día. Cubriendo 62 países y el 43% de la población de Internet en todo el mundo, Wave 6 – The Business of Social' - es el estudio más ambicioso y sofisticado hasta la fecha. Ofrece una valiosa idea en cuanto a cómo los anunciantes pueden aportar un valor comercial a su negocio a través de estrategias de comunicación social.
"data: past, present, and future" day 1 lecture 2020-01-20chris wiggins
What should our future statisticians, senators, and CEOs know about the history and ethics of data? How might understanding that history provide tools and resources to future citizens navigating a future shaped by data empowered algorithms? We've developed a course that introduces students, without prerequisites, to a historical view of our present condition, in which data-empowered algorithms shape our personal, professional, and political realities. The course attempts to integrate critical data studies with functional engagement with data (in Python via Jupyter notebooks), and interleaves an applied view of ethics throughout. The intellectual arc traces from the 18th century to present day, beginning with examples of contemporary technological advances, disquieting ethical debates, and financial success powered by panoptic persuasion architectures.
Data! Action! Data journalism issues to watch in the next 10 yearsPaul Bradshaw
Keynote at the Nordic data journalism conference #NODA16 - an outline of issues facing data journalism which journalists and academics need to focus on in the next decade.
Váš život je jako zahrada, Zlepšete si svůj život sami, Naše přednášky v Městské knihovně v Praze, bydlení: Proměňte domov v úrodnou zahradu, TAO - pět fází proměny (pět elementů), byznys: Maget na zákazníky, Energie dní na týden od 19.9., Nová kniha: Poznej své pravé já
iBeggar.com is a website devoted to helping people present with personal plea for funds or help through Cyberbegging. iBeggar is a way to ask for monetary help for many diverse reasons whether it be for personal needs or to crowd fund an idea or business start up. The best thing about cyber begging is that no one knows who you are and there is no embarrassment allied with standing up in front of strangers. At iBeggar.com, you can not only beg but also find begging tips for better outcomes.
A partir del 2006, Wave es el estudio más grande sobre el impacto de los medios sociales en el mercado global de hoy en día. Cubriendo 62 países y el 43% de la población de Internet en todo el mundo, Wave 6 – The Business of Social' - es el estudio más ambicioso y sofisticado hasta la fecha. Ofrece una valiosa idea en cuanto a cómo los anunciantes pueden aportar un valor comercial a su negocio a través de estrategias de comunicación social.
The Secrets to Building a Dominant Media Property - Confab MN Central 2014Joe Pulizzi
Joe Pulizzi's presentation on how the Content Marketing Institute evolved and built it's media platform into the leading training and education destination for content marketing. Includes the editorial mission statement, leveraging collaborative publishing, a defined influencer strategy and focusing on the revenue goals.
Turbopromoter.net Video Marketing Services are completely immune to such updates and our customers continue to rank very highly within their chosen niche.
The workshop opens with a discussion of how to repurpose digital "methods of the medium" for social and cultural scholarly research, including its limitations, critiques and ethics. Subsequently participants are trained in using digital methods in hands-on sessions. How to use crawlers for dynamic URL sampling and issue network mapping? How to employ scrapers to create a bias or partisanship diagnostic instrument? We also consider how to deploy online platforms for social research. How to transform Wikipedia from an online encyclopaedia to a device for cross-cultural memory studies? How to make use of social media so as to profile the preferences and tastes of politicians’ friends, and also locate most engaged with content? How to make use of Twitter analytics to debanalize tweets, and provide compelling accounts of events on the ground? Finally, the workshop turns to the question of employing web data and metrics as societal indices more generally.
Introduction for skills seminar on Search and Data Mining, Master of European...Gerben Zaagsma
These are the slides for the introductory lecture that I gave as part of a skills seminar on Search and Data Mining (Luxembourg, 11 December 2014). The slides are rather visual and for the most part don’t include notes, yet I believe the gist of the talk will be clear. At the end links are included for tools, further reading and a link to the exercises we did.
Coding publics and code as ethnographic artefactDan Verständig
SY06: MAKER CULTURES , CODE AND PUBLICNESS
EXPLORING (POST --)DIGITAL SPHERES AND PRACTICES
Dan Verständig | Otto von Guericke University of Magdeburg/Germany
‘Going Public‘? Ethnography in Education and Social Work and Its Publics
October 31st - November 2nd, 2019 Martin Luther University Halle Wittenberg, Germany
Presentation from November 1st, 2019.
Spark Summit Europe: Share and analyse genomic data at scaleAndy Petrella
Share and analyse genomic data
at scale with Spark, Adam, Tachyon & the Spark Notebook
Sharp intro to Genomics data
What are the Challenges
Distributed Machine Learning to the rescue
Projects: Distributed teams
Research: Long process
Towards Maximum Share for efficiency
From Web Data to Knowledge: on the Complementarity of Human and Artificial In...Stefan Dietze
Inaugural lecture at Heinrich-Heine-University Düsseldorf on 28 May 2019.
Abstract:
When searching the Web for information, human knowledge and artificial intelligence are in constant interplay. On the one hand, human online interactions such as click streams, crowd-sourced knowledge graphs, semi-structured web markup or distributional semantic models built from billions of Web documents are informing machine learning and information retrieval models, for instance, as part of the Google search engine. On the other hand, the very same search engines help users in finding relevant documents, facts, or data for particular information needs, thereby helping users to gain knowledge. This talk will give an overview of recent work in both of the aforementioned areas. This includes 1) research on mining structured knowledge graphs of factual knowledge, claims and opinions from heterogeneous Web documents as well as 2) recent work in the field of interactive information retrieval, where supervised models are trained to predict the knowledge (gain) of users during Web search sessions in order to personalise rankings. Both streams of research are converging as part of online platforms and applications to facilitate access to data(sets), information and knowledge.
DN18 | From Counting to Connecting: A Networked and Data-Driven Approach to M...Dataconomy Media
Abstract of the Presentation:
This talk will focus on applications of knowledge graphs and network science to the exploration of distributed industrial capabilities relevant to support work on United Nations Sustainable Development Goals. The core message and tools presented during the talk are relevant beyond sustainable development and can be applied to inter-organisational collaboration projects, information flows within companies and innovation management.
About the Author:
Pedro Parraguez is the Co-founder of Dataverz, a data analytics company based in Copenhagen, and Postdoctoral Researcher at DTU Management Engineering. Pedro’s research and applied work focuses on complex socio-technical systems, with emphasis on network science and data-driven analyses. This includes the study and development of decision-making support for industrial clusters, complex organisations, and large engineering projects.
WLMA 14 Conference Keynote PPT - Paige Jaeger: Connecting Creatively with the CCPaige Jaeger
Washington Library Media Association Conference Keynote - It was my pleasure to share ways to challenge, reach and teach the Millennials at your conference! Carpe Diem! Let us think!
a mission-driven approach to personalizing the customer journeychris wiggins
Keynote talk at PyData NYC 2019 by Anne Bauer, Lead Data Scientist, The New York Times, and Chris Wiggins, Chief Data Scientist, The New York Times.
"Data science at The New York Times: a mission-driven approach to personalizing the customer journey"
How does The New York Times use data science to further its mission?
We'll talk about the use of machine learning throughout the company,
from social media promotion to targeted advertising to content
recommendations, and the cross-team collaborations that make it
possible.
Data Science at The New York Times: what industry can learn from us; what we ...chris wiggins
Keynote talk at RSG with DREAM 2019 | November 4-6, 2019 | New York, USA | HOME - RECOMB/ISCB RSG 2019; 12th annual RECOMB/ISCB Conference on Regulatory & Systems Genomics .
pecial Session on Cancer Systems Biology
Regulatory and Systems Genomics 2019 will include an abstract submissions track for a Special Session of Cancer Systems We welcome submissions on computational and experimental advances in the systems-level modeling of cancer. Topics include but are not limited to: regulatory programs and signaling pathways in cancer cells, tumor-immune interactions and the tumor microenvironment, developmental plasticity in tumors and epigenetic analyses, tumor metabolism, genetic and non-genetic sources of heterogeneity, drug response and precision oncology. The session will include presentations from keynote speakers as well as talks from selected abstracts. This special session is sponsored by the Research Center for Cancer Systems Immunology at Memorial Sloan Kettering Cancer Center, an NCI-funded Cancer Systems Biology Consortium (CSBC) Center.
slides uploaded by request
talk presented at the MIDAS seminar, University of Michigan, 2019-04-15. Video available via https://www.youtube.com/watch?v=c7t4LMkq_SU . For more information: https://midas.umich.edu/event/chris-wiggins/
abstract
The Data Science group at The New York Times develops and deploys
machine learning solutions to newsroom and business problems.
Re-framing real-world questions as machine learning tasks requires not
only adapting and extending models and algorithms to new or special
cases but also sufficient breadth to know the right method for the
right challenge. I'll first outline how unsupervised, supervised, and
reinforcement learning methods are increasingly used in human
applications for description, prediction, and prescription,
respectively. I'll then focus on the 'prescriptive' cases, showing how
methods from the reinforcement learning and causal inference
literatures can be of direct impact in engineering, business, and
decision-making more generally.
Talk delivered 2019-06-25 as part of the Summer Institute in Computational Social Science, held at Princeton University https://compsocialscience.github.io/summer-institute/2019/
Video: https://www.youtube.com/watch?v=0suLWheVji0
title:
what should future statisticians, CEO, and senators know about the
history and ethics of data?
abstract:
What should our future statisticians, senators, and CEOs know about the history and ethics of data?
How might understanding that history provide tools and resources to future citizens navigating a future shaped by data empowered algorithms?
I'll present content from a class co-developed over the past several years with Professor Matt Jones of Columbia's Department of History, based on material absent from both the curriculum for future technologists as well as for future humanists.
The intellectual arc traces from the 18th century to present day, beginning with examples of contemporary technological advances, disquieting ethical debates, and financial success powered by panoptic persuasion architectures.
Data: Past, Present, and Future (Cornell Digital Life Seminar on Data Literac...chris wiggins
Data-empowered algorithms are reshaping our professional, personal, and political realities.
However, existing curricula are predominantly designed either for future technologists, focusing on functional capabilities; or for future humanists, focusing on critical and rhetorical context surrounding data.
"Data: Past, Present, and Future" is a new course at Columbia which seeks to define a curriculum at present taught to neither group, yet of interest and utility to future statisticians, CEOs, and senators alike.
The intellectual arc traces from the 18th century to present day, beginning with examples of contemporary technological advances, disquieting ethical debates, and financial success powered by panoptic persuasion architectures.
The weekly cadence of the course pairs primary and secondary readings with Jupyter notebooks in Python, engaging directly with the data and intellectual advances under study.
Throughout, these intellectual technical advances are paired with critical inquiry into the forces which encouraged and benefited from these new capabilities, i.e., the political dimension of data and technology.
Syllabus, Jupyter notebooks, and additional info can be found via https://data-ppf.github.io/
"Data: Past, Present, and Future" is supported by the Columbia University Collaboratory Fellows Fund. Jointly founded by Columbia University’s Data Science Institute and Columbia Entrepreneurship, The Collaboratory@Columbia is a university-wide program dedicated to supporting collaborative curricula innovations designed to ensure that all Columbia University students receive the education and training that they need to succeed in today’s data rich world.
Data: Past, Present, and Future (Lecture 1, Spring 2018)chris wiggins
Slides from Lecture 1 of "Data: Past, Present, and Future",
Jan 17 2018.
New class on how data is impacting our professional, political, and personal realities. Taught by Profs Matt Jones and Chris Wiggins
lean + design thinking in building data productschris wiggins
talk given to "Columbia startup teams including 2016 CVC winners, Ignition Grant winners, TFP fellows, and ASCENT fellows" as part of a mini-bootcamp for founders at columbia's engineering school 2016-05-24
data science @NYT ; inaugural Data Science Initiative Lecturechris wiggins
inaugural Data Science Initiative Lecture @ Brown University
2015-12-04
https://www.eventbrite.com/e/data-science-at-the-new-york-times-tickets-19490272931
presentation for the NYCRIN / NSF meeting 2013-07-24,
showing 1st demo of leanworkbench.com,
open source tools to quantify and drive early stage startups,
with support from NSF award 1305023
Saudi Arabia stands as a titan in the global energy landscape, renowned for its abundant oil and gas resources. It's the largest exporter of petroleum and holds some of the world's most significant reserves. Let's delve into the top 10 oil and gas projects shaping Saudi Arabia's energy future in 2024.
Immunizing Image Classifiers Against Localized Adversary Attacksgerogepatton
This paper addresses the vulnerability of deep learning models, particularly convolutional neural networks
(CNN)s, to adversarial attacks and presents a proactive training technique designed to counter them. We
introduce a novel volumization algorithm, which transforms 2D images into 3D volumetric representations.
When combined with 3D convolution and deep curriculum learning optimization (CLO), itsignificantly improves
the immunity of models against localized universal attacks by up to 40%. We evaluate our proposed approach
using contemporary CNN architectures and the modified Canadian Institute for Advanced Research (CIFAR-10
and CIFAR-100) and ImageNet Large Scale Visual Recognition Challenge (ILSVRC12) datasets, showcasing
accuracy improvements over previous techniques. The results indicate that the combination of the volumetric
input and curriculum learning holds significant promise for mitigating adversarial attacks without necessitating
adversary training.
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdffxintegritypublishin
Advancements in technology unveil a myriad of electrical and electronic breakthroughs geared towards efficiently harnessing limited resources to meet human energy demands. The optimization of hybrid solar PV panels and pumped hydro energy supply systems plays a pivotal role in utilizing natural resources effectively. This initiative not only benefits humanity but also fosters environmental sustainability. The study investigated the design optimization of these hybrid systems, focusing on understanding solar radiation patterns, identifying geographical influences on solar radiation, formulating a mathematical model for system optimization, and determining the optimal configuration of PV panels and pumped hydro storage. Through a comparative analysis approach and eight weeks of data collection, the study addressed key research questions related to solar radiation patterns and optimal system design. The findings highlighted regions with heightened solar radiation levels, showcasing substantial potential for power generation and emphasizing the system's efficiency. Optimizing system design significantly boosted power generation, promoted renewable energy utilization, and enhanced energy storage capacity. The study underscored the benefits of optimizing hybrid solar PV panels and pumped hydro energy supply systems for sustainable energy usage. Optimizing the design of solar PV panels and pumped hydro energy supply systems as examined across diverse climatic conditions in a developing country, not only enhances power generation but also improves the integration of renewable energy sources and boosts energy storage capacities, particularly beneficial for less economically prosperous regions. Additionally, the study provides valuable insights for advancing energy research in economically viable areas. Recommendations included conducting site-specific assessments, utilizing advanced modeling tools, implementing regular maintenance protocols, and enhancing communication among system components.
About
Indigenized remote control interface card suitable for MAFI system CCR equipment. Compatible for IDM8000 CCR. Backplane mounted serial and TCP/Ethernet communication module for CCR remote access. IDM 8000 CCR remote control on serial and TCP protocol.
• Remote control: Parallel or serial interface.
• Compatible with MAFI CCR system.
• Compatible with IDM8000 CCR.
• Compatible with Backplane mount serial communication.
• Compatible with commercial and Defence aviation CCR system.
• Remote control system for accessing CCR and allied system over serial or TCP.
• Indigenized local Support/presence in India.
• Easy in configuration using DIP switches.
Technical Specifications
Indigenized remote control interface card suitable for MAFI system CCR equipment. Compatible for IDM8000 CCR. Backplane mounted serial and TCP/Ethernet communication module for CCR remote access. IDM 8000 CCR remote control on serial and TCP protocol.
Key Features
Indigenized remote control interface card suitable for MAFI system CCR equipment. Compatible for IDM8000 CCR. Backplane mounted serial and TCP/Ethernet communication module for CCR remote access. IDM 8000 CCR remote control on serial and TCP protocol.
• Remote control: Parallel or serial interface
• Compatible with MAFI CCR system
• Copatiable with IDM8000 CCR
• Compatible with Backplane mount serial communication.
• Compatible with commercial and Defence aviation CCR system.
• Remote control system for accessing CCR and allied system over serial or TCP.
• Indigenized local Support/presence in India.
Application
• Remote control: Parallel or serial interface.
• Compatible with MAFI CCR system.
• Compatible with IDM8000 CCR.
• Compatible with Backplane mount serial communication.
• Compatible with commercial and Defence aviation CCR system.
• Remote control system for accessing CCR and allied system over serial or TCP.
• Indigenized local Support/presence in India.
• Easy in configuration using DIP switches.
Cosmetic shop management system project report.pdfKamal Acharya
Buying new cosmetic products is difficult. It can even be scary for those who have sensitive skin and are prone to skin trouble. The information needed to alleviate this problem is on the back of each product, but it's thought to interpret those ingredient lists unless you have a background in chemistry.
Instead of buying and hoping for the best, we can use data science to help us predict which products may be good fits for us. It includes various function programs to do the above mentioned tasks.
Data file handling has been effectively used in the program.
The automated cosmetic shop management system should deal with the automation of general workflow and administration process of the shop. The main processes of the system focus on customer's request where the system is able to search the most appropriate products and deliver it to the customers. It should help the employees to quickly identify the list of cosmetic product that have reached the minimum quantity and also keep a track of expired date for each cosmetic product. It should help the employees to find the rack number in which the product is placed.It is also Faster and more efficient way.
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)MdTanvirMahtab2
This presentation is about the working procedure of Shahjalal Fertilizer Company Limited (SFCL). A Govt. owned Company of Bangladesh Chemical Industries Corporation under Ministry of Industries.
1. what is a computational biologist
doing at the New York Times?
!
(and what can academia do for a
163-year old company?)
chris.wiggins@columbia.edu
chris.wiggins@nytimes.com
chris.wiggins@hackNY.org
@chrishwiggins
50. data science: culture
- be communicative
(promote rhetorical literacy)
- related: strive to build models
which are both predictive and
interpretable
62. practices:
1. reframe questions as ML
2. better wrong than "nice"
3. be relevant
4. aim for hypothesis vs data jeapordy
5. befriend experimentalists
63. skills:
1. find quantifiables
2. straw man first
3. small wins before feature engineering
4. data engineering before data science
!
66. what is a computational biologist
doing at the New York Times?
!
(and what can academia do for a
163-year old company?)
chris.wiggins@columbia.edu
chris.wiggins@nytimes.com
chris.wiggins@hackNY.org
@chrishwiggins