A Miner’s Drift
A Journal of Occasional Explorations
Joseph Starwood – Digital Advisor, Geologist, & Geophysicist
Volume 01
Issue 01
2018-MAR-27
Deep Learning &
Artificial Intelligence
in Mining
It starts with the data
Mining presents unique challenges to data scientists. They must walk a
thin line between preserving scientific objectivity and extracting the
greatest benefit from data. However, in specific situations, data
scientists must understand the source data including its structure and
the transformations performed upon it. Without that understanding,
artificial intelligence efforts yield poor or even meaningless results.
2018-MAR-27 A Miner’s Drift Volume 01 Issue 01 Copyright 2018 All rights reserved
Mining professionals rely on a
staggering volume and variety of
data. From exploration through
extraction and beneficiation to final
shipment, miners utilize devices,
systems, and applications that
produce a wide range of data. This
data may include geologic
observations, geochemical results,
geophysical measurements,
engineering criteria, assay results,
mine block models, mine planning
details, production yields, financials,
and more.
This assortment of data presents the
usual problems to data scientists.
These include vendor-proprietary
data schemas and exchange formats,
custom-developed data schemas and
exchange formats, structural and
semantic differences across schemas
and formats, missing data values,
and so on. Through experience, data
scientists are well-prepared to deal
with these problems.
GEOSCIENTIFIC DATA
Geoscientific data, however, presents
a unique challenge. Geologists,
geochemists, and geophysicists
observe and measure many different
rock types, alteration types,
minerals, and elements.
To accommodate this, they utilize
applications and databases that have
a very high degree of data field reuse.
In other words, a given field may
mean one thing in the first record
and another thing in the second
record.
To the geoscientist, this is very
familiar and convenient. But, for the
data scientist, it is particularly vexing.
THE SOURCE DATA PROBLEM
Let’s look at this more closely. Say
you are creating a new artificial
intelligence (AI) solution to help a
mining company find better
exploratory drilling targets.
Suppose that the core log database
records the lithology, the three most
prevalent alteration types with their
intensities, and the five most
prevalent elements and their values
among many other data items.
The data record fields would look
something like this:
Core ID, . . . Lithology, AltType1,
AltType1-Int, AltType2, AltType2-Int,
AltType3, AltType3-Int, Elem1,
Elem1Value, Elem2, Elem2Value,
Elem3, Elem3Value, Elem4,
Elem4Value, Elem5, Elem5Value, . . .
In the first record, the lithology is
Greywacke. The most prevalent
alteration is Tourmaline, which has a
low intensity. Gold is the most
prevalent element at 20 ppm. The
remaining four elements in order are
Silicon, Sulphur, Arsenic, and Iron.
In the next record, we have Schist.
This time, the most prevalent
alteration is Silica, which has a high
intensity. The second most prevalent
alteration is Chloritization, which has
a moderate intensity. Silicon is the
most prevalent element at 35%; Chlorine,
Sulphur, Iron, and Copper follow.
That AltType1 is ‘Tourmaline’ and
AltType1-Int is ‘low’ in the first record
while AltType1 is ‘Silica’ and
AltType1-Int is ‘high’ in the second
record presents a real problem: the
same field holds the intensity of a
different alteration in each record.
Comparing the raw source data in
such records is like comparing apples
to oranges. This data is not ready for
use in an artificial intelligence or
advanced analytics solution.
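
The mismatch can be made concrete with a minimal Python sketch of the two records described above. The Core IDs are hypothetical, the dictionary layout is only an assumption about how such a record might be loaded, and values the text does not give are left as None.

```python
# Two raw core-log records in the reused-field layout described above.
# Core IDs are hypothetical; values not stated in the text are None.
raw_records = [
    {
        "CoreID": "C-001",
        "Lithology": "Greywacke",
        "AltType1": "Tourmaline", "AltType1-Int": "low",
        "AltType2": None, "AltType2-Int": None,
        "AltType3": None, "AltType3-Int": None,
        "Elem1": "Gold", "Elem1Value": "20 ppm",
        "Elem2": "Silicon", "Elem2Value": None,
        "Elem3": "Sulphur", "Elem3Value": None,
        "Elem4": "Arsenic", "Elem4Value": None,
        "Elem5": "Iron", "Elem5Value": None,
    },
    {
        "CoreID": "C-002",
        "Lithology": "Schist",
        "AltType1": "Silica", "AltType1-Int": "high",
        "AltType2": "Chloritization", "AltType2-Int": "moderate",
        "AltType3": None, "AltType3-Int": None,
        "Elem1": "Silicon", "Elem1Value": "35%",
        "Elem2": "Chlorine", "Elem2Value": None,
        "Elem3": "Sulphur", "Elem3Value": None,
        "Elem4": "Iron", "Elem4Value": None,
        "Elem5": "Copper", "Elem5Value": None,
    },
]


def column(records, field):
    """Read one field down all records, as a naive analysis would."""
    return [r.get(field) for r in records]


def slot_meanings(records, name_field):
    """List the distinct quantities that share one reused slot."""
    return sorted({r[name_field] for r in records if r.get(name_field)})


# The column looks like one feature...
print(column(raw_records, "AltType1-Int"))
# ...but it describes a different alteration type in each record.
print(slot_meanings(raw_records, "AltType1"))
print(slot_meanings(raw_records, "Elem1"))
```

A model that treats AltType1-Int as a single feature would be mixing Tourmaline intensities with Silica intensities, and Elem1Value would be mixing Gold in ppm with Silicon in percent.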
FIXING THE PROBLEM
Ideally, records should have fields for
each alteration type and each
element in which the respective
values are stored. This logic applies
to any reused data field in the record.
To fix this source data problem, each
data value must be read and then
written to a new data record such
that the value goes into the field
intended for that specific data. This
process is referred to as ‘flattening’
the data. The new data record is
referred to as ‘flattened’ data.
The data record fields would look
something like this:
Core ID, . . . Lithology, ChorAlt, . . .
EpidAlt, OxidAlt, PyriAlt, SerpAlt,
SiliAlt, . . . TourAlt, . . . Ac, Ag, Al, Am, Ar,
As, At, Au, . . . Yb, Zn, Zr, . . .
There are 94 naturally occurring
elements. A given mining
environment may exhibit a dozen
alteration types. As you can see,
flattening the data results in each
data record having perhaps 200+
fields.
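
The flattening step described above can be sketched in Python as follows. This is an illustration under stated assumptions, not the method used on any particular project: the lookup tables are hypothetical and deliberately incomplete, the Core IDs are invented, and unit normalization (ppm versus percent) is not attempted.

```python
# Hypothetical lookup tables from the names found in reused slots to the
# dedicated fields of the flattened record. A geologist or geotechnician
# would supply the complete, site-specific versions.
ALT_FIELDS = {
    "Chloritization": "ChorAlt",
    "Silica": "SiliAlt",
    "Tourmaline": "TourAlt",
}
ELEM_FIELDS = {
    "Gold": "Au", "Silicon": "Si", "Sulphur": "S", "Arsenic": "As",
    "Iron": "Fe", "Chlorine": "Cl", "Copper": "Cu",
}


def flatten(raw, n_alt=3, n_elem=5):
    """Write each reused-slot value into the field intended for it."""
    flat = {"CoreID": raw["CoreID"], "Lithology": raw["Lithology"]}
    # Start every dedicated field empty so all records share one layout.
    for field in list(ALT_FIELDS.values()) + list(ELEM_FIELDS.values()):
        flat[field] = None
    # Alteration slots: AltType1..AltTypeN with their intensities.
    for i in range(1, n_alt + 1):
        name = raw.get(f"AltType{i}")
        if name is not None:
            # An unknown name raises KeyError rather than being guessed.
            flat[ALT_FIELDS[name]] = raw.get(f"AltType{i}-Int")
    # Element slots: Elem1..ElemN with their values.
    for i in range(1, n_elem + 1):
        name = raw.get(f"Elem{i}")
        if name is not None:
            flat[ELEM_FIELDS[name]] = raw.get(f"Elem{i}Value")
    return flat


raw_1 = {"CoreID": "C-001", "Lithology": "Greywacke",
         "AltType1": "Tourmaline", "AltType1-Int": "low",
         "Elem1": "Gold", "Elem1Value": "20 ppm"}
raw_2 = {"CoreID": "C-002", "Lithology": "Schist",
         "AltType1": "Silica", "AltType1-Int": "high",
         "AltType2": "Chloritization", "AltType2-Int": "moderate",
         "Elem1": "Silicon", "Elem1Value": "35%"}

flat_1 = flatten(raw_1)
flat_2 = flatten(raw_2)
print(flat_1["TourAlt"], flat_2["TourAlt"])
print(flat_1["SiliAlt"], flat_2["SiliAlt"])
```

After flattening, TourAlt means the same thing in every record. Note that None here only says the item was not recorded in one of the reused slots; it should not be read as a measured zero.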
Deciphering the source data records
is not easy. Mining and geosciences
have domain-specific languages and
notations. Partnering with a
geologist or geotechnician helps the
data scientist accelerate the data
flattening and ensure correctness.
BETTER ARTIFICIAL INTELLIGENCE
Results from several mining projects
around the world demonstrate the
effect of raw versus flattened data
on artificial intelligence outcomes.
The typical artificial intelligence
solution utilized Microsoft Azure,
including Machine Learning, Blob
Storage, SQL Database, and Power
BI.
The precision and accuracy for
predicting high-grade ore were in the
mid-70 percent range for the raw
(non-flattened) data. Here the non-
reused fields appear to have played a
role. For the flattened data, the
precision and accuracy for predicting
high-grade ore were in the high-80
percent range.
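
For readers less familiar with the two measures, the sketch below shows how precision and accuracy could be computed for a binary high-grade / not-high-grade prediction. The labels are invented purely for illustration and are not data from the projects mentioned above.

```python
def precision_and_accuracy(actual, predicted):
    """Compute precision and accuracy for boolean labels.

    precision = true positives / all predicted positives
    accuracy  = correct predictions / all predictions
    """
    pairs = list(zip(actual, predicted))
    tp = sum(1 for a, p in pairs if a and p)
    fp = sum(1 for a, p in pairs if not a and p)
    tn = sum(1 for a, p in pairs if not a and not p)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    return precision, accuracy


# Invented labels: True means "high-grade ore".
actual = [True, True, True, False, False, False, False, False, False, False]
predicted = [True, True, False, True, False, False, False, False, False, False]

precision, accuracy = precision_and_accuracy(actual, predicted)
print(f"precision={precision:.2f} accuracy={accuracy:.2f}")
```

Reporting both is useful because the two can diverge: accuracy counts every correct call, while precision looks only at the samples the model flagged as high-grade.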
CONCLUSION
Artificial intelligence can play a powerful role in mining. The results
depend upon the completeness and correctness of the input data. In
many cases, the raw data must first be transformed to make it amenable
to deep learning and artificial intelligence. Data scientists and
geoscientists must collaborate to correctly transform the data and
achieve meaningful results.
During his early career, Joseph worked as an exploration geologist and
geophysicist. He focused on gold, silver, copper, and molybdenum deposits in
the western United States. His work included Kuroko style gold systems. He
also worked in various capacities for coal mining, oil & gas, and heavy
construction companies.
Joseph has presented at:
▪ InfraGard Great Lakes Regional Conference – 2014
▪ Society for Mining, Metallurgy, and Exploration Annual Meeting – 2014
▪ Great Lakes Software Excellence Conference (GLSEC) – 2011 & 2013
▪ Calvin College Colloquium Series – 2010
▪ Integrate: Uniting the World of IT Symposium – 2010
▪ SEI Architecture Technology User Network (SATURN) – 2019
▪ Financial Services Solutions Symposium (FSSS) – 2018 & 2019
He also contributed to:
• Software Engineering Institute (SEI) – Hard Problems in SOA Workshop – 2018
Contact the Author:
• LinkedIn: www.linkedin.com/in/josephstarwood
• e-Mail: JosephStarwood@JosephStarwood.com
Known within Microsoft Corporation as 'The Mining
Guy', Joseph Starwood is a Digital Advisor who helps
mining companies, equipment suppliers, and service
providers extract value from their information
technology (IT) and operational technology (OT)
investments and assets. He co-authored the Mining
Book of Dreams as well as the Microsoft Upstream
Reference Architecture for Mining (MURA-Mining).
PLEASE NOTE
The author is an employee of Microsoft Corporation. The views, thoughts, and opinions expressed in this text belong wholly to the author, and do not necessarily reflect those of anyone else (the author’s employer, or any other group or individual).
All product names, logos, and brands are property of their respective owners. All company, product and service names used in this text are for identification purposes only. Use of these names, logos, and brands does not imply endorsement.
The information in this text is provided without representations or warranties, express or implied.
Thank You!
