Slides reviewing the paper:
Vaswani, Ashish, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. "Attention is all you need." In Advances in Neural Information Processing Systems, pp. 6000-6010. 2017.
The dominant sequence transduction models are based on complex recurrent orconvolutional neural networks in an encoder and decoder configuration. The best performing such models also connect the encoder and decoder through an attentionm echanisms. We propose a novel, simple network architecture based solely onan attention mechanism, dispensing with recurrence and convolutions entirely.Experiments on two machine translation tasks show these models to be superiorin quality while being more parallelizable and requiring significantly less timeto train. Our single model with 165 million parameters, achieves 27.5 BLEU onEnglish-to-German translation, improving over the existing best ensemble result by over 1 BLEU. On English-to-French translation, we outperform the previoussingle state-of-the-art with model by 0.7 BLEU, achieving a BLEU score of 41.1.
This is the slide from my talk at FULokoja Ingressive meetup.
XGBoost is a decision-tree-based ensemble Machine Learning algorithm that uses a gradient boosting framework. In prediction problems involving unstructured and structured data (images, text, etc.) artificial neural networks tend to outperform all other algorithms or frameworks. However, when it comes to small-to-medium structured/tabular data, decision tree-based algorithms are considered best-in-class right now. XGBoost model has the best combination of prediction performance and processing time compared to other algorithms.
is anyone_interest_in_auto-encoding_variational-bayesNAVER Engineering
Deep generative model 중 하나인 VAE의 Framework은 컴퓨터 비전, 자연어 처리 등 머신러닝의 전반에서 generative model의 변화를 가져왔다.
VAE를 처음 접하는 연구자들을 위해 대부분의 VAE tutorial은 구현을 목적으로 Neural Network구조와 Loss function에 초점을 맞추고 있다. 본 세미나는 Variational Inference 관점에서 Auto-encoding variational bayes에 나오는 수식들을 살펴보고자 한다. 본 수식들이 구현에서는 어떻게 적용되는지도 살펴보고자 한다.
Meta-learning, or learning how to learn, is our innate ability to learn new, ever more complex tasks very efficiently by building on prior experience. It is a very exciting direction for machine learning (and AI in general). In this tutorial, I introduce the main concepts and state of the art.
cvpaper.challenge の Meta Study Group 発表スライド
cvpaper.challenge はコンピュータビジョン分野の今を映し、トレンドを創り出す挑戦です。論文サマリ・アイディア考案・議論・実装・論文投稿に取り組み、凡ゆる知識を共有します。2019の目標「トップ会議30+本投稿」「2回以上のトップ会議網羅的サーベイ」
http://xpaperchallenge.org/cv/
Deep Learning: Recurrent Neural Network (Chapter 10) Larry Guo
This Material is an in_depth study report of Recurrent Neural Network (RNN)
Material mainly from Deep Learning Book Bible, http://www.deeplearningbook.org/
Topics: Briefing, Theory Proof, Variation, Gated RNNN Intuition. Real World Application
Application (CNN+RNN on SVHN)
Also a video (In Chinese)
https://www.youtube.com/watch?v=p6xzPqRd46w
Using Java & Genetic Algorithms to Beat the MarketMatthew Ring
I presented this at JavaOne 2011 along with a demo of software that I've written. Since you won't see the demo, I added a few more slides to explain it.
Slides reviewing the paper:
Vaswani, Ashish, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. "Attention is all you need." In Advances in Neural Information Processing Systems, pp. 6000-6010. 2017.
The dominant sequence transduction models are based on complex recurrent orconvolutional neural networks in an encoder and decoder configuration. The best performing such models also connect the encoder and decoder through an attentionm echanisms. We propose a novel, simple network architecture based solely onan attention mechanism, dispensing with recurrence and convolutions entirely.Experiments on two machine translation tasks show these models to be superiorin quality while being more parallelizable and requiring significantly less timeto train. Our single model with 165 million parameters, achieves 27.5 BLEU onEnglish-to-German translation, improving over the existing best ensemble result by over 1 BLEU. On English-to-French translation, we outperform the previoussingle state-of-the-art with model by 0.7 BLEU, achieving a BLEU score of 41.1.
This is the slide from my talk at FULokoja Ingressive meetup.
XGBoost is a decision-tree-based ensemble Machine Learning algorithm that uses a gradient boosting framework. In prediction problems involving unstructured and structured data (images, text, etc.) artificial neural networks tend to outperform all other algorithms or frameworks. However, when it comes to small-to-medium structured/tabular data, decision tree-based algorithms are considered best-in-class right now. XGBoost model has the best combination of prediction performance and processing time compared to other algorithms.
is anyone_interest_in_auto-encoding_variational-bayesNAVER Engineering
Deep generative model 중 하나인 VAE의 Framework은 컴퓨터 비전, 자연어 처리 등 머신러닝의 전반에서 generative model의 변화를 가져왔다.
VAE를 처음 접하는 연구자들을 위해 대부분의 VAE tutorial은 구현을 목적으로 Neural Network구조와 Loss function에 초점을 맞추고 있다. 본 세미나는 Variational Inference 관점에서 Auto-encoding variational bayes에 나오는 수식들을 살펴보고자 한다. 본 수식들이 구현에서는 어떻게 적용되는지도 살펴보고자 한다.
Meta-learning, or learning how to learn, is our innate ability to learn new, ever more complex tasks very efficiently by building on prior experience. It is a very exciting direction for machine learning (and AI in general). In this tutorial, I introduce the main concepts and state of the art.
cvpaper.challenge の Meta Study Group 発表スライド
cvpaper.challenge はコンピュータビジョン分野の今を映し、トレンドを創り出す挑戦です。論文サマリ・アイディア考案・議論・実装・論文投稿に取り組み、凡ゆる知識を共有します。2019の目標「トップ会議30+本投稿」「2回以上のトップ会議網羅的サーベイ」
http://xpaperchallenge.org/cv/
Deep Learning: Recurrent Neural Network (Chapter 10) Larry Guo
This Material is an in_depth study report of Recurrent Neural Network (RNN)
Material mainly from Deep Learning Book Bible, http://www.deeplearningbook.org/
Topics: Briefing, Theory Proof, Variation, Gated RNNN Intuition. Real World Application
Application (CNN+RNN on SVHN)
Also a video (In Chinese)
https://www.youtube.com/watch?v=p6xzPqRd46w
Using Java & Genetic Algorithms to Beat the MarketMatthew Ring
I presented this at JavaOne 2011 along with a demo of software that I've written. Since you won't see the demo, I added a few more slides to explain it.
Option Pricing Models Lecture NotesThis week’s assignment is .docxhopeaustin33688
Option Pricing Models Lecture Notes:
This week’s assignment is quite complex. Keep in mind that the theory behind these pricing models is the important thing to remember for this week’s assignment.
If you feel the need to understand the Black Scholes (BSOPM) model in greater detail, I direct you to and http://en.wikipedia.org/wiki/Black_Scholes.
The models we discuss this week can be used via MS Excel templates, which you will find uploaded to the course content section of our classroom under this week’s folder. There is also an alternative calculator, courtesy of 888options.com located at the Binomial & Black Scholes Calculator link. I strongly encourage you to try these out to get a feel for how the different variables play into the final determination of pricing.
1. Binomial options pricing model
In finance, the binomial options pricing model provides a generalisable numerical method for the valuation of options. The binomial model was first proposed by Cox, Ross and Rubinstein (1979). Essentially, the model uses a "discrete-time" model of the varying price over time of the underlying financial instrument. Option valuation is then via application of therisk neutrality assumption over the life of the option, as the price of the underlying instrument evolves.
Use of the model
The Binomial options pricing model approach is widely used as it is able to handle a variety of conditions for which other models cannot easily be applied. This is largely because the BOPM models the underlying instrument over time - as opposed to at a particular point. For example, the model is used to value American options which can be exercised at any point and Bermudan options which can be exercised at various points.
The model is also relatively simple, mathematically, and can therefore be readily implemented in a software (or even spreadsheet) environment. Although slower than the Black-Scholes model, it is considered more accurate, particularly for longer-dated options, and options on securities with dividend payments. For these reasons, various versions of the binomial model are widely used by practitioners in the options markets.
For options with several sources of uncertainty (e.g. real options), or for options with complicated features (e.g. Asian options), lattice methods face several difficulties and are not practical. Monte Carlo option models are generally used in these cases. Monte Carlo simulation is, however, time-consuming in terms of computation, and is not used when the Lattice approach (or a formula) will suffice. See Monte Carlo methods in finance.
Methodology
The binomial pricing model uses a "discrete-time framework" to trace the evolution of the option's key underlying variable via a binomial lattice (tree), for a given number of time steps between valuation date and option expiration.
Each node in the lattice represents a possible price of the underlying, at a particular point in time. This price evolution forms the basis for t.
Applying Deep Learning to Enhance Momentum Trading Strategies in StocksLawrence Takeuchi
Contact author: larrytakeuchi@gmail.com
Abstract
We use an autoencoder composed of stacked restricted Boltzmann machines to extract
features from the history of individual stock prices. Our model is able to discover an enhanced version of the momentum effect in stocks without extensive hand-engineering of input features and deliver an annualized return of 45.93% over the 1990-2009 test period
versus 10.53% for basic momentum.
Using Large Language Models in 10 Lines of CodeGautier Marti
Modern NLP models can be daunting: No more bag-of-words but complex neural network architectures, with billions of parameters. Engineers, financial analysts, entrepreneurs, and mere tinkerers, fear not! You can get started with as little as 10 lines of code.
Presentation prepared for the Abu Dhabi Machine Learning Meetup Season 3 Episode 3 hosted at ADGM in Abu Dhabi.
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Albert Hoitingh
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview. Including the concepts of Customer Key and Double Key Encryption.
Neuro-symbolic is not enough, we need neuro-*semantic*Frank van Harmelen
Neuro-symbolic (NeSy) AI is on the rise. However, simply machine learning on just any symbolic structure is not sufficient to really harvest the gains of NeSy. These will only be gained when the symbolic structures have an actual semantics. I give an operational definition of semantics as “predictable inference”.
All of this illustrated with link prediction over knowledge graphs, but the argument is general.
JMeter webinar - integration with InfluxDB and GrafanaRTTS
Watch this recorded webinar about real-time monitoring of application performance. See how to integrate Apache JMeter, the open-source leader in performance testing, with InfluxDB, the open-source time-series database, and Grafana, the open-source analytics and visualization application.
In this webinar, we will review the benefits of leveraging InfluxDB and Grafana when executing load tests and demonstrate how these tools are used to visualize performance metrics.
Length: 30 minutes
Session Overview
-------------------------------------------
During this webinar, we will cover the following topics while demonstrating the integrations of JMeter, InfluxDB and Grafana:
- What out-of-the-box solutions are available for real-time monitoring JMeter tests?
- What are the benefits of integrating InfluxDB and Grafana into the load testing stack?
- Which features are provided by Grafana?
- Demonstration of InfluxDB and Grafana using a practice web application
To view the webinar recording, go to:
https://www.rttsweb.com/jmeter-integration-webinar
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Tobias Schneck
As AI technology is pushing into IT I was wondering myself, as an “infrastructure container kubernetes guy”, how get this fancy AI technology get managed from an infrastructure operational view? Is it possible to apply our lovely cloud native principals as well? What benefit’s both technologies could bring to each other?
Let me take this questions and provide you a short journey through existing deployment models and use cases for AI software. On practical examples, we discuss what cloud/on-premise strategy we may need for applying it to our own infrastructure to get it to work from an enterprise perspective. I want to give an overview about infrastructure requirements and technologies, what could be beneficial or limiting your AI use cases in an enterprise environment. An interactive Demo will give you some insides, what approaches I got already working for real.
Connector Corner: Automate dynamic content and events by pushing a buttonDianaGray10
Here is something new! In our next Connector Corner webinar, we will demonstrate how you can use a single workflow to:
Create a campaign using Mailchimp with merge tags/fields
Send an interactive Slack channel message (using buttons)
Have the message received by managers and peers along with a test email for review
But there’s more:
In a second workflow supporting the same use case, you’ll see:
Your campaign sent to target colleagues for approval
If the “Approve” button is clicked, a Jira/Zendesk ticket is created for the marketing design team
But—if the “Reject” button is pushed, colleagues will be alerted via Slack message
Join us to learn more about this new, human-in-the-loop capability, brought to you by Integration Service connectors.
And...
Speakers:
Akshay Agnihotri, Product Manager
Charlie Greenberg, Host
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
Key Trends Shaping the Future of Infrastructure.pdfCheryl Hung
Keynote at DIGIT West Expo, Glasgow on 29 May 2024.
Cheryl Hung, ochery.com
Sr Director, Infrastructure Ecosystem, Arm.
The key trends across hardware, cloud and open-source; exploring how these areas are likely to mature and develop over the short and long-term, and then considering how organisations can position themselves to adapt and thrive.
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Ramesh Iyer
In today's fast-changing business world, Companies that adapt and embrace new ideas often need help to keep up with the competition. However, fostering a culture of innovation takes much work. It takes vision, leadership and willingness to take risks in the right proportion. Sachin Dev Duggal, co-founder of Builder.ai, has perfected the art of this balance, creating a company culture where creativity and growth are nurtured at each stage.
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...UiPathCommunity
💥 Speed, accuracy, and scaling – discover the superpowers of GenAI in action with UiPath Document Understanding and Communications Mining™:
See how to accelerate model training and optimize model performance with active learning
Learn about the latest enhancements to out-of-the-box document processing – with little to no training required
Get an exclusive demo of the new family of UiPath LLMs – GenAI models specialized for processing different types of documents and messages
This is a hands-on session specifically designed for automation developers and AI enthusiasts seeking to enhance their knowledge in leveraging the latest intelligent document processing capabilities offered by UiPath.
Speakers:
👨🏫 Andras Palfi, Senior Product Manager, UiPath
👩🏫 Lenka Dulovicova, Product Program Manager, UiPath
2. Engineer Summary
● The normal assumption in Black-Scholes
option pricing formula is outdated, duh.
● Generative Adversarial Networks (GANs) are
the new golden standard for simulation.
● Here is the story of how 2 data scientists
(inc. a former trader) deployed a GAN for
option pricing in real-time, in 10 days.
3. 1. A Short History of Option Pricing
2. GANs for Time Series Generation
3. Option Pricing in Real-Time (demo!)
Agenda
5. Options: contract that gives the bearer the right to buy or sell a stock at a given date for a given price
- Call: how much is the right to buy XYZ at 100 USD in 30 days worth?
- Put: how much is the right to sell XYZ at 100 USD in 30 days worth?
Subtleties in pricing such as
- Type of options (European/American)
- Dividends
Why is it so important to get the value right?
- To be competitive when a bid is coming from the market
- To avoid being the target of arbitrageurs...
Options 101
6. Black and Scholes to determine the fair price of the theoretical value for an option based on 6 parameters:
- Type of options (american/european)
- risk-free rate
- strike price
- underlying stock price
- time
- Volatility
Hypothesis: a stock follows a random walk where the volatility of the stock plays a big part.
The Black–Scholes–Merton model
7. If assumption is made that prices are distributed log normally - which may or not be true - then log
returns are conveniently normally distributed.
Volatility is a measure of the standard deviation of the log returns.
Modeling Stock Prices & Returns
8. In the Black-Scholes model, asset price St is assumed to follow a Geometric Brownian motion (GBM) as
defined by the stochastic differential equation
The Formula of Robert Brown et al.
9. Option Payoffs 101
Call: In exchange for a premium, the bearer has the right to buy the stock at the expiration date for a given price K.
Put: In exchange for a premium, the bearer has the right to sell the stock at the expiration date for a given price K.
10. 1. Simulate a high number of trajectories for a stock
2. Compute the pay-off for each of the trajectory
3. Fair price as the average of the pay-offs
Just Monte Carlo it.
11. Does the distribution of returns really follow a normal distribution…? Not really.
Lots of improvement and more complex assumption to get better (ARCH/GARCH etc). But still.
How to Evaluate the Fair Price of an Option?
13. GAN is a Deep Learning model that aims to generate examples of real-world data that
are as realistic as possible.
Very famous recently for image generation.
Generative Adversarial Network 101
Fake or real ?
14. Getting Rid of the Normal Assumption
● Let’s try to use Deep learning to get something that is independent of human
interpretation:
● The main focus for GAN is to generate data from scratch
● It is composed of two networks - a generator and a discriminator
● The generator has to produce some times series of log-returns realistic enough
to fool the discriminator trained to recognize real log-returns times series.
15. Daily historic prices of SP500 stocks between 2002 and 2017:
1. Create chunk of time series of size 50. Those are our
real examples
2. Generate some fake times series (of size 50) starting
from noise - or can we be smarter?
3. Train a discriminator to flag the real from the fakes
4. Stop when the fake examples seem real enough
GAN GAN style plan
16. Tip: do not start from pure random noise but … from a GBM because it is what you want to improve.
A Simple Architecture to Demo the Approach
1. The Generator
17. A generator with 3 dense layers
Importance of the LeakyRelu activation function
Do not forget the dropout
A Simple Architecture to Demo the Approach
2. The Discriminator
21. GAN tricks for smoother training => read the repo ganhacks by Soumith Chintala
Try some more complex architecture to have an even better times series generator:
- RGAN or RCGAN very appropriate for a stochastic use case (LSTM instead of Dense)
- Have the generator learn more directly from the discriminator: Wasserstein GAN
- Add some Gradient Penalty to the generator to prevent it from being too smart…
Also we did not take the time to implement some sort of error metric to see how much better we improve the
generation from one model to another: RMSE should be a good starting point.
Going Further
23. Conclusions
From English data
and Japanese affix –iku (育)
“To raise or bring up; to grow up”
Literally,”Data Education”
or ”Let’s Grow the Data skills”
データ
育