Pub/Sub model, msm, and asio
Takatoshi Kondo
2016/7/23

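Agenda
• What is the Pub/Sub model?
• Connections and threads
• Two kinds of scalability
• Broker state management and deferred event processing
• The mutual exclusion that msm requires
• io_service post() and execution order
• async_write and strand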

About me
• Takatoshi Kondo
• Handle: redboltz
• msgpack-c committer
– https://github.com/msgpack/msgpack-c
• Developer of mqtt_client_cpp, an MQTT client for C++
– https://github.com/redboltz/mqtt_client_cpp
• Developing a scalable broker that extends MQTT, at work
• Planning to attend CppCon 2016

What is the Pub/Sub model?
Logical view
[Diagram: publisher 1 and publisher 2 publish messages ("hello", "world") to topic A and topic B; subscriber 1 and subscriber 2 subscribe to topics and receive what is published to them.]

What is the Pub/Sub model?
Physical(?) layout
[Diagram: the publisher and the subscriber are both clients; each client holds a connection to the broker, a node that holds the topics.]

Connections and threads
[Diagram: a broker that assigns one worker thread to each client connection.]
The cost of context switches grows.

Connections and threads
[Diagram: a broker that handles every client connection with a boost::asio::io_service running on 1 thread.]

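io_service
    #include <iostream>
    #include <boost/asio.hpp>

    int main() {
        boost::asio::io_service ios;
        ios.post([]{ std::cout << __LINE__ << std::endl; });
        ios.post([]{ std::cout << __LINE__ << std::endl; });
        ios.post([]{ std::cout << __LINE__ << std::endl; });
        ios.post([]{ std::cout << __LINE__ << std::endl; });
        ios.post([]{ std::cout << __LINE__ << std::endl; });
        ios.run();
    }
http://melpon.org/wandbox/permlink/MzfsrLNdJjfAeV15
Output:
    6
    7
    8
    9
    10
Various kinds of work (network I/O, timers, serial ports, signal handling, etc.) are posted to the io_service.
run() keeps executing handlers until no events remain.
http://www.boost.org/doc/html/boost_asio/reference.html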

io_service
    #include <iostream>
    #include <boost/asio.hpp>

    int main() {
        boost::asio::io_service ios;
        ios.post([&ios]{
            std::cout << __LINE__ << std::endl;
            ios.post([&ios]{
                std::cout << __LINE__ << std::endl;
                ios.post([&ios]{
                    std::cout << __LINE__ << std::endl;
                });
            });
        });
        ios.run();
    }
http://melpon.org/wandbox/permlink/lXbFTVurVNUXM8BZ
Output:
    7
    9
    11
A handler can post the next request from within itself.
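To make the "run until no work remains" behavior concrete, here is a minimal sketch of a task queue in plain standard C++. It is a hand-rolled stand-in and not Boost.Asio: task_queue, its post() and run(), and nested_post_demo() are hypothetical names chosen for illustration, and everything runs on the calling thread.

```cpp
#include <cassert>
#include <cstddef>
#include <deque>
#include <functional>
#include <utility>
#include <vector>

// Hypothetical stand-in for io_service: a FIFO of handlers run on the calling thread.
class task_queue {
public:
    // Queue a handler; nothing runs until run() is called.
    void post(std::function<void()> f) { tasks_.push_back(std::move(f)); }

    // Run handlers in FIFO order until none remain; returns how many ran.
    std::size_t run() {
        std::size_t n = 0;
        while (!tasks_.empty()) {
            auto f = std::move(tasks_.front());
            tasks_.pop_front();
            f();  // the handler may call post() and add more work
            ++n;
        }
        return n;
    }

private:
    std::deque<std::function<void()>> tasks_;
};

// Three nested posts, mirroring the structure of the slide's example.
// Returns the order in which the handlers ran.
std::vector<int> nested_post_demo() {
    task_queue q;
    std::vector<int> order;
    q.post([&] {
        order.push_back(1);
        q.post([&] {
            order.push_back(2);
            q.post([&] { order.push_back(3); });
        });
    });
    q.run();
    return order;
}
```

With a single thread draining the queue, the handlers run in the order they were posted, so run() returns only after the innermost handler has finished.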

Two kinds of scalability
• Multi-thread
• Multi-node (multi-server)

Scaling out to multiple threads
[Diagram: a broker that handles every client connection with a boost::asio::io_service running on 1 thread.]
We want to make effective use of the CPU cores.

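    #include <iostream>
    #include <thread>
    #include <boost/asio.hpp>

    int main() {
        boost::asio::io_service ios;
        ios.post([]{ std::cout << __LINE__ << std::endl; });
        ios.post([]{ std::cout << __LINE__ << std::endl; });
        ios.post([]{ std::cout << __LINE__ << std::endl; });
        ios.post([]{ std::cout << __LINE__ << std::endl; });
        ios.post([]{ std::cout << __LINE__ << std::endl; });
        std::vector<std::thread> ths;
        ths.emplace_back([&ios]{ ios.run(); });
        ths.emplace_back([&ios]{ ios.run(); });
        ths.emplace_back([&ios]{ ios.run(); });
        ths.emplace_back([&ios]{ ios.run(); });
        ths.emplace_back([&ios]{ ios.run(); });
        for (auto& t : ths) t.join();
        std::cout << "finished" << std::endl;
    }
http://melpon.org/wandbox/permlink/z5bQJYgO23tvM9XF
Output:
    8
    9
    10
    11
    7
    finished
The execution order can differ from the post order.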

[Diagram: a broker in which several threads run a single io_service (ios) that serves all client connections.]

Pub/Sub model and locks
[Diagram: a topic keeps its subscribers in subscribers_.]
• subscribe / unsubscribe: modify subscribers_ under an exclusive lock
• publish: deliver to the matching subscribers under a shared lock
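The locking rule above can be sketched with std::shared_mutex from the C++17 standard library. This is an illustration only: subscription_table, the integer subscriber ids, and the std::multimap container are assumptions made for the example, whereas the slides use boost::shared_lock and a container indexed by tag_topic.

```cpp
#include <cassert>
#include <map>
#include <mutex>
#include <shared_mutex>
#include <string>
#include <vector>

// Hypothetical subscription table; all names are illustrative.
class subscription_table {
public:
    void subscribe(std::string const& topic, int subscriber_id) {
        // Exclusive lock: the table is modified.
        std::unique_lock<std::shared_mutex> guard(mtx_);
        subscribers_.emplace(topic, subscriber_id);
    }

    void unsubscribe(std::string const& topic, int subscriber_id) {
        // Exclusive lock: the table is modified.
        std::unique_lock<std::shared_mutex> guard(mtx_);
        auto r = subscribers_.equal_range(topic);
        for (auto it = r.first; it != r.second; ++it) {
            if (it->second == subscriber_id) {
                subscribers_.erase(it);
                break;
            }
        }
    }

    // Returns the ids a publish to `topic` would be delivered to.
    std::vector<int> publish(std::string const& topic) const {
        // Shared lock: the table is only read, so publishes may overlap.
        std::shared_lock<std::shared_mutex> guard(mtx_);
        std::vector<int> targets;
        auto r = subscribers_.equal_range(topic);
        for (auto it = r.first; it != r.second; ++it) {
            targets.push_back(it->second);
        }
        return targets;
    }

private:
    mutable std::shared_mutex mtx_;
    std::multimap<std::string, int> subscribers_;
};
```

Several threads can call publish() at the same time, while a subscribe() or unsubscribe() waits until it has the table to itself.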

Scaling out to multiple nodes
[Diagram: clients reach several web servers through a load balancer.]
Servers that close the connection after every request, such as web servers, are simple to scale out.

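[Diagram: clients, including a publisher and a subscriber, connect to several brokers through a load balancer or dispatcher; the publisher sends "hello".]
Because Pub/Sub communication is connection-oriented, it cannot use a load-balancing strategy that assumes a disconnect after every request, as web servers do.
Information has to be forwarded.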

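[Diagram: clients, including a publisher and a subscriber, connected to several brokers.]
Routing and other information must be synchronized.
Deferring event processing
• State "syncing": publish / Defer
• State "synced": publish / deliver
• Transition from "syncing" to "synced" on "sync complete"
A state machine is not always required, but here we assume that one is needed.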
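The deferral idea can be sketched without any library. The class below is hand-rolled and is not Boost.MSM: broker_fsm, on_publish(), on_sync_complete(), and delivered() are hypothetical names, and "delivery" is reduced to appending to a vector so the behavior is easy to check.

```cpp
#include <cassert>
#include <deque>
#include <string>
#include <utility>
#include <vector>

// Hand-rolled two-state machine that defers publish events while syncing.
class broker_fsm {
public:
    enum class state { syncing, synced };

    // publish event: deferred while syncing, delivered once synced.
    void on_publish(std::string payload) {
        if (state_ == state::syncing) {
            deferred_.push_back(std::move(payload));
            return;
        }
        deliver(std::move(payload));
    }

    // sync-complete event: switch state, then replay deferred events in arrival order.
    void on_sync_complete() {
        state_ = state::synced;
        while (!deferred_.empty()) {
            deliver(std::move(deferred_.front()));
            deferred_.pop_front();
        }
    }

    std::vector<std::string> const& delivered() const { return delivered_; }

private:
    // Stand-in for the real delivery to subscribers.
    void deliver(std::string payload) { delivered_.push_back(std::move(payload)); }

    state state_ = state::syncing;
    std::deque<std::string> deferred_;
    std::vector<std::string> delivered_;
};
```

A publish that arrives during synchronization is neither lost nor delivered early; it is replayed in arrival order once the state becomes synced.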

Combining msm and asio
Before introducing msm
    boost::asio::async_read(
        socket_,
        boost::asio::buffer(payload_),
        [this](
            boost::system::error_code const& ec,
            std::size_t bytes_transferred){
            // error checking ...
            // processing on receive
        }
    );
Processing on receive: deliver the published payload to every subscriber of the topic.
    boost::shared_lock<mutex> guard(mtx_subscribers_);
    auto& idx = subscribers_.get<tag_topic>();
    auto r = idx.equal_range(topic);
    for (; r.first != r.second; ++r.first) {
        auto& socket = r.first->socket;
        boost::asio::write(socket, boost::asio::buffer(payload_));
    }

After introducing msm
    struct transition_table:mpl::vector<
        //          Start     Event  Next        Action       Guard
        msmf::Row < s_normal, e_pub, msmf::none, a_pub,       msmf::none >,
        msmf::Row < s_sync,   e_pub, msmf::none, msmf::Defer, msmf::none >
    > {};
    struct a_pub {
        template <typename Event, typename Fsm, typename Source, typename Target>
        void operator()(Event const& e, Fsm& f, Source&, Target&) const {
            boost::shared_lock<mutex> guard(f.mtx_subscribers_);
            auto& idx = f.subscribers_.get<tag_topic>();
            auto r = idx.equal_range(e.topic);
            for (; r.first != r.second; ++r.first) {
                auto& socket = r.first->socket;
                boost::asio::write(socket, boost::asio::buffer(e.payload));
            }
        }
    };
    // inside the boost::asio::async_read handler
    process_event(e_pub(topic, payload));
• The processing on receive moves into an action
• Events can be deferred
• Processing an event runs the action that corresponds to the current state

msm and threads
Calls to process_event() must be serialized.

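msm and threads
• State "syncing": publish / Defer
• State "synced": publish / deliver
• Transition from "syncing" to "synced" on "sync complete"
Calls to process_event() must be serialized.
If state transitions happen on several threads at the same time, msm's internal state would presumably be corrupted.
    // inside the boost::asio::async_read handler
    process_event(e_pub(topic, payload));  // an exclusive lock is needed here
[Diagram: each received publish is delivered to its subscribers, one publish after another.]
Even publishes received separately must be processed one at a time.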

msm and threads
[Diagram: timeline of the exclusive-lock and shared-lock sections for two publishes.]

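    struct a_pub {
        template <typename Event, typename Fsm, typename Source, typename Target>
        void operator()(Event const& e, Fsm& f, Source&, Target&) const {
            ios.post([&f, e]{
                boost::shared_lock<mutex> guard(f.mtx_subscribers_);
                auto& idx = f.subscribers_.get<tag_topic>();
                auto r = idx.equal_range(e.topic);
                for (; r.first != r.second; ++r.first) {
                    auto& socket = r.first->socket;
                    boost::asio::write(socket, boost::asio::buffer(e.payload));
                }
            });
        }
    };
Inside the section that needs the exclusive lock, only call ios.post(); take the shared lock when the handler passed to ios.post() actually runs.
[Diagram: only the posts are serialized; the posted handlers can run concurrently.]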

Points to check when work is deferred with ios.post():
• Is the added delay acceptable?
• Are the objects referenced by the handler passed to ios.post() still alive when it runs?
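One common way to address the lifetime question is to let the deferred handler share ownership of what it uses. The sketch below uses only the standard library; session and make_deferred_task() are hypothetical names, and std::function stands in for a handler that would be passed to post().

```cpp
#include <cassert>
#include <functional>
#include <memory>
#include <string>

// Hypothetical per-connection object used by a deferred handler.
struct session {
    std::string name;
};

// Returns a deferred task that holds its own shared_ptr copy, so the
// session stays alive until the task itself is destroyed.
std::function<std::string()> make_deferred_task(std::shared_ptr<session> const& s) {
    return [s] { return s->name; };  // capture by value, not by reference
}
```

Capturing by reference, as in [&f, e], is only safe when the referenced object is known to outlive every pending handler; capturing a shared_ptr by value removes that assumption at the cost of a reference count update.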

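What if the body of the for loop is posted as well?
    struct a_pub {
        template <typename Event, typename Fsm, typename Source, typename Target>
        void operator()(Event const& e, Fsm& f, Source&, Target&) const {
            ios.post([&f, e]{
                boost::shared_lock<mutex> guard(f.mtx_subscribers_);
                auto& idx = f.subscribers_.get<tag_topic>();
                auto r = idx.equal_range(e.topic);
                for (; r.first != r.second; ++r.first) {
                    auto& socket = r.first->socket;
                    ios.post([&socket, e]{
                        boost::asio::write(socket, boost::asio::buffer(e.payload));
                    });
                }
            });
        }
    };
The write() calls made in the loop run in parallel, so a performance gain can be expected.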

[Diagram: when a publish is received, only the post is serialized under the exclusive lock; the posted handler takes the shared lock and posts one write per subscriber, and those writes can run concurrently.]

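[Diagram: a subscriber client and a publisher client connected to a broker.]
1. subscribe
2. ack
3. publish(data)
4. data
If 1 and 3 happen at almost the same time, the behaviors acceptable from the subscriber's point of view are:
• receive 2, then 4 (the broker processed 1 before 3), or
• receive 2 only (the broker processed 1 after 3).
Receiving 4 and then 2 must never happen (data arriving before the ack).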

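[Diagram: a subscriber client and a publisher client connected to a broker.]
1. unsubscribe
2. data
3. publish(data)
4. ack
If 1 and 3 happen at almost the same time, the behaviors acceptable from the subscriber's point of view are:
• receive 2, then 4 (the broker processed 3 before 1), or
• receive 4 only (the broker processed 1 before 3).
Receiving 4 and then 2 must never happen (data arriving after the ack).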

Recall the multi-threaded io_service example (output: 8, 9, 10, 11, 7, finished):
http://melpon.org/wandbox/permlink/z5bQJYgO23tvM9XF
The execution order can differ from the post order.

The write that a_pub posts for a subscriber may run after the unsubscribe has been processed and its ack has been sent back.

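Where is the problem?
• We want to guarantee the order of sends on the same connection, but
• using io_service::post() removes that guarantee
• Still, we want to parallelize the loop
• What we need is a way to post work that takes the connection into account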

boost::asio::async_write
In practice, the next async_write ends up being called from inside the completion handler of the previous one.

    template <typename F>
    void my_async_write(
        std::shared_ptr<std::string> const& buf,
        F const& func) {
        strand_.post(
            [this, buf, func]
            () {
                // Enqueue first; each entry holds the buffer and its completion handler.
                queue_.emplace_back(buf, func);
                // If an async_write is still outstanding, do nothing more.
                if (queue_.size() > 1) return;
                // Otherwise start the async_write.
                my_async_write_imp();
            }
        );
    }
To get an async_write that can be called at any time without restrictions, you have to implement queuing and related handling yourself.

    void my_async_write_imp() {
        // Take the data at the front of the queue and async_write it.
        auto& elem = queue_.front();
        auto const& func = elem.handler();
        as::async_write(
            socket_,
            as::buffer(elem.ptr(), elem.size()),
            strand_.wrap(
                [this, func]
                (boost::system::error_code const& ec,
                 std::size_t bytes_transferred) {
                    func(ec);
                    // Remove the finished entry from the queue.
                    queue_.pop_front();
                    // If data remains in the queue, async_write again.
                    if (!queue_.empty()) {
                        my_async_write_imp();
                    }
                }
            )
        );
    }
strand_.post() and strand_.wrap() provide the mutual exclusion.
How does this differ from locking only queue_ with a mutex?
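The queueing discipline in my_async_write can be followed step by step in a single-threaded simulation. The sketch below is not Boost.Asio and contains no strand: write_queue, start_write(), complete_write(), and wire() are hypothetical names, and complete_write() plays the role of the async_write completion handler being invoked.

```cpp
#include <cassert>
#include <cstddef>
#include <deque>
#include <string>
#include <utility>
#include <vector>

// Single-threaded simulation: at most one write is in flight per connection.
class write_queue {
public:
    // Can be called at any time; starts a write only if none is in flight.
    void async_write(std::string data) {
        queue_.push_back(std::move(data));
        if (queue_.size() > 1) return;  // an earlier write is still outstanding
        start_write();
    }

    // Simulates the completion handler of the in-flight write.
    void complete_write() {
        if (queue_.empty()) return;
        wire_.push_back(std::move(queue_.front()));  // the data is now "sent"
        queue_.pop_front();
        if (!queue_.empty()) start_write();  // chain the next queued write
    }

    std::size_t starts() const { return starts_; }
    bool in_flight() const { return !queue_.empty(); }
    std::vector<std::string> const& wire() const { return wire_; }

private:
    // Stand-in for issuing the real asynchronous write.
    void start_write() { ++starts_; }

    std::deque<std::string> queue_;
    std::vector<std::string> wire_;
    std::size_t starts_ = 0;
};
```

Because the next write starts only from the completion of the previous one, data reaches the connection in the order async_write() was called, which is the property needed to keep an ack and a data message from swapping places.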

Wrap async_read with the strand as well
    boost::asio::async_read(
        socket_,
        boost::asio::buffer(payload_),
        strand_.wrap(
            [this](
                boost::system::error_code const& ec,
                std::size_t bytes_transferred){
                // error checking ...
                // processing on receive
            }
        )
    );
async_read handlers also go through the strand.

Is the strand really necessary?
There are cases where handlers form an implicit strand even without an explicit one.

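Publish processing
    struct a_pub {
        template <typename Event, typename Fsm, typename Source, typename Target>
        void operator()(Event const& e, Fsm& f, Source&, Target&) const {
            ios.post([&f, e]{
                boost::shared_lock<mutex> guard(f.mtx_subscribers_);
                auto& idx = f.subscribers_.get<tag_topic>();
                auto r = idx.equal_range(e.topic);
                for (; r.first != r.second; ++r.first) {
                    auto& socket = r.first->socket;
                    // call the hand-written asynchronous write
                    socket.my_async_write(boost::asio::buffer(e.payload), /* completion handler */);
                }
            });
        }
    };
Routing the ack for subscribe / unsubscribe through the same hand-written asynchronous write prevents reordering while still allowing the processing to run in parallel.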

Publish processing
[Diagram: only the post is serialized; the posted handlers call my_async_write for each subscriber concurrently, while writes to the same connection are serialized.]

Publish processing
The asynchronous write is lightweight enough that the for loop took little time.
Running it inside the exclusive lock did not reduce performance.
So the simpler implementation was adopted (the code shown in gray on the slide was removed).

Publish processing
[Diagram: the flow before simplification: the post is serialized, and my_async_write calls are serialized per connection.]

Publish processing
[Diagram: the flow after simplification: my_async_write is called for each subscriber under the exclusive lock; only my_async_write is serialized, per connection.]

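Summary
• Calling run() on one io_service from multiple threads makes effective use of the cores
• msm's Defer is handy for deferring event processing
• On the other hand, msm state transitions require mutual exclusion
• post() can defer arbitrary work, which makes it possible to optimize locking
• post() is unaware of connections, so with multiple threads the execution order is not guaranteed
• In network communication you often want to guarantee ordering on the same connection
• In that case, async_write() can be used
• An async_write() that can be called at any time has to be implemented yourself
• Besides the queuing logic and the async_write handler, async_read() must also go through the strand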
