3. Altic tools / approach
• ETL: Talend
• Big Data: Spark, Hortonworks Data Platform (Hadoop), Elasticsearch
• Data Warehouse: InfiniDB
• Reporting: JasperReports, BIRT
• OLAP: Mondrian, Palo
• Dashboard: Tableau Software, D3
• BI platform: SpagoBI
Twitter www.ow2.org #ow2 #sl2014 @Altic_buzz
4. Biclustering on Big Data
– Tugdual SARAZIN, PhD
  – ALTIC
  – LIPN (Paris 13)
– Biclustering
  – A biclustering algorithm for Big Data
  – Built on Spark
  – Based on SOM (Self-Organizing Map)
  – Available on GitHub: Spark-Clustering
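
Since the slide names the Self-Organizing Map as the basis of the algorithm, a minimal single-machine SOM sketch may help fix the idea. This is not the Spark-Clustering code and it is not distributed. It clusters rows only, whereas a biclustering variant would group columns as well. The function names, parameters and toy data are assumptions made for illustration.

```python
import math
import random


def best_matching_unit(protos, row):
    """Index of the prototype closest to row (squared Euclidean distance)."""
    return min(range(len(protos)),
               key=lambda i: sum((p - v) ** 2 for p, v in zip(protos[i], row)))


def train_som(rows, grid_w, grid_h, epochs=20, lr0=0.5, seed=0):
    """Train a tiny SOM on a list of equal-length float rows."""
    rng = random.Random(seed)
    dim = len(rows[0])
    # One prototype per grid cell, initialised from randomly chosen data rows.
    cells = [(x, y) for y in range(grid_h) for x in range(grid_w)]
    protos = [list(rng.choice(rows)) for _ in cells]
    sigma0 = max(grid_w, grid_h) / 2.0
    for epoch in range(epochs):
        frac = epoch / epochs
        lr = lr0 * (1.0 - frac)                  # learning rate decays linearly
        sigma = max(sigma0 * (1.0 - frac), 0.5)  # neighbourhood radius shrinks
        for row in rng.sample(rows, len(rows)):  # visit rows in random order
            bx, by = cells[best_matching_unit(protos, row)]
            for i, (cx, cy) in enumerate(cells):
                # Cells near the winner on the grid are pulled harder.
                grid_d2 = (cx - bx) ** 2 + (cy - by) ** 2
                h = math.exp(-grid_d2 / (2.0 * sigma * sigma))
                for k in range(dim):
                    protos[i][k] += lr * h * (row[k] - protos[i][k])
    return protos


# Toy data (invented): two loose groups of 2-D points.
data = [[0.0, 0.1], [0.1, 0.0], [5.0, 5.1], [5.1, 5.0]]
protos = train_som(data, grid_w=2, grid_h=1)
labels = [best_matching_unit(protos, r) for r in data]
```

Each row ends up labelled with the grid cell whose prototype is nearest. In a Spark setting the per-row winner search is the natural part to parallelise, but how Spark-Clustering does so is not described in the slides.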
5. Integration with SpagoBI
– Spark Biclustering can be an engine for SpagoBI
  – Define a data set as input
  – Execute the biclustering with appropriate settings
  – Store the result in a defined format
    – Databases
    – Big data storage (HDFS)
    – SpagoBI Dataset
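
The three steps listed above (input data set, run with settings, store the result) can be sketched as a small pipeline. Everything here is hypothetical: no SpagoBI or Spark-Clustering API is used, the function names and settings keys are invented, and the clustering step is a deliberately trivial stand-in that bins rows and columns by the rank of their mean.

```python
import csv
import io


def load_dataset(csv_text):
    """Step 1: parse a CSV data set into (column names, rows of floats)."""
    reader = csv.reader(io.StringIO(csv_text))
    header = next(reader)
    rows = [[float(v) for v in line] for line in reader if line]
    return header, rows


def placeholder_bicluster(rows, settings):
    """Step 2 stand-in (NOT the Spark algorithm): label rows and columns
    by splitting them into equal-sized bins ordered by their mean."""
    def bins(means, k):
        order = sorted(range(len(means)), key=lambda i: means[i])
        labels = [0] * len(means)
        for rank, i in enumerate(order):
            labels[i] = rank * k // len(means)
        return labels

    row_means = [sum(r) / len(r) for r in rows]
    col_means = [sum(c) / len(c) for c in zip(*rows)]
    return (bins(row_means, settings["row_groups"]),
            bins(col_means, settings["col_groups"]))


def store_result(header, row_labels, col_labels):
    """Step 3: serialise the labels as CSV text (kind, name, group)."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["kind", "name", "group"])
    for i, group in enumerate(row_labels):
        writer.writerow(["row", i, group])
    for name, group in zip(header, col_labels):
        writer.writerow(["column", name, group])
    return out.getvalue()


# Invented sample input and settings.
csv_text = "a,b,c\n1,2,9\n2,1,8\n9,8,1\n"
header, rows = load_dataset(csv_text)
row_labels, col_labels = placeholder_bicluster(
    rows, {"row_groups": 2, "col_groups": 2})
result = store_result(header, row_labels, col_labels)
```

In a real integration the stand-in would be replaced by a call to the Spark job, and the CSV text would be written to one of the targets on the slide (a database, HDFS or a SpagoBI Dataset).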
6. Integration with Talend
– Spark Biclustering can be a component for Talend Big Data
– Add new features to existing Talend Big Data components
– Biclustering
  – Lets you map your data