Ellipsoidal Representations about correlations (2011-11, Tsukuba, Kakenhi-Sym...Toshiyuki Shimono
A fundamental theory in statistics, possibly applicable to data mining, machine learning, as well as epistemology. The principia mathematica of mine, 2nd version.
The correlation coefficient between random walk and time has a characteristic shape of histogram or density function. Some findings has been revealed and it is desirable to be investigated more.
A consideration about picking up the `best' values from N times experiences. Mathematically, the author considered "how the maximum value of N values that are distributed normally distributes?". The median values are 0.998, 1.498, 1.998 for N=4,10,30, respectively.
This proposition would be useful because everybody quite often is in need of choosing something best from multiple candidates in their daily or social life. The figures shown above are easy to remember so you can utilize them with ease.
Despite the existence of data analysis tools such as R, SQL, Excel and others, it is still insufficient to cope with today's big data analysis needs.
The author proposes a CUI (Character User Interface) toolset with dozens of functions to neatly handle tabular data in TSV (Tab Separated Values) files.
It implements many basic and useful functions that have not been implemented in existing software with each function borrowing the ideas of Unix philosophy and covering the most frequent pre-analysis tasks during the initial exploratory stage of data analysis projects.
Also, it greatly speeds up basic analysis tasks, such as drawing cross tables, Venn diagrams, etc., while existing software inevitably requires rather complicated programming and debugging processes for even these basic tasks.
Here, tabular data mainly means TSV (Tab-Separated Values) files as well as other CSV (Comma Separated Value)-type files which are all widely used for storing data and suitable for data analysis.
Ellipsoidal Representations about correlations (2011-11, Tsukuba, Kakenhi-Sym...Toshiyuki Shimono
A fundamental theory in statistics, possibly applicable to data mining, machine learning, as well as epistemology. The principia mathematica of mine, 2nd version.
The correlation coefficient between random walk and time has a characteristic shape of histogram or density function. Some findings has been revealed and it is desirable to be investigated more.
A consideration about picking up the `best' values from N times experiences. Mathematically, the author considered "how the maximum value of N values that are distributed normally distributes?". The median values are 0.998, 1.498, 1.998 for N=4,10,30, respectively.
This proposition would be useful because everybody quite often is in need of choosing something best from multiple candidates in their daily or social life. The figures shown above are easy to remember so you can utilize them with ease.
Despite the existence of data analysis tools such as R, SQL, Excel and others, it is still insufficient to cope with today's big data analysis needs.
The author proposes a CUI (Character User Interface) toolset with dozens of functions to neatly handle tabular data in TSV (Tab Separated Values) files.
It implements many basic and useful functions that have not been implemented in existing software with each function borrowing the ideas of Unix philosophy and covering the most frequent pre-analysis tasks during the initial exploratory stage of data analysis projects.
Also, it greatly speeds up basic analysis tasks, such as drawing cross tables, Venn diagrams, etc., while existing software inevitably requires rather complicated programming and debugging processes for even these basic tasks.
Here, tabular data mainly means TSV (Tab-Separated Values) files as well as other CSV (Comma Separated Value)-type files which are all widely used for storing data and suitable for data analysis.
Тульчинский Г.Л. Брендинг мест: на примере Республики Комиprasu1995
Опубликовано в сборнике: PR и реклама в изменяющемся мире: Региональный аспект [Текст] : сборник статей/ под ред. М.В. Гундарина, А. Г. Сидоровой, Ю. В. Явинской. – Вып. 10. – Барнаул: Изд-во Алт. ун-та, 2012.
Peter Ramsden gave an overview on the process and scope of social innovation. He pointed out the essential role of the public sector and emphasised the need to involve all the stakeholders – above all the target group – and to focus on results. Part of his presentation also focused on the chances of innovative financing.
Three trends that will shape the future of B2B digital marketingLabbrand
B2B content isn't what it used to be. As content becomes more and more important in shaping the outcome of buyers’ decisions, brands have to explore new channels and content types to influence buyers all along their journey. These 3 key trends in B2B marketing will give you pointers as to how you can build a more innovative and meaningful content strategy for your brand.
Тульчинский Г.Л. Брендинг мест: на примере Республики Комиprasu1995
Опубликовано в сборнике: PR и реклама в изменяющемся мире: Региональный аспект [Текст] : сборник статей/ под ред. М.В. Гундарина, А. Г. Сидоровой, Ю. В. Явинской. – Вып. 10. – Барнаул: Изд-во Алт. ун-та, 2012.
Peter Ramsden gave an overview on the process and scope of social innovation. He pointed out the essential role of the public sector and emphasised the need to involve all the stakeholders – above all the target group – and to focus on results. Part of his presentation also focused on the chances of innovative financing.
Three trends that will shape the future of B2B digital marketingLabbrand
B2B content isn't what it used to be. As content becomes more and more important in shaping the outcome of buyers’ decisions, brands have to explore new channels and content types to influence buyers all along their journey. These 3 key trends in B2B marketing will give you pointers as to how you can build a more innovative and meaningful content strategy for your brand.
Theory to consider an inaccurate testing and how to determine the prior proba...Toshiyuki Shimono
I presented a mathematical theory on a medical testing method. This fundamental theory can be taken account of both cases when the resource of the testing is limited or not. One implication is that "negative proof" may not function well, and another implication is that excessively high specificity and accuracy are required for meaningful diagnosis unless the careful usage of the diagnosis is considered.
To Make Graphs Such as Scatter Plots Numerically Readable (PacificVis 2018, K...Toshiyuki Shimono
Different-sized discrete crosses placed in an organized lattice pattern can assist the human eyes to read numerical values on statistical graphs, enabling more precise interpretation and enlarging the utility of statistical graphs that visually represent numerical quantities. This paper presents a novel graph-plotting method that places roughly ten thousand of separated grids on a graph, providing human data analysis with an easy access to arbitrary numerical readouts from a statistical graph. At present, this functionality has been lacking in the existing graph-plotting softwares.
To Make Graphs Such as Scatter Plots Numerically Readable (PacificVis 2018, K...Toshiyuki Shimono
Different-sized discrete crosses placed in an organized lattice pattern can assist the human eyes to read numerical values on statistical graphs, enabling more precise interpretation and enlarging the utility of statistical graphs that visually represent numerical quantities. This paper presents a novel graph-plotting method that places roughly ten thousand of separated grids on a graph, providing human data analysis with an easy access to arbitrary numerical readouts from a statistical graph. At present, this functionality has been lacking in the existing graph-plotting softwares.
Make Accumulated Data in Companies Eloquent by SQL Statement Constructors (PDF)Toshiyuki Shimono
Presented at IEEE BigData 2017, Boston, on Dec 11, 2017
in the Workshop of "3rd International Workshop on Methodologies to Improve Big Data projects".
The author is Toshiyuki Shimono, Digital Garage, Inc.
(This is PDF format instead of MS Powerpoint format for the sake of significantly smaller file size.)
38. 用いたR言語のスクリプト
par(mfrow=c(4,6))
K=800;H=4; # Change the value of K to 5, 7, 10, 15, 20, 25,
mar=0.2;cex=.5 # if K < 400 , set cex=1. if K >= 400 set cex=0.5
for(i in 1:24){
par(mar=rep(mar,4),xaxt="n",yaxt="n",xlab="",ylab="")
plot(NA,NA,xlim=c(-H,H),ylim=c(-H,H)) ;
points(0,0,pch=3,cex=15,col="gray80")
par(new=T)
plot(rnorm(K),rnorm(K),xlim=c(-H,H),ylim=c(-H,H) , pch=16,cex=cex) #,
next
par(new=T);
symbols(rep(0,2),rep(0,2),circles=rep(2,2),
xaxt="n",
yaxt="n",xlab="",ylab="",fg="gray60",cex=3, inches=F
,xlim=c(-H,H),ylim=c(-H,H))
}
40. 用いたR言語のスクリプト
(正方形内の一様分布)
par(mfrow=c(8,12))
for(K in c(5,10,20,30,40,60,80,100)){
#K=10;
H=4; # Change the value of K to 5, 7, 10, 15, 20, 25,
mar=0.2;cex=sqrt(20)/sqrt(K) # if K < 400 , set cex=1. if K >= 400 set cex=0.5
rp <- function(k){runif(K,-H*.9,H*.9)}
for(i in 1:12){
par(mar=rep(mar,4),xaxt="n",yaxt="n",xlab="",ylab="")
plot(NA,NA,xlim=c(-H,H),ylim=c(-H,H)) ;
points(0,0,pch=3,cex=15,col="gray80")
par(new=T)
plot(rp(K),rp(K),xlim=c(-H,H),ylim=c(-H,H) , pch=16,cex=cex) #,
text(H*.93,-H*.95,paste(K),col="skyblue")
next
par(new=T);
symbols(rep(0,2),rep(0,2),circles=rep(2,2),
xaxt="n",
yaxt="n",xlab="",ylab="",fg="gray60",cex=3, inches=F
,xlim=c(-H,H),ylim=c(-H,H))
}
}