This document discusses testing and comparing different numerical integration methods for simulating Cosserat rod and rigid contact mechanics (RCM) models. It finds that the θ-method is most stable for the Cosserat rod model, while RCM simulations are less stable. The stiff second-order differential equations mean implicit methods like RADAU5 and θ are required. Performance profiling shows the θ-method has the best performance for the Cosserat rod model, while initialization is most expensive for both models.
Welcome to International Journal of Engineering Research and Development (IJERD)IJERD Editor
call for paper 2012, hard copy of journal, research paper publishing, where to publish research paper,
journal publishing, how to publish research paper, Call For research paper, international journal, publishing a paper, IJERD, journal of science and technology, how to get a research paper published, publishing a paper, publishing of journal, publishing of research paper, reserach and review articles, IJERD Journal, How to publish your research paper, publish research paper, open access engineering journal, Engineering journal, Mathemetics journal, Physics journal, Chemistry journal, Computer Engineering, Computer Science journal, how to submit your paper, peer reviw journal, indexed journal, reserach and review articles, engineering journal, www.ijerd.com, research journals
Welcome to International Journal of Engineering Research and Development (IJERD)IJERD Editor
call for paper 2012, hard copy of journal, research paper publishing, where to publish research paper,
journal publishing, how to publish research paper, Call For research paper, international journal, publishing a paper, IJERD, journal of science and technology, how to get a research paper published, publishing a paper, publishing of journal, publishing of research paper, reserach and review articles, IJERD Journal, How to publish your research paper, publish research paper, open access engineering journal, Engineering journal, Mathemetics journal, Physics journal, Chemistry journal, Computer Engineering, Computer Science journal, how to submit your paper, peer reviw journal, indexed journal, reserach and review articles, engineering journal, www.ijerd.com, research journals
The myphotonics project deals with the construction of opto-mechanical components and optical experiment implementation using modular systems such as LEGO®.
The components are low cost and the instructions that originated them are free to use. OpenAdaptonik and myphotonics can work together sharing the same purpose.
KMEM4212_Applied Vibration_Group Assignment_Report_CL 3Max Lee
KMEM4212 Applied Vibration (University of Malaya)
Co-operative Learning 3 (CL 3)
This report contains a clear methodology that may be helpful for those who wish to run the experiment by themselves.
Sharing with you (Mechanical Engineering students) who may be benefited from it.
Feel free to connect with me at maxermesilliam@gmail.com
vFORTRAN is used as a numerical and scientific computing language. The main objective of the lab work is to understand FORTRAN language using which we solve simple numerical problems and compare different methodologies. Through this project we make use of various functions of FORTRAN and solve a simple projectile problem and also LAPACK library to solve a tridiagonal matrix problem. We use DGESV and DGTSV functions to make it possible. The given problems are built and compiled using a free integrated development environment called CODE::BLOCKS [1] which is an open source platform for FORTRAN and C.
In industry, bogey testing, also known as the zero-failure testing, is often used to demonstrate product reliability. This test method is simple to apply; however, it requires excessive test time and/or a large sample size, and thus is usually unaffordable. For some products whose failure is defined as a performance characteristic exceeding a threshold, it is possible to measure the performance characteristic during testing. The measurement data can be employed to predict whether or not a test unit will fail by the end of test. When there are sufficient data to make such a prediction with a high degree of confidence, the test of the unit can be terminated. As a result, the test time is reduced.
This presentation describes the test method, sample size computation, degradation models, and cost function for the lognormal bogey testing. Then the presentation discusses the optimum test plans, which choose the optimal sample size and the expected test time by minimizing the total test cost and simultaneously satisfying the constraints on the type II error and the available sample size. An example is given to illustrate the test method.
Transient three dimensional cfd modelling of ceilng fanLahiru Dilshan
Ceiling fans are used to get thermal comfort, especially in tropical countries. With the increment of the usage of air conditioners, the emission of CO2 is increased. But ceiling fans are a limited solution, that saves much energy compared to air conditioners. Ceiling fans generate a non-uniform velocity profile, so that, there is a non-uniform thermal environment. That non-uniform environment does not imply lower thermal comfort, that will give enough thermal comfort with low energy cost by air velocity. Hence, there will be difficulties of analysing with simple modelling techniques in that environment. So, to predict the performance of the ceiling fan required more accurate models.
The accurate model of a ceiling fan will generate complex geometry that makes difficulties for the simulation process and requires higher computational power. Because of that, there are several methods used to predict the performance of the ceiling fan using mathematical techniques but that will give an estimated value of properties in the surrounding.
Energy-Based Control of Under-Actuated Mechanical Systems - Remotely Driven A...Mostafa Shokrian Zeini
This presentation concerns the energy-based swing-up control for a remotely driven acrobot (RDA) which is a 2-link planar robot with the first link being underactuated and the second link being remotely driven by an actuator mounted at a fixed base through a belt.
A landing gear assembly consists of various components viz. Lower side stay, Upperside stay, Locking actuators, Extension actuators, Tyres, and Locking pins to name a few. Each unit having a specific operation to deal with, in this project the main unit being studied is the lower brace. The primary objective is to analyse stresses in the element of the lower brace unit using strength of materials or RDM method and Finite Element Method (FEM) and compare both. Using the obtained data a suitable material is proposed for the component. The approach used here is to study the overall behaviour of the element by taking up each aspect, finally summing up the total effect of all the aspects in the functioning of the element.
The myphotonics project deals with the construction of opto-mechanical components and optical experiment implementation using modular systems such as LEGO®.
The components are low cost and the instructions that originated them are free to use. OpenAdaptonik and myphotonics can work together sharing the same purpose.
KMEM4212_Applied Vibration_Group Assignment_Report_CL 3Max Lee
KMEM4212 Applied Vibration (University of Malaya)
Co-operative Learning 3 (CL 3)
This report contains a clear methodology that may be helpful for those who wish to run the experiment by themselves.
Sharing with you (Mechanical Engineering students) who may be benefited from it.
Feel free to connect with me at maxermesilliam@gmail.com
vFORTRAN is used as a numerical and scientific computing language. The main objective of the lab work is to understand FORTRAN language using which we solve simple numerical problems and compare different methodologies. Through this project we make use of various functions of FORTRAN and solve a simple projectile problem and also LAPACK library to solve a tridiagonal matrix problem. We use DGESV and DGTSV functions to make it possible. The given problems are built and compiled using a free integrated development environment called CODE::BLOCKS [1] which is an open source platform for FORTRAN and C.
In industry, bogey testing, also known as the zero-failure testing, is often used to demonstrate product reliability. This test method is simple to apply; however, it requires excessive test time and/or a large sample size, and thus is usually unaffordable. For some products whose failure is defined as a performance characteristic exceeding a threshold, it is possible to measure the performance characteristic during testing. The measurement data can be employed to predict whether or not a test unit will fail by the end of test. When there are sufficient data to make such a prediction with a high degree of confidence, the test of the unit can be terminated. As a result, the test time is reduced.
This presentation describes the test method, sample size computation, degradation models, and cost function for the lognormal bogey testing. Then the presentation discusses the optimum test plans, which choose the optimal sample size and the expected test time by minimizing the total test cost and simultaneously satisfying the constraints on the type II error and the available sample size. An example is given to illustrate the test method.
Transient three dimensional cfd modelling of ceilng fanLahiru Dilshan
Ceiling fans are used to get thermal comfort, especially in tropical countries. With the increment of the usage of air conditioners, the emission of CO2 is increased. But ceiling fans are a limited solution, that saves much energy compared to air conditioners. Ceiling fans generate a non-uniform velocity profile, so that, there is a non-uniform thermal environment. That non-uniform environment does not imply lower thermal comfort, that will give enough thermal comfort with low energy cost by air velocity. Hence, there will be difficulties of analysing with simple modelling techniques in that environment. So, to predict the performance of the ceiling fan required more accurate models.
The accurate model of a ceiling fan will generate complex geometry that makes difficulties for the simulation process and requires higher computational power. Because of that, there are several methods used to predict the performance of the ceiling fan using mathematical techniques but that will give an estimated value of properties in the surrounding.
Energy-Based Control of Under-Actuated Mechanical Systems - Remotely Driven A...Mostafa Shokrian Zeini
This presentation concerns the energy-based swing-up control for a remotely driven acrobot (RDA) which is a 2-link planar robot with the first link being underactuated and the second link being remotely driven by an actuator mounted at a fixed base through a belt.
A landing gear assembly consists of various components viz. Lower side stay, Upperside stay, Locking actuators, Extension actuators, Tyres, and Locking pins to name a few. Each unit having a specific operation to deal with, in this project the main unit being studied is the lower brace. The primary objective is to analyse stresses in the element of the lower brace unit using strength of materials or RDM method and Finite Element Method (FEM) and compare both. Using the obtained data a suitable material is proposed for the component. The approach used here is to study the overall behaviour of the element by taking up each aspect, finally summing up the total effect of all the aspects in the functioning of the element.
The part is axisymmetrically modeled in solidworks(2D) before importing to ansys workbench where the boundary zones are identified and appropriate mesh settings is applied. The model is then imported in Fluent for analysis . Significant setting changes are Density based solver , Enhanced Eddy viscosity model with near wall treatment , solution steering , FMG initialization etc.
(SAC2020 SVT-2) Constrained Detecting Arrays for Fault Localization in Combin...Hao Jin
Authors:
Hao Jin, Osaka University
Ce Shi, Shanghai Lixin University of Accounting and Finance
Tatsuhiro Tsuchiya, Osaka University
Abstract:
Detecting Arrays (DAs) are mathematical objects that enable fault localization in combinatorial interaction testing. Each row of a DA serves as a test case, whereas a whole DA is treated as a test suite. In real-world testing problems, it is often the case that some constraints exist among test parameters. In this paper, we show that it may be impossible to construct a DA using only constraint-satisfying test cases. The reason for this is that a set of some faulty interactions may always mask the effect of other faulty interactions in the presence of constraints. Based on this observation, we propose the notion of Constrained Detecting Arrays (CDAs) to adapt DAs to practical situations. The definition of CDAs requires that all rows of a CDA must satisfy the constraints and the same fault localization capability as the DA must hold except for such inherently undetectable faults. We then propose a computational method for constructing CDAs. Experimental results obtained by using a program that implements the method show that the method was able to produce CDAs within a reasonable time for practical problem instances.
Case Quality Management—ToyotaQuality Control Analytics at Toyo.docxcowinhelen
Case: Quality Management—Toyota
Quality Control Analytics at Toyota
As part of the process for improving the quality of their cars, Toyota engineers have identifi ed a potential improvement does happen to get too large, it can cause the accelerator to bind and create a potential problem for the driver. (Note: This part of the case has been fabricated for teaching purposes, and none of these data were obtained from Toyota.)
Let’s assume that, as a first step to improving the process, a sample of 40 washers coming from the machine that produces the washers was taken and the thickness measured in millimeters. The following table has the measurements from the sample:
1.9 2.0 1.9 1.8 2.2 1.7 2.0 1.9 1.7 1.8
1.8 2.2 2.1 2.2 1.9 1.8 2.1 1.6 1.8 1.6
2.1 2.4 2.2 2.1 2.1 2.0 1.8 1.7 1.9 1.9
2.1 2.0 2.4 1.7 2.2 2.0 1.6 2.0 2.1 2.2
Questions
1 If the specification is such that no washer should be greater than 2.4 millimeters, assuming that the thick-nesses are distributed normally, what fraction of the output is expected to be greater than this thickness?
The average thickness in the sample is 1.9625 and the standard deviation is .209624. The probability that the thickness is greater than 2.4 is Z = (2.4 – 1.9625)/.209624 = 2.087068 1 - NORMSDIST(2.087068) = .018441 fraction defective, so 1.8441 percent of the washers are expected to have a thickness greater than 2.4.
2 If there are an upper and lower specification, where the upper thickness limit is 2.4 and the lower thick-ness limit is 1.4, what fraction of the output is expected to be out of tolerance?
The upper limit is given in a. The lower limit is 1.4 so Z = (1.4 – 1.9625)/.209624 = -2.68337. NORMSDIST(-2.68337) = .003644 fraction defective, so .3644 percent of the washers are expected to have a thickness lower than 1.4. The total expected fraction defective would be .018441 + .003644 = .022085 or about 2.2085 percent of the washers would be expected to be out of tolerance.
3 What is the Cpk for the process?
4 What would be the Cpk for the process if it were centered between the specification limits (assume the process standard deviation is the same)?
The center of the specification limits is 1.9, which is used for X-bar in the following:
5 What percentage of output would be expected to be out of tolerance if the process were centered?
Z = (2.4 – 1.9)/.209624 = 2.385221
Fraction defective would be 2 x (1-NORMSDIST(2.385221)) = 2 x .008534 = .017069, about 1.7 percent.
6 Set up X - and range control charts for the current process. Assume the operators will take samples of 10 washers at a time.
Observation
Sample
1
2
3
4
5
6
7
8
9
10
X-bar
R
1
1.9
2
1.9
1.8
2.2
1.7
2
1.9
1.7
1.8
1.89
0.5
2
1.8
2.2
2.1
2.2
1.9
1.8
2.1
1.6
1.8
1.6
1.91
0.6
3
2.1
2.4
2.2
2.1
2.1
2
1.8
1.7
1.9
1.9
2.02
0.7
4
2.1
2
2.4
1.7
2.2
2
1.6
2
2.1
2.2
2.03
0.8
Mean:
1.9625
0.65
From Exhibit 10.13, with sample size of 10, A2 = .31, D3 = .22 and D4 = 1.78
The upper control limit for the X-bar ch.
Unit 1: Fundamentals of the Analysis of Algorithmic Efficiency, Units for Measuring Running Time, PROPERTIES OF AN ALGORITHM, Growth of Functions, Algorithm - Analysis, Asymptotic Notations, Recurrence Relation and problems
(Slides) Efficient Evaluation Methods of Elementary Functions Suitable for SI...Naoki Shibata
Naoki Shibata : Efficient Evaluation Methods of Elementary Functions Suitable for SIMD Computation, Journal of Computer Science on Research and Development, Proceedings of the International Supercomputing Conference ISC10., Volume 25, Numbers 1-2, pp. 25-32, 2010, DOI: 10.1007/s00450-010-0108-2 (May. 2010).
http://www.springerlink.com/content/340228x165742104/
http://freshmeat.net/projects/sleef
Data-parallel architectures like SIMD (Single Instruction Multiple Data) or SIMT (Single Instruction Multiple Thread) have been adopted in many recent CPU and GPU architectures. Although some SIMD and SIMT instruction sets include double-precision arithmetic and bitwise operations, there are no instructions dedicated to evaluating elementary functions like trigonometric functions in double precision. Thus, these functions have to be evaluated one by one using an FPU or using a software library. However, traditional algorithms for evaluating these elementary functions involve heavy use of conditional branches and/or table look-ups, which are not suitable for SIMD computation. In this paper, efficient methods are proposed for evaluating the sine, cosine, arc tangent, exponential and logarithmic functions in double precision without table look-ups, scattering from, or gathering into SIMD registers, or conditional branches. We implemented these methods using the Intel SSE2 instruction set to evaluate their accuracy and speed. The results showed that the average error was less than 0.67 ulp, and the maximum error was 6 ulps. The computation speed was faster than the FPUs on Intel Core 2 and Core i7 processors.
We consider the problem of finding anomalies in high-dimensional data using popular PCA based anomaly scores. The naive algorithms for computing these scores explicitly compute the PCA of the covariance matrix which uses space quadratic in the dimensionality of the data. We give the first streaming algorithms
that use space that is linear or sublinear in the dimension. We prove general results showing that any sketch of a matrix that satisfies a certain operator norm guarantee can be used to approximate these scores. We instantiate these results with powerful matrix sketching techniques such as Frequent Directions and random projections to derive efficient and practical algorithms for these problems, which we validate over real-world data sets. Our main technical contribution is to prove matrix perturbation
inequalities for operators arising in the computation of these measures.
-Proceedings: https://arxiv.org/abs/1804.03065
-Origin: https://arxiv.org/abs/1804.03065
Medical Conferences, Pharma Conferences, Engineering Conferences, Science Conferences, Manufacturing Conferences, Social Science Conferences, Business Conferences, Scientific Conferences Malaysia, Thailand, Singapore, Hong Kong, Dubai, Turkey 2014 2015 2016
Global Research & Development Services (GRDS) is a leading academic event organizer, publishing Open Access Journals and conducting several professionally organized international conferences all over the globe annually. GRDS aims to disseminate knowledge and innovation with the help of its International Conferences and open access publications. GRDS International conferences are world-class events which provide a meaningful platform for researchers, students, academicians, institutions, entrepreneurs, industries and practitioners to create, share and disseminate knowledge and innovation and to develop long-lasting network and collaboration.
GRDS is a blend of Open Access Publications and world-wide International Conferences and Academic events. The prime mission of GRDS is to make continuous efforts in transforming the lives of people around the world through education, application of research and innovative ideas.
Global Research & Development Services (GRDS) is also active in the field of Research Funding, Research Consultancy, Training and Workshops along with International Conferences and Open Access Publications.
International Conferences 2014 – 2015
Malaysia Conferences, Thailand Conferences, Singapore Conferences, Hong Kong Conferences, Dubai Conferences, Turkey Conferences, Conference Listing, Conference Alerts
Similar to Simulation Testing for Cosserat Rods (20)
2. 1 Characterization of stiffness[wiki stiffEquation]
A linear constant coefficient system is stiff if all of its eigenvalues have negative real
part and the stiffness ratio is large. Stiffness occurs when stability requirements, rather
than those of accuracy, constrain the step length. Stiffness occurs when some
components of the solution decay much more rapidly than others.[2]
As the stiffness ratio is determined by the property of the system, which includes the deformation
flexible rod and movement of rigid body. The stiffness cannot eliminate by improving the formula.
The only way is to use A-stable Integrator.
2 Some Notes
• LSODE: adaptive timestep size, to look for angular and position
• h can not larger than 1e-6, otherwise collapse for all solver? theta method?? see the new
simulation result.
• trapezoidal method (θ = 0.5) does not have perfect stability: it does damp all decaying
components, but rapidly decaying components are damped only very mildly.
• There are no explicit A-stable and linear multistep methods. The implicit ones have order of
convergence at most 2.
• The trapezoidal rule has the smallest error constant amongst the A-stable linear multistep
methods of order 2.
3 Oscillating Ring
The following simulation results are based on nine layer model.
• In the following simulation, ∆R = 0.1R, actually ∆R ≈ 1e-3 ∼ 1e-4 R in general.
• For the SSC: setMaxOrder(4,1), 4 means choose the order 4, which is the maximum, 1 means
method 1 which is embedded method (compare maxOrder maxOrder+1); proceed with
maxOrder (recommended).
• TimeSteppingIntegrator: if the number of elements is small, and time step size is extremely
small, it will not crash during certain simulation time.
Table 1: Crash Time
Timestep elements number=20 elements number=8
1e-6 0.00001s 0.00002s
1e-7 0.00001s -
1e-8 0.001s 0.01229s
1e-9 0.0089 -
• LSODE: does not improve so much,
– crash time(elements number=20): 0.001s
2
3. – crash time(elements number=10): 0.002s
– crash time(elements number=8): 0.19818s (the step size is during 1e-8 ∼ 1e-9)
– use the option set.stiff does not helps so much, crash at 0.009s(N=20) 0.1469s(N=8)
• SSC: when N=20, it does not improve so much. It crash as soon as it reach the minimum
time step size, which is 5.96046e-08s. So during simulation, as soon as we see it reach teh
minimu time setp size, we should stop.
The simulation result in the table is based on ∆R = 0.01R .
Table 2: Stability Test
Model TimeStepping LSODE SSC RADAU5 Theta
one layer Y(1e-7) Y(1e-7) Y Y Y
nine layer N(1e-9) N N Y Y
If using one layer model to calculate the area inertial moment I, all these method converge. If
using nine layer model, RADAU5 and integrator work.
4 Pearl Chain CVT Setting
I1 also have to be divided by pow(Nr, 2) as it is also consist of two separate part.
Table 3: Stability Test
I TimeStepping LSODE SSC RADAU5 Theta
I0 = I1 + I2 N N lack of Memory - Y
I0 = 0.32 ∗ (2wr) ∗ hr3
N N lack of Memory - Y
For the area inertial moment I2, one layer model and nine layer model are corresponding to two
extreme cases. The system will have the biggest stiffness ratio if the nine layer model is chosen.
If the nine layer cannot be solved by the θ method, then we should improve this value by
• choose some value between the the one layer model and nine layer model, use bisection to
find the critical point
• experiment data
• use ANSYS to simulate to get the value if the friction coefficient is known.
• analysis how much does the inertial moment effects the simulation result which we need.
In the testing, for nine layer model, θ method is even faster than the SSC method, as the SSC
method is reaching the minimum time step size,which is 5.96046e-08s or 2.98023e-08(with new
setting).
4.1 Results of θ method
It seems that the area inertial moment I2 calculated by the nine layer model is too small so that
the ring seems to be too elastic. The correct value should between the values calculated by the
nine layer model and the one layer model. (I2one/I2nine = 81) we try to use bisection to find the
3
4. critical point, testing different value of I2 which is between I2one and I2nine:
I2 = K ∗ I2nine, K = 1, 21, 41, 61, 81.
After testing, we decide to choose the I2 = 41 ∗ I2nine temporarily.
Table 4: Stability Test of θ Method: I0 = I1 + I2, θ = 0.5
I 1e-5 5e-5 1e-4 5e-4 1e-3
I2 = 1 ∗ I2nine
I2 = 21 ∗ I2nine
I2 = 41 ∗ I2nine Y(0.19805) 0.0752 0.0767 0.075
I2 = 61 ∗ I2nine
I2 = 81 ∗ I2nine
Table 5: Stability Test of θ Method: I0 = I1 + I2, θ = 0.878
I 1e-5 5e-5 1e-4 5e-4 1e-3
I2 = 1 ∗ I2nine Y(may stop around 0.19s) -
I2 = 21 ∗ I2nine Y(may stop around 0.19s) 0.0753s
I2 = 41 ∗ I2nine Y(0.1997s) 0.0745s 0.0756s 0.082s
I2 = 61 ∗ I2nine Y(may stop around 0.19s) 0.0753s
I2 = 81 ∗ I2nine Y(may stop around 0.19s) 0.0737s 0.0745s 0.0835s
Table 6: Stability Test of θ Method: I0 = I1 + I2, θ = 1
I 1e-5 5e-5 1e-4 5e-4 1e-3())
I2 = 1 ∗ I2nine
I2 = 21 ∗ I2nine
I2 = 41 ∗ I2nine Y(0.19325s) 0.0737s 0.0658s
I2 = 61 ∗ I2nine 0.092s
I2 = 81 ∗ I2nine
Comparing the results of simulation by setting θ = 1,θ = 0.5, it shows a slightly difference after
about 0.095s. But the overall tends of them are the same. When θ = 0.5, the solution is more
oscillating locally than that of when θ = 1. But when θ = 0.5, the global error is controlled better
than that when θ = 1. So it diverges later. It indicates that a more smaller stepsize is needed for
more accurate solution and longer simulation.
4.2 Results of SSC
The SSC is very slow for solving Pearl Chain CVT Setting as it always reach the minimum stepsize
which is too small. And the memory consumption is quite high, causing the simulation stops at
about 0.05s. So the SSC is infeasible for this simulation.
4
5. Table 7: Stability Test of SSC Method: I0 = I1 + I2, SSC, SSCNewSetting
I SSC(setMaxOrder(2,1)) SSC(setMaxOrder(1,0), setFlagErrorTest(2,false),
SetGapControl(true, 1))
I2 = 1 ∗ I2nine N N(0.01383s, lack of Memory)
I2 = 21 ∗ I2nine
I2 = 41 ∗ I2nine Y N(0.058s, lack of Memory)
I2 = 61 ∗ I2nine
I2 = 81 ∗ I2nine Y
Table 8: Stability Test of SSC Method: I0 = 0.32 ∗ (2wr) ∗ hr3
I SSC(setMaxOrder(2,1)) SSC(setMaxOrder(1,0), setFlagErrorTest(2,false),
SetGapControl(true, 1))
I2 = 1 ∗ I2nine
I2 = 21 ∗ I2nine
I2 = 41 ∗ I2nine
I2 = 61 ∗ I2nine
I2 = 81 ∗ I2nine
4.3 Comparison of RCM and Cosserat
4.3.1 Different Parameters Setting
4.3.2 Integrator for the RCM
TimeSteppingIntegrator: 1e-6(Converge) 5e-5(No convergence) 1e-5(No convergence)
4.3.3 Elastic
When I2 = 1 ∗ I2nine, the results of both methods show a high elastic property.
4.3.4 Results
The overall tend of results of these two methods are the same. The results of RCM model is
oscillating locally. A new simulation with time step size equals 1e − 9 is executed to check whether
the oscillating is generated as the timestep size is large(1e − 6). The simulation is still running.
4.3.5 Performance Analysis
1. Preparation: compile the code with debugging info (the -g option) and with optimization
(-O3) turned on.
2. To start a profile run for a program, execute: ”valgrind –tool=callgrind –cache-sim=yes
–branch-sim=yes ./main”, an output file named callgrind.out.pid will be generated.
3. Visualization the output file: use the tool caller Kcachegrind. Usage: ”kcachegrind
cachegrind.out.pid”
The meaning of some abbreviation in Kcachegrind:
Ir : Instruction Read
5
6. Table 9: Different Parameters Setting of RCM and Cosserat
I RCM Cosserat
I1 t-n t-b
I2 t-b t-n
I0
setCurlRadius R1: t-n R2: t-b R1: t-b R2: t-n
Incl. : Inclusive Cost. Cost attributes for functions regarding some event type, including all called
functions.
Self : Exclusive Cost. Cost attributes for functions regarding some event type, only of the
function itself.
Table 10: Performance of Cosserat
initialize updateStateDependentVariables updateJacobians updateG
TimeStepping 17.27 - 19.74 34.2
SSC 5.92 5.06 15.67 54.37
theta - 29.52 45.66 -
Table 11: Performance of RCM
initialize updateJacobians updateG updateg updateh
TimeStepping 35.8 9.03 22.43 11.91 9.63
SSC 19.33 6.67 33.04 10.08 7.01
6