This document summarizes a PPDG focus meeting on robust file replication. Key points from presentations include:
- JLAB is developing a web services-based replication system using a MySQL replica catalog and HRM listeners.
- SRB is enhancing its replication capabilities to include asynchronous replica creation and metadata registration.
- Globus is developing a Replica Location Service and Reliable File Transfer Service built on top of its replica catalog.
- GDMP and MAGDA provide file replication functionality for CMS and ATLAS respectively, using Globus and custom components.
Discussion focused on defining common interfaces for replication services, addressing consistency and error handling, and the roles of components like replica catalogs, storage managers,
Ppdg Robust File Replication
Robust File Replication
PPDG Focus Meeting, January 10th, 2002
PPDG-11 V0.4
1 Introduction
2 Summary of Presentations
3 Summary of Discussion Sessions
4 Results of and Proposals from the Meeting
5 Architecture Diagrams
6 Appendix
1 Introduction
The Particle Physics Data Grid SciDAC Collaboratory Pilot includes as one of its core work areas “Robust data
movement and replication” (CS-5 and CS-6). The four participating Computer Science groups are developing Grid
middleware to address components or integrated solutions for these services. The six experiments are deploying file
replication services into production. These start from the use of generic FTP, move through initial parallel-stream
FTPs such as bbFTP and gsiFTP, use catalogs of varying sophistication to track and manage the distributed file sets,
and add experiment-specific higher-level components to build end-to-end applications which users can invoke and with
which end users, developers and integrators can interact. PPDG sponsors these groups to integrate and deploy their
replication applications and to share functionality and performance requirements, experience and plans. PPDG then
acts to promote common components and interfaces, consistency and interoperability of appropriate middleware and
standards.
This PPDG report is the result of a one-day focus meeting on “Robust File Replication”. Appendix 1 gives the agenda
of and attendees at the meeting. The meeting reflected the accomplishments of a great deal of work by all the
participating groups. There was clear interest in, and readiness for, discussion across the groups of future work,
technical and practical issues, and directions.
This report is in 2 sections. It attempts to capture key points of the application and technology presentations to
provide the background needed to identify a list of future activities of the project, and necessary and relevant areas
of work which would benefit from future discussion. The report does not include the information presented in the talks
at the meeting. The reader is referred to the slides and documents posted on the meeting agenda page for more detailed
information: http://www.ppdg.net/mtgs/10jan-02/agenda.htm
2 Summary of Presentations
2.1 JLAB
All software is based on Web services.

Replica Catalog:
Deep name tree. Translation from GFN (Global File Name) to SURL (site URL: host name of the site plus a URL, which
includes the protocol and the site). This is a logical string which can be “redirected” to the actual physical
site. The intent is that the naming semantics are those of links and collections.
Rejected the Globus replica catalog because it does not support deep trees. Not
a challenging database design to do this. Using MySQL. Performance is not an
issue because the hardware can be improved. End-to-end locking and transaction rate.

HRM Listener (Application-level Agent):
Glue between the local site and the global replica catalog. Listens to local HRM
actions and informs other services. A planner? An information server? Each
storage system has an HRM listener. HRM is part of the VO.
Where are there one to many? Switch from Grid to VO: this needs more
thought. Wrapper on Jasmine. SOAP + MySQL.

Replication Service:
Handles requests to make replicas at a higher level than the replica catalog.
Handles space requests etc. The File Client does not do this. The File Transfer
Service does not manage space. Who does?

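The GFN-to-SURL translation over a deep name tree described above can be sketched as follows. This is a minimal in-memory illustration, not the JLAB implementation: the class and method names, the example GFNs and SURLs, and the link-hop limit are all assumptions made for the example, and a plain Python dictionary tree stands in for the MySQL tables mentioned in the notes.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Node:
    """One entry in the name tree: a collection, a file, or a link (hypothetical model)."""
    children: dict = field(default_factory=dict)  # members of a collection, by name
    surls: list = field(default_factory=list)     # site URLs of the replicas, if a file
    link_target: Optional[str] = None             # GFN this entry redirects to, if a link


class ReplicaCatalog:
    """Toy replica catalog keyed by arbitrarily deep Global File Names."""

    def __init__(self):
        self.root = Node()

    def _walk(self, gfn, create=False):
        # Descend one path component at a time, so the tree can be any depth.
        node = self.root
        for part in gfn.strip("/").split("/"):
            if part not in node.children:
                if not create:
                    raise KeyError(gfn)
                node.children[part] = Node()
            node = node.children[part]
        return node

    def register(self, gfn, surl):
        """Record one more physical copy (SURL) of the logical file."""
        self._walk(gfn, create=True).surls.append(surl)

    def link(self, gfn, target_gfn):
        """Make gfn an alias for target_gfn."""
        self._walk(gfn, create=True).link_target = target_gfn

    def lookup(self, gfn, max_hops=8):
        """Translate a GFN to its list of SURLs, following links."""
        for _ in range(max_hops):
            node = self._walk(gfn)
            if node.link_target is None:
                return list(node.surls)
            gfn = node.link_target
        raise RuntimeError("too many links while resolving " + gfn)
```

A caller would register each copy under its GFN and call `lookup` to obtain the candidate SURLs. The hop limit is only a guard against link cycles and is not something the meeting notes specify.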
1. Recommend that a definition of web accessible services should be included in the PPDG Architecture. Is
this a minority opinion? Globus has stated it is a direction they are moving in. The next generation of replica
catalog is defined to be a web service.
2. How should PPDG be defining the web services interfaces? SRB will work with JLAB. Mapping between the
representations can be easily done. How does one define the meaning of the schema? Agree on a minimum
set? Can this be done as a joint effort, or is it several parallel efforts? All results should be posted to the
PPDG web sites for comment while in progress, rather than as a “final draft for review”. A draft of the JLAB
implementation is posted to the meeting web page.
3. GridFTP interaction with Storage Resource managers? Need a discussion with Ian, Carl and Arie. SRM
document addresses some of the issues.
4. Need to communicate error information back through the services and/or layers?
2.2 SRB
SRB Enhancements for BaBar:
SRB->HPSS Driver (Glue): Connect metadata in SRB DB to HPSS files
SRB Server (Extend Common Middleware): Extension to use new driver to HPSS and make server support SLAC
Remote Proxy (Glue, DataCutter): Access to and bundling of file transfers.
User Client (BaBar)

Replication Services
Logical Name Space: Replication is a capability “in the logical name space”. Replication
integrated into SRB system. Locking done with timeouts. Inconsistencies
can occur.
Registration of Digital Objects: Files, Blobs, Database command sequences, URLs. Can see information
from different databases.
Aggregation: Container replication; synchronization; staging. Can have a Container that
represents a whole site.
Replica Creation: Synchronous, Asynchronous – out of band. (from PPDG requirements)
Replica Access: Automated fail over to alternate copy
Latency Management
Data Transport
Meta Data Transport

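The “automated fail over to alternate copy” behaviour listed under Replica Access can be illustrated with a short sketch. It is not SRB code: the function name, the `fetch` callable and the use of `OSError` to signal an unreachable replica are assumptions made for the example.

```python
def read_with_failover(replicas, fetch):
    """Try each replica in order and return (replica, data) from the first that works.

    `replicas` is an ordered list of replica locations and `fetch` is a
    caller-supplied function that returns the file contents or raises OSError.
    Both are hypothetical stand-ins for a real storage interface.
    """
    errors = []
    for replica in replicas:
        try:
            return replica, fetch(replica)
        except OSError as exc:
            # Remember why this copy failed so the caller sees every cause.
            errors.append((replica, str(exc)))
    raise OSError(f"all {len(replicas)} replicas failed: {errors}")
```

Collecting the per-replica errors, rather than discarding them, is one way to pass error information back up through the layers.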
1. Remote Proxy – possibility that will need scheduling service, and mechanism for improving efficiency of
file transfers.
2. Need to access metadata independent of file access. Need to provide bulk metadata import and
registration. Discovery based on attributes.
3. Storage System access and data transport interface are site specific.
4. Any thoughts on linking the BaBar metadata catalogs Oracle and Objectivity ? Complex but has been
done with Objectstore.
5. Asynchronous replica creation (k out of n is a success) using a background service was not requested
by BaBar nor implemented by SRB. Relation to partial result? Could benefit from more discussion.
6. Architecture
a. Storage Abstraction: is this, or should this be, a common component? How does it relate to the
HRM definition? Includes latency management.
b. Catalog Abstraction
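The "k out of n is a success" rule for asynchronous replica creation in item 5 can be sketched as follows. This is a minimal illustration, not SRB code: `replicate_k_of_n` and the `copy_fn` callback are hypothetical names, and the sketch waits for every background transfer to finish before deciding, so it also returns the partial result (which targets succeeded and which failed).

```python
from concurrent.futures import ThreadPoolExecutor, as_completed


def replicate_k_of_n(copy_fn, source, targets, k):
    """Copy source to every target in the background; succeed if at least k copies work.

    copy_fn(source, target) is a caller-supplied (hypothetical) transfer
    function that raises an exception on failure.
    Returns (succeeded, failed) lists of targets, i.e. the partial result.
    """
    succeeded, failed = [], []
    # One worker per target; max(1, ...) avoids an invalid pool size for an empty list.
    with ThreadPoolExecutor(max_workers=max(1, len(targets))) as pool:
        futures = {pool.submit(copy_fn, source, t): t for t in targets}
        for fut in as_completed(futures):
            target = futures[fut]
            try:
                fut.result()
                succeeded.append(target)
            except Exception:
                failed.append(target)
    if len(succeeded) < k:
        raise RuntimeError(
            f"only {len(succeeded)} of {len(targets)} replicas created, need {k}"
        )
    return succeeded, failed
```

Whether a caller should see the failed targets at all, or only the overall outcome, is exactly the "relation to partial result" question raised above.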
2.3 Globus (Giggle/Grin)
The first version of the Replica Catalog and Management Services is in production as part of Globus V2.0 and is integrated into
GDMP and EDG TestBed 1. The comments relate to the development of the new components: the Replica Location
Service (RLS), which augments the Replica Catalog, and the Reliable File Transfer Service, which is a component above
the File Transfer layer. The first prototype implementation of the RLS is scheduled for 4/02 and a production version for
integration with EDG TestBed 2 in 9/02.
Replica Catalog: File attributes are kept in a meta-data catalog, which is outside the domain of the Globus service?
Reliable Replication: Combine storage system operations with replica catalog updates.
Replica Selection: Estimate performance. Relies on Information Services.
Replica Location Service: Framework: Reliable Local State; Global State with Relaxed Consistency. To an end user the functionality will appear as equivalent to the set of Replica Catalog, Replica Selection and Replication Management.
Reliable File Transfer Service: Reliable transfer of byte streams. Built on top of GridFTP. http://www.mcs.anl.gov/~madduri/RFT.html
Reliable Replication Service: Who is responsible for establishing the reliability, and for verifying and determining the Catalog consistency? Catalogs within RLS include the Storage System catalog.
1. New implementation of Replica Catalog supports logical files in several collections (and containers?)
2. Name Space. Could one map to the UNIX file system name space? Is this something that PPDG wants
to input to? WP2: Does one need to define Name Space semantics? Is the definition of database
tables sufficient – ie arbitrary set of attributes that defines a name?
3. Use of “Collection” is overloaded:
a. Container/Aggregation. Same as a data object. Clusters.
b. Selection Set/Collections. Logical organization.
These are orthogonal.
4. Could Globus discuss the interfaces with JLAB and SRB before completing the definition of the interfaces for
RLS?
5. Difference between Replica Management and RLS was not completely clear?
6. Impact on End User of different consistency levels. Should be none except for performance? Depends
on the user API. User gets “probability” that file is in the stated location. This is always true.
a. Does End User get information that is “Wrong”? Possibly. But this is also true, given the errors that
can occur, of a design which completely guarantees consistency?
b. Does End User always get correct information but performance is affected? Yes.
7. Semantics of the Hints/Location Service needs to be separate from those of the File Delivery Service.
8. WP2 has seen no performance issues with current version of Replica Catalog.
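Items 6 and 7 treat catalog answers as hints: the user gets a "probability" that a file is at the stated location, and the location service is kept separate from file delivery. A minimal sketch of a client written to that assumption is shown below. The names `locate_replica`, `hint_index` and `exists` are hypothetical and do not correspond to any Globus or RLS API.

```python
def locate_replica(lfn, hint_index, exists):
    """Return the first hinted location that actually holds the file, or None.

    hint_index: dict mapping logical file name -> list of candidate locations;
                entries may be stale (relaxed consistency).
    exists:     caller-supplied check made against the storage site itself.
    """
    for location in hint_index.get(lfn, []):
        # A hint is only a candidate; confirm with the site before using it.
        if exists(location, lfn):
            return location
    return None
```

With this shape a stale hint costs an extra check (performance) rather than returning wrong information to the end user, which is the distinction drawn in items 6a and 6b.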
2.4 GDMP(CMS)
Grid Data Mirroring Package. V2.0 is included in EDG TestBed 1 and V2.x will be in VDT 1.0.
Publish/Subscription Manager | GDMP | Local catalogs (text files) keep lists and state.
Replica Catalog | Globus | Updated when replica “pulled”. Can be used as a push model with the GDMP layer.
File Copier | GridFTP | Interfaces to the Storage System.
Storage System Interface | | Looking at the HRM. How does the interaction happen?
Replica Optimizer | WP2 | Being designed. Is this potentially a “common component”? Workshop is at CERN week of Mar 15th.
1. GDMP works on Containers as well as single files. This is an enhancement to the Globus Replica
catalog/management.
2. Error recovery use cases.
a. May republish a file that already succeeded. Globus replica catalog refuses duplicate entry of
logical file.
b. May be knowledge in the catalog you don’t know about. Should protocol include a Transaction
Index and 2 phase commit?
c. Where is the responsibility to determine validity of catalog?
d. Is GDMP functionality replaced by the Globus Replica Location Service in the future? Not
completely. Will need the Publish/Subscribe layer.
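Error recovery case 2a (republishing a file that already succeeded, with the catalog refusing the duplicate entry) can be handled by making the publish step idempotent. The sketch below is an assumption-laden illustration: `ToyCatalog`, `DuplicateEntry` and `publish` are hypothetical stand-ins, not the Globus replica catalog API or GDMP code.

```python
class DuplicateEntry(Exception):
    """Raised by the toy catalog when a logical file name is already present."""


class ToyCatalog:
    """In-memory stand-in for a replica catalog that refuses duplicate entries."""

    def __init__(self):
        self._entries = {}

    def register(self, lfn, location):
        if lfn in self._entries:
            raise DuplicateEntry(lfn)
        self._entries[lfn] = location

    def lookup(self, lfn):
        return self._entries.get(lfn)


def publish(catalog, lfn, location):
    """Register lfn; treat a repeat of the same publish as success."""
    try:
        catalog.register(lfn, location)
        return "registered"
    except DuplicateEntry:
        # Same name and same location: an earlier attempt already succeeded.
        if catalog.lookup(lfn) == location:
            return "already-registered"
        # Same name, different location: a real conflict the caller must resolve.
        raise
```

This covers only the simple republish case; entries written by another party (case 2b) would still need something like the transaction index or two-phase commit mentioned above.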
2.5 MAGDA(Atlas)
MAGDA is being used and further developed by ATLAS as a vertically integrated framework available for testing,
experiment development and production use. gsiftp and scp are used for the file copy and MySQL as the database. To
date the other components are ATLAS developed.
Logical File Name Space: Supports collections and containers. Arbitrary string. Name is unique in a VO, includes Replica Number.
File Catalog: MySQL database. MySQL accelerator written by ATLAS for sets of database updates. Replica catalog loader written but not tested. No transaction locking to date.
Storage System: Data repository. Site + Location. Host can access a set of sites.
File Discovery Agent: Spider finds files and registers them.
Replication Service: Replication Operation done by tasks (Data Placement Jobs). Master Instance is a requirement, which addresses the consistency issue. Use scp/gsiftp. GDMP integration underway. Cost of access: only allow access from local cache and site. Automated optional delete of replica.
User Web Interface: Web pages for requests and status.
1. Consistency maintenance – Assured Current.
2. Trusted Files. Supports new versions of files which must be published. Can one rephrase this?
3. HEMP – Hybrid Event Store Metadata Prototype. Related to Data Signature work.
4. Replication Jobs. Data Movement scheduling needs a fuller discussion.
GDMP Issues:
1. One root disk directory per site
2. Subscription updates bring in all new data for a site
3. File collections not used
4. LFN fixed as ‘dir/filename’ (RC constraint)
5. Doesn’t catalog or directly manage files in MSS
6. Write access to tmp, etc disk areas required for all GDMP users
7. System state info (in files) only available locally
General discussion topics:
1. Policies for Storage and Access.
2. User view of MAGDA? Similarity of services with SAM and BaBar needs?
2.6 SAM(D0)
SAM is in production use by D0 as an integrated data grid system. The file handling, replication, routing services were
developed some time ago. The presentation focused on some of the robustness features in the file copying
components and deployment of the integrated distributed system – it is not a complete view.
Failover: If error from one replica, automatically fail over to another.
Cleanup: Release resources if task or job fails. Detection of abandoned jobs.
Responses to Errors: Timeout if resources held too long without action. Node error results in rerouting of the data to healthy nodes. Exit handler in User process which calls DH system.
Resilience: Automatic restart of servers and jobs. Retries of replication. Separate movement of data itself from that of the metadata to separate dependence on storage system and data catalogs.
Performance Tuning: Parallelize database access layer.
Integration Features: Validation agents. Error message translation and interpretation at Component Interfaces. Tunable timeouts at every interface. (No checksums.)
1. Timeouts as an error mechanism. Pluses and minuses.
2. Unexpected/incorrect behaviour of the layers one depends on (e.g. the file copier) takes a lot of time and work to
code for/around.
3. Complete logs help debugging and diagnosis.
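The failover, retry, timeout and logging features listed for SAM can be combined in a small sketch. This is not SAM code: `fetch_with_failover` and the `fetch` callback are hypothetical, and the timeout here is only checked between attempts, so it does not interrupt a transfer that hangs.

```python
import time


def fetch_with_failover(replicas, fetch, retries=2, timeout_s=30.0, clock=time.monotonic):
    """Try each replica in turn; retry a failing replica a few times, then fail over.

    fetch(replica) is a caller-supplied function that raises on error.
    Returns (replica_used, result, log); log records every attempt,
    successful or not, so that status is available for successes too.
    """
    deadline = clock() + timeout_s
    log = []
    for replica in replicas:
        for attempt in range(1, retries + 1):
            # Overall budget check; a hung fetch() is not interrupted by this.
            if clock() > deadline:
                raise TimeoutError(f"gave up after {timeout_s}s; attempts: {log}")
            try:
                result = fetch(replica)
                log.append((replica, attempt, "ok"))
                return replica, result, log
            except Exception as exc:
                log.append((replica, attempt, f"error: {exc}"))
    raise RuntimeError(f"all replicas failed; attempts: {log}")
```

Returning the attempt log even on success reflects the point that complete logs help debugging and diagnosis.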
2.7 STAR
STAR is working with the SRM project on the integration of the HRM implementation of the SRM standard in an
end-to-end application.
Replica Catalog: mysql
File transfer: Globus GridFTP
Storage Management: SRM, SRM-HRM. Retries work when there is a storage system error.
2.8 Babar
BaBar has a prototype of database replication using the SRB replication services. This prototype is being modified to
separate the catalog information in MCAT - leaving the core replication schema in MCAT and the BaBar extensions in
another DB.
2.9 Related Work - Condor
Condor developments were not reported in the meeting but are related to the topics at hand and are candidates for PPDG
work: Nest http://www.nestproject.org/ , ftp-lite http://www.cs.wisc.edu/condor/ftp_lite , the pluggable file
system http://www.cs.wisc.edu/condor/pfs and kangaroo http://www.cs.wisc.edu/condor/kangaroo
3 Summary of Discussion Sessions
These notes are from the scheduled and impromptu discussion sessions. As such they are incomplete and
reflect periods of time when the notetakers were otherwise engaged.
A JOB is a schedulable unit or a schedulable transaction.
3.1 Interfaces to Robust File Replication services:
MAGDA, Globus, SRB, SAM –
Web Services for this uniformity? Or Protocol Question – commands and/or attributes that are included.
Do we want to retrofit and/or wrap existing systems with the same interface definition but different
implementation.
Are there separate services for Replica Catalog interface and/or Replica Services.
Semantics of replica systems.
Assume live in a heterogeneous world and one implementation can talk to another implementation. May
require reimplementation.
EDG is not trying to solve the problems “of the whole world”. Bottom up approach and identify components.
Core set of capabilities.
For JLAB Publish/Subscribe is a Replica Policy.
Low level API for file transfer should not be dependent on whether being used in Replication or not.
Where does bulk transfer of data (a container of containers) fit? Is this a separate concept or not? Does it affect
the semantics and model of consumption of the data? Is there lazy consumption or not? Where do the policy
and planning interfaces occur? Can a file be regarded as a container that is then decomposed and partially
copied? This is a task for the SRB ASCI project.
How high up the service layers are we going to go? What are the collective and application level
components. Do we want/need to address the end user layers?
With reference to DGRA V2.09:
User Interface
Replica Management | 9.1 | register, move, copy
Replica Catalog Service | 7.3 | “catalog-only” requests and collection definition
Local Replica Catalog | 5.5.2 |
Storage Resource (system, element) | 5.1 | storage requests and information
Reliable Transfer | 9.1 | copy only requests
Publish/Subscribe
Is there a consistency mechanism as part of the API? Validation and transaction API? What is the semantic
for this?
How to address the fact that ‘place to memory’ and ‘place to disk’ can have the same semantics but are certainly not
replaceable and are not necessarily interoperable.
Need to discuss the State of the file as well as the Status of the replication and file storage/copy.
Coupling between Storage Element and Virtual Storage Element or Replica Catalog. Need to be careful
about wanting a full file system semantics of a unix file system.
Are people prepared to get together to work out the overlap and commonality between current
implementations. Then deliver this to PPDG. Should not take more than 2 months. Not clear what benefit
this would have – we have representatives of all the implementations available to review any common
proposals.
RLS. Local Catalog in next week or 2. Index Node specification – prototype version by the end of March.
Globus Replica Management API:
http://www-unix.globus.org/api/c/globus_replica_management/html/index.html
3.1.1 Redirection Proposal from BaBar
The BaBar redirection requirement and implementation proposal is linked from the agenda web page. It has
been previously discussed in PPDG meetings and was revisited here in light of the next round of
Globus/WP2 design and implementation work:
1. Redirection is part of the WP2 design for TestBed 2.
2. RLS allows a first level of indirection. Need to leave protocol open to allow later addition of this
redirection capability. This has been agreed for a while, but the implementation details still need to be worked out.
3. For web services interface – redirection is explicit in that there is a 2 step process for accessing the
byte stream in the SRM document.
4. Manual lookups – always doing a redirection.
Agreed that this issue is being addressed and the next discussion should be to review the implementation
after the first prototype version of RLS is released.
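The two-step access described in item 3 (a lookup that returns either a further redirection or the place to fetch the byte stream) can be sketched as a bounded resolution loop. This is an illustration only: `resolve`, the `lookup` callback and its `("redirect", ...)` / `("url", ...)` return convention are hypothetical and are not taken from the BaBar proposal, RLS or the SRM document.

```python
def resolve(name, lookup, max_hops=5):
    """Follow redirections until a transfer URL is returned.

    lookup(name) is a caller-supplied function returning either
    ("redirect", other_name) or ("url", transfer_url).
    Returns (transfer_url, names_visited).
    """
    seen = []
    for _ in range(max_hops):
        seen.append(name)
        kind, value = lookup(name)
        if kind == "url":
            return value, seen
        # Anything else is treated as a redirection to another name or service.
        name = value
    raise RuntimeError(f"too many redirections: {seen}")
```

The hop limit is a design choice of the sketch; it keeps a misconfigured pair of catalogs that redirect to each other from looping forever.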
3.2 Errors, Status, Error Handling, Reliability
Discussion was driven by the slides posted on the agenda web page.
Should one provide a layer that takes all error information and interprets it? Even if one can design a “perfect error
system”, one will always have to translate the information for some other component.
Strings vs Error Codes – give the Details or the Essence. Maximum length of string to have user read it. So
“Summary String” and “Detailed String”.
Need to address Status from success as well as failure e.g retries.
What is in the error and status handling that is better in the information/monitoring system?
Diagnosis and response can/should/is an independent activity? Who uses the information for what –
debugging , diagnosis, human response.
PPDG should decide what we want to do about Error Handling? Agreement that this is an important area
which always takes much work for end-to-end application and distributed system integration and
deployment.
Server Process and/or Service Machine died in the middle of a catalog/database update. Details are
different although report to the user is the same.
Should system be robust to system administrator deleting a logical file somewhere. In Giggle can make sure
local catalog and local storage are consistent. This might be too costly? What happens if one loses a file?
Status e.g. how many retries, automated failover information, of successful operations also important.
Definition of file STATEs is part of the overall understanding of error, status, consistency, robustness issues.
3.2.1 SAM
SAM status blocks were not included to date in the presentation. SAM keeps a nested stack of errors and
structures. All the information is contained in the structure. Ultimately printed as text.
http://d0db.fnal.gov/sam/doc/design/status.html , http://www.ppdg.net/mtgs/10jan-02/SAMErrorCode.idl.txt ,
http://www.ppdg.net/mtgs/10jan-02/SAM_Status.idl.txt . Examples:
>>>>>> Starting project with the Station Master
Defaulting to "new" dataset version
CORBA Exception, station is probably dead (Minor: 0 Completed: COMPLETED_NO)
% CERR 11-Sep-2001 15:57:02 SAMManager:sammgr -
%ERLOG-w SAM: PROJECT MASTER:
Project master error caught in SAMManager::locatePM()!
Error message: Project master unreachable!
Contact sam-users@fnal.gov!
sammgr 11-Sep-2001 15:57:02 SAMManager:sammgr -
SAMManager:sammgr Waiting for the project master (no timeout).
%ERLOG-e UNKNOWN:
CorbaUtil::Resolve:
'/SAMStations/central-analysis/09_11_01_15_56:Project' not found
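A nested stack of errors, with the “Summary String” and “Detailed String” split discussed in 3.2, might be modelled as below. This is a sketch under assumptions: the `Status` class and its fields are hypothetical and are not the structures defined in the SAM IDL files linked above.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Status:
    component: str
    summary: str                       # short string meant for the end user
    detail: str = ""                   # longer text for debugging and diagnosis
    ok: bool = False
    retries: int = 0                   # status matters for successes too
    cause: Optional["Status"] = None   # status reported by the layer below

    def chain(self) -> List["Status"]:
        """Return this status followed by each nested cause, outermost first."""
        out, cur = [], self
        while cur is not None:
            out.append(cur)
            cur = cur.cause
        return out

    def render(self) -> str:
        """Print the stack as indented text, one line per layer."""
        return "\n".join(
            f"{'  ' * depth}[{s.component}] {s.summary}"
            for depth, s in enumerate(self.chain())
        )
```

For example, a name-resolution failure wrapped by a manager-level error renders as two indented lines, with the detailed strings kept in the structure for later diagnosis.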
3.3 Interfaces on which Replication depends
The Data Grid Capabilities document (PPDG-8) was used as a basis for discussion. This document will be
recast into categories to map onto DGRA and MAGDA will be included.
Latency management – what are the technical details.
Robustness – capabilities not in common.
Asynchrony support
Consistency state.
Logical File Names:
1. Do Unix semantics for Logical File Names follow through into functionality, e.g. does the ACL for a directory affect the
ability to create new files? How does Authorization get affected? Is this part of the architecture/design?
Multi-part authorization process.
2. How does one do a Physics Meta-Data Query. Logical name space attributes or the name?
3. Are Names of Files “meaningful” or are they “strings that identify a set of meta-data”?
4. Does update of a file create a new entry? Should version be part of the significant name?
4 Results of and Proposals from the Meeting
4.1 Acceptance of Documents:
The following PPDG documents were accepted. Comments, changes, new versions are anticipated. These
documents are PPDG project document deliverables in Common Services CS-7.
PPDG-10 Numeric Requirements for the Replica Catalog Service V0.2
PPDG-9 Common Storage Resource Manager Operations V1.0
PPDG-8 Data Grid Implementations - Comparison of Capabilities, V6
This paper is proposed to be PPDG-11 - Robust File Replication, PPDG Focus Meeting Report
4.2 Statements of Direction:
There has clearly been a lot of progress in the design, implementation and deployment of Replication Services in the
PPDG experiments over the past year. Successes include:
a. End to end application tests by all experiments.
b. Delivery and prototype use of new Globus Replication services and extension of SRB and HRM
common services.
c. Accepted common terminology and use of Data Grid Reference Architecture definitions.
d. Documenting performance requirements and system capabilities.
e. Progress on more detailed interface, architecture and protocol definitions
f. Inter-team discussions on new designs and interfaces.
PPDG will continue to collaborate with EDG on GDMP in its developments for WP2 TestBed 2 and integration with
Giggle. Ppdg-exec should discuss this with EDG/WP2, CMS and Globus as part of PPDG Year 2 planning.
a. Need to define which pieces to leave as GDMP specific layer. Is GDMP still a “CMS specific”
PPDG project activity? For EDG it is not CMS specific.
b. Need to address the issue of GDMP V2 support as V3 is developed and deployed.
JLAB/SRB Project Activity service specification will be the nascent protocol definition for Replica Management for
PPDG review/input and adoption. This is a possible discussion topic for the Feb PPDG collaboration meeting if there is
time. It is possible that Globus might be able to consider contributing to and/or reviewing this.
While there is continued concern at multiple implementations in experiments of file transfer and replica management it
is clear that during this phase of the project it is most constructive to be exploring different ideas and directions as a
precursor to moving towards more commonality. We expect continued discussion of this issue.
There is still significant work to be done to have a Robust File Replication system that meets the needs of all the PPDG
application groups.
4.3 Action Items:
SRB/JLAB interface to Storage Element and Replication Management (web service definition) | JLAB, SRB document first draft | 2/20/02
GridFTP interaction with Storage Resource managers | ppdg-exec phone con with Carl, Ian, Bill, Arie to initiate the discussion |
Container and Collection consistency in use | ppdg-exec review PPDG documents | 2/20/02
GDMP and RLS issues (ATLAS GDMP issues, PPDG Year 2 planning, Master Replica) | ppdg-exec phone con with GDMP, WP2, CMS, ATLAS, Globus, Andy | 2/20/02
Review next version of Globus Replication development | Agenda of PPDG phone con | 1/30/02
Review Local Replica Catalog Interface | Agenda of PPDG phone con, AC, SM | 1/30/02
Data movement scheduling | Agenda of PPDG phone con | Before 4/02
Error Reporting, Handling and Response in the PPDG Environment | 2 page paper from ppdg-exec. Agenda of PPDG phone con | 2/20/02, 3.02
Revisit outcomes | Another focus meeting | Decide in April.
5 Architecture Diagrams
5.1 SRB
[Figure: SRB mapped to PPDG/DGRA Architecture. Layer labels: Fabric and Connectivity, Resource, Collective, Domain. Other labels recovered from the diagram: Application Framework; Information & Monitoring; Logical Name Space; Grid Scheduler; Replica Optimization; Replica Attributes; Job Management; Local Application; Experiment Databases; Configuration Management; Node Installation & Management; Monitoring and Fault Tolerance; Resource Management; Fabric Storage Management; Grid; Experiment Computing; BaBar Grid; Data Management; Metadata Management; Object to File Mapper; Computing Element Services; Authorisation, Authentication and Auditing; Catalog Management (catalog manipulation); Storage Services (storage abstraction); SQL Database Service; Service Index (URL / command registration); Consistency (metadata / data) (latency management / metadata).]
[Figure: SDSC Storage Resource Broker & Meta-data Catalog. Labels recovered from the diagram: Application; Unix Shell; Java, NT Browsers; Web; Prolog Predicate; C, C++ Libraries; Linux I/O; DLL / Python; HRM; Clients; Servers; Prime Server; Storage Abstraction; Catalog Abstraction; Logical Name Space; Latency Management; Data Transport; Metadata Transport; Consistency Management / Authorization-Authentication; Archives (HPSS, ADSM, UniTree, DMF); Databases (DB2, Oracle, Postgres); File Systems (Unix, NT, Mac OSX); Databases (DB2, Oracle, Sybase).]
5.2 SAM
[Figure: SAM architecture. Legend: a name in “quotes” is a SAM-given software component name; a marker indicates a component that will be replaced, added or enhanced using PPDG and Grid tools. Layer labels: Client Applications; Collective Services; Connectivity and Resource; Authentication and Security; Fabric. Other labels recovered from the diagram: Web; Python codes, Java codes; Command line; D0 Framework C++ codes; Request Formulator and Planner; “Dataset Editor”; “File Storage Server”; “Project Master”; “Station Master”; “Stager”; “Optimiser”; Storage Manager; Job Manager; Cache Manager; Request Manager; Data Mover; Job Services; SAM Resource Management; Batch Systems - LSF, FBS, PBS, Condor; Significant Event Logger; Naming Service; Database Manager; Catalog Manager; Catalog protocols; CORBA; UDP; File transfer protocols - ftp, bbftp, rcp, GridFTP; Mass Storage systems protocols e.g. encp, hpss; GSI; SAM-specific user, group, node, station registration; Bbftp ‘cookie’; Tape Storage Elements; Disk Storage Elements; Compute Elements; LANs and WANs; Resource and Services Catalog; Replica Catalog; Meta-data Catalog; Code Repository.]
5.3 JLAB
[Figure: Data Grid Web Services Architecture. Labels recovered from the diagram: File Client; Web Services; Meta Data Catalog; Replica Catalog; HRM++ Service; Replication Service; Storage Resource; File Server(s); HRM Listener; Single Site.]
5.4 WP2
[Figure: WP2 Replication Services - Overview.]
5.5 ATLAS
[Figure: Magda Architecture. Labels recovered from the diagram: Site; Location; Host 1; Host 2; Cache; Disk; Mass Store; Source to cache stagein; Source to dest transfer; MySQL; Synch via DB; Replication task; Collection of logical files to replicate; Spider; scp, gsiftp; Register replicas; Catalog updates.]
6 Appendix
6.1 Jan 10th Meeting Agenda
GriPhyN/PPDG Data Grid Architecture, Toolkit, and Roadmap v2.09, v2.07s
EDG Work Package 2 Replication Requirements: 6/01; 1/02
Storage Resource Management Interface V1.0
Time | Topic | Speaker or Discussion | Documentation / Presentation
Ongoing Work
9:00am | Welcome | Chip Watson |
9:10am | Introduction | Ruth Pordes | Slides
9:15am | JLAB | Chip Watson | Talk: Web services for replicated file management, A data analysis grid
9:30am | SRB | Reagan Moore | Talk: SRB and the discussion session
9:45am | Globus (Giggle/Grin) | Ann Chervenak | Draft Paper. GGF presentation, Reliable File Transfer
11:00am | GDMP(CMS) | Heinz Stockinger | Talk: GDMP documentation
11:15am | MAGDA(Atlas) | Torre Wenaus | Talk: Magda Documentation
11:30am | SAM(D0) | Vicky White | Talk: SAM home page
11:45am | STAR | Eric Hjort | Talk
12:00pm | Babar | Adil Hasan | SRB in Babar
12:30pm | Lunch | |
1:30pm | Common interfaces to services this layer provides | Andy Hanushevsky | Requirements/Interfaces for: catalog; queueing of replication requests; reliable execution of these requests; replication policy specification. Redirection issue: Paper, Proposal
2:30pm | Status, Errors, Asynchrony | Doug Thain | Talk
3:30pm | Common interfaces to provider services | Reagan Moore | Talk (second half of slides above). Requirements/Interfaces to the services robust replication consumes -- HRM, file transfer. Comparison of data grid capabilities
4:30pm | Break | |
5:00pm | What has been learned, next steps, goals etc | |
6:00pm | Dinner/End | |
6.2 Jan 10th Participants
Walt Aker - JLAB
Bill Allcock - ANL
Jie Chen - JLAB
Ying Chen - JLAB
Ann Chervenak - ISI
Peter Couvares - Uwisc
Ewa Deelman - ISI
Andy Hanushevsky - SLAC
Bryan Hess - JLAB
Eric Hjort - LBL
Andy Kowalski - JLAB
Miron Livny - Uwisc
Reagan W. Moore - SDSC
Richard Mount (VRVS) - SLAC
Shazhad Muzaffar - Fermilab
Ruth Pordes - Fermilab
Heinz Stockinger - CERN
Doug Thain - Uwisc
Yee-Ting Li (VRVS) - UCL, UK
Chip Watson - JLAB
Torre Wenaus - BNL
Vicky White - DOE
Mike Wilde - ANL
Bing Zhu - SDSC