This document discusses integrating the Generally Accepted Recordkeeping Principles (GARP®) framework with electronic discovery (eDiscovery) best practices. It provides an overview of GARP® and eDiscovery, focusing on the people, processes, and technologies involved. It then discusses how to integrate the two through case studies on implementing information governance programs in different organizations. The key takeaway is to understand both GARP® and eDiscovery in order to effectively integrate the two domains through people, processes, and technologies.
Securing Enterprise Healthcare Big Data by the Combination of Knox/F5, Ranger...DataWorks Summit
Data security is critical to the success of large enterprises such as Mayo Clinic (MC). There is no exception for healthcare data stored on the enterprise Big Data platforms. At MC, healthcare Big Data ingestion, storage, processing and analytics are all in the enterprise-secured environments including Sandbox, Dev, Int/Test and Prod Hadoop clusters. The primary data security in the enterprise-secured Hadoop clusters has been achieved at MC by the combination of Knox Gateway/F5 Balancer, Ranger authorization/auditing, Two Factor local authentication (TFA) and Kerberos authentication that are coupled to MC Active Directory and LDAP. In other words, any major HDFS, HBase and Hive healthcare data operations at MC have to go through the dedicated Knox Gateway or F5 balancer (for Knox HA) via Rest API, which interacts with Ranger and other primary security components involved. The data security on the Big Data platforms at MC is going to be strengthened by the on-going network segmentation and SSL enabling on the related Hadoop ecosystem components. The above approaches adopted on MC Big Data platforms have significantly improved the security of data for the success of MC Big Data program although the data need high-skilled clients or applications to use.
Apache Atlas. Data Governance for Hadoop. Strata London 2015Sean Roberts
Apache Hadoop is being adopted across all industries for its ability
to store and process an abundance of new types of data in a modern data architecture. But this “Any Data” architecture presents a challenge when organizations must reconcile data management realities and as they bring existing and new data from disparate platforms under management.
Apache Atlas proposes to provide governance capabilities in Hadoop that use both a prescriptive and forensic models enriched by business taxonomical metadata. It is designed to exchange metadata with other tools and processes within and outside of the Hadoop stack, thereby enabling platform-agnostic governance.
Blockchain & Security in Oracle by Emmanuel AbiodunVishwas Manral
Enterprise customer adoption of blockchain technologies and Fabric, in particular, depends on simplifying the deployment and provisioning of all the underlying dependencies, creating a resilient and supportable platform for development and day-to-day operations, rapidly integrating the applications that interact with Fabric smart contracts to run transactions or query the ledger in a secure and compliant manner. This session will describe how Hyperldeger Fabric can be deployed into and leverage modern cloud platform capabilities while keeping governance, compliance, and security in-tact. The technical requirements and integration points will be discussed and specific areas illustrated based on Oracle Cloud Infrastructure.
Securing Enterprise Healthcare Big Data by the Combination of Knox/F5, Ranger...DataWorks Summit
Data security is critical to the success of large enterprises such as Mayo Clinic (MC). There is no exception for healthcare data stored on the enterprise Big Data platforms. At MC, healthcare Big Data ingestion, storage, processing and analytics are all in the enterprise-secured environments including Sandbox, Dev, Int/Test and Prod Hadoop clusters. The primary data security in the enterprise-secured Hadoop clusters has been achieved at MC by the combination of Knox Gateway/F5 Balancer, Ranger authorization/auditing, Two Factor local authentication (TFA) and Kerberos authentication that are coupled to MC Active Directory and LDAP. In other words, any major HDFS, HBase and Hive healthcare data operations at MC have to go through the dedicated Knox Gateway or F5 balancer (for Knox HA) via Rest API, which interacts with Ranger and other primary security components involved. The data security on the Big Data platforms at MC is going to be strengthened by the on-going network segmentation and SSL enabling on the related Hadoop ecosystem components. The above approaches adopted on MC Big Data platforms have significantly improved the security of data for the success of MC Big Data program although the data need high-skilled clients or applications to use.
Apache Atlas. Data Governance for Hadoop. Strata London 2015Sean Roberts
Apache Hadoop is being adopted across all industries for its ability
to store and process an abundance of new types of data in a modern data architecture. But this “Any Data” architecture presents a challenge when organizations must reconcile data management realities and as they bring existing and new data from disparate platforms under management.
Apache Atlas proposes to provide governance capabilities in Hadoop that use both a prescriptive and forensic models enriched by business taxonomical metadata. It is designed to exchange metadata with other tools and processes within and outside of the Hadoop stack, thereby enabling platform-agnostic governance.
Blockchain & Security in Oracle by Emmanuel AbiodunVishwas Manral
Enterprise customer adoption of blockchain technologies and Fabric, in particular, depends on simplifying the deployment and provisioning of all the underlying dependencies, creating a resilient and supportable platform for development and day-to-day operations, rapidly integrating the applications that interact with Fabric smart contracts to run transactions or query the ledger in a secure and compliant manner. This session will describe how Hyperldeger Fabric can be deployed into and leverage modern cloud platform capabilities while keeping governance, compliance, and security in-tact. The technical requirements and integration points will be discussed and specific areas illustrated based on Oracle Cloud Infrastructure.
Designing Effective Storage Strategies to Meet Business NeedsBrian Anderson
In this presentation I presented ideas on designing a modern tiered storage infrastructure. I covered the basic strategies and requirements of tiers 1/2/3, object-based, cloud, and edge storage, along with the importance of categorizing data sets so that you can ultimately build a solid blueprint and business case. Other topics included transitioning to an effective tiered storage model, controlling storage growth, and emerging ideas and technologies for data storage.
Designing Effective Storage Strategies to Meet Business NeedsEagle Technologies
In this presentation we present EAGLE's ideas on designing a modern tiered storage infrastructure. We will cover the basic strategies and requirements of tiers 1/2/3, object-based, cloud, and edge storage, along with the importance of categorizing data sets so that you can ultimately build a solid blueprint and business case. Other topics include transitioning to an effective tiered storage model, controlling storage growth, and emerging ideas and technologies for data storage.
Hadoop based data Lakes have become increasingly popular within today’s modern data architectures for their ability to scale, handle data variety and low cost. Many organizations start slow with the data lake initiatives but as they grow bigger, they suffer with challenges on data consistency, quality and security, resulting in losing confidence in their data lake initiatives.
This talk will discuss the need for good data governance mechanisms for Hadoop data lakes and it relationship with productivity and how it helps organizations meet regulatory and compliance requirements. The talk advocates carrying a different mindset for designing and implementing flexible governance mechanisms on Hadoop data lakes.
Oracle Application User Group sponsored Collaborate 2009 Presentation 'Building a Practical Strategy for Managing Data Quality' by Alex Fiteni CPA, CMA
Solving Real Problems with Apache Spark: Archiving, E-Discovery, and Supervis...Spark Summit
Today there are several compliance use cases — archiving, e-discovery, supervision + surveillance, to name a few — that appear naturally suited as Hadoop workloads but haven’t seen wide adoption. In this talk, we’ll discuss common limitations, how Apache Spark helps, and propose some new blueprints as to how to modernize this architecture and disrupt existing solutions. Additionally, we’ll discuss the rising role of Apache Spark in this ecosystem; leveraging machine learning and advanced analytics in a space that has traditionally been restricted to fairly rote reporting.
EMC InfoArchive: a unified enterprise archiving platform that stores related structured data and unstructured content in a single, consolidated repository. This product enables corporations to preserve the value of enterprise information in a single, easily accessible, unified archive.
FLEXIBLE, UNIFIED ARCHIVE
InfoArchive ingests structured data and unstructured content in a single, unified archive, providing a holistic view of related information.
ARCHIVE FOR COMPLIANCE
Gain a long-term, compliant archive, meeting retention requirements and ensuring auditability, defensibility, and easy accessibility when needed.
COST REDUCTIONS
Leverage cost savings in infrastructure, administration, and operations by archiving static and valuable information. Achieve even more significant cost savings by using InfoArchive to decommission legacy applications.
ENTERPRISE SCALABILITY
Archive hundreds of billions of static records including transactions and statements with high-volume, rapid ingestion of data.
Archiving, E-Discovery, and Supervision with Spark and Hadoop with Jordan VolzDatabricks
Today, there are several compliance use cases ‒ archiving, e-discovery, supervision and surveillance, to name a few
‒ that appear naturally suited as Hadoop workloads, but haven’t seen wide adoption. In this session, you’ll learn about common limitations, how Apache Spark helps and some new blueprints for modernizing this architecture and disrupt existing solutions. Additionally, we’ll review the rising role of Apache Spark in this ecosystem, leveraging machine learning and advanced analytics in a space that has traditionally been restricted to fairly rote reporting.
This presentation focuses on exploring the challenges that require adoption of advanced methods and tools that measure environmental sustainability performance.
Multi-Tenancy in Data Lakes are on the rise. When looking at multi-tenancy from the lens of data governance, a lot is changing the landscape, and the way we have been operating with respect to the governance model probably needs a rethink. It is time to think of Governance and its various entities as a first-class citizen in data architecture and bake it as part of the platform. We will look at the various aspects of governance, extending to accommodate the growing compliance and regulatory requirements and suggestive architectural approaches to realize the same.
Implementing access and security controls across your applicationsDave Reik
SmartERP and SafePass discuss the process of establishing access and security policies within PeopleSoft and Oracle E-Business Suite. This session focuses on the areas of Oracle E-Business Suite and PeopleSoft that should be addressed as you implement continuous controls monitoring. We address how to implement Rules and Policies that cover Finance, Supply Chain and HR for a complete approach to securing all of your critical applications.
Get ahead of the cloud or get left behindMatt Mandich
An enterprise cloud computing strategy results in:
Broad consensus on goals and expected results of moving select processes to the cloud
Standardized, consistent approach to evaluating the benefits and challenges of cloud projects
Clear requirements for the negotiation and monitoring of partnerships with cloud service providers
Understanding and consensus on the enabling and managing role IT will play in future cloud initiatives
Goals and a roadmap for transforming internal IT from asset managers to service broker
UiPath Test Automation using UiPath Test Suite series, part 5DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 5. In this session, we will cover CI/CD with devops.
Topics covered:
CI/CD with in UiPath
End-to-end overview of CI/CD pipeline with Azure devops
Speaker:
Lyndsey Byblow, Test Suite Sales Engineer @ UiPath, Inc.
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfMalak Abu Hammad
Discover how MongoDB Atlas and vector search technology can revolutionize your application's search capabilities. This comprehensive presentation covers:
* What is Vector Search?
* Importance and benefits of vector search
* Practical use cases across various industries
* Step-by-step implementation guide
* Live demos with code snippets
* Enhancing LLM capabilities with vector search
* Best practices and optimization strategies
Perfect for developers, AI enthusiasts, and tech leaders. Learn how to leverage MongoDB Atlas to deliver highly relevant, context-aware search results, transforming your data retrieval process. Stay ahead in tech innovation and maximize the potential of your applications.
#MongoDB #VectorSearch #AI #SemanticSearch #TechInnovation #DataScience #LLM #MachineLearning #SearchTechnology
Designing Effective Storage Strategies to Meet Business NeedsBrian Anderson
In this presentation I presented ideas on designing a modern tiered storage infrastructure. I covered the basic strategies and requirements of tiers 1/2/3, object-based, cloud, and edge storage, along with the importance of categorizing data sets so that you can ultimately build a solid blueprint and business case. Other topics included transitioning to an effective tiered storage model, controlling storage growth, and emerging ideas and technologies for data storage.
Designing Effective Storage Strategies to Meet Business NeedsEagle Technologies
In this presentation we present EAGLE's ideas on designing a modern tiered storage infrastructure. We will cover the basic strategies and requirements of tiers 1/2/3, object-based, cloud, and edge storage, along with the importance of categorizing data sets so that you can ultimately build a solid blueprint and business case. Other topics include transitioning to an effective tiered storage model, controlling storage growth, and emerging ideas and technologies for data storage.
Hadoop based data Lakes have become increasingly popular within today’s modern data architectures for their ability to scale, handle data variety and low cost. Many organizations start slow with the data lake initiatives but as they grow bigger, they suffer with challenges on data consistency, quality and security, resulting in losing confidence in their data lake initiatives.
This talk will discuss the need for good data governance mechanisms for Hadoop data lakes and it relationship with productivity and how it helps organizations meet regulatory and compliance requirements. The talk advocates carrying a different mindset for designing and implementing flexible governance mechanisms on Hadoop data lakes.
Oracle Application User Group sponsored Collaborate 2009 Presentation 'Building a Practical Strategy for Managing Data Quality' by Alex Fiteni CPA, CMA
Solving Real Problems with Apache Spark: Archiving, E-Discovery, and Supervis...Spark Summit
Today there are several compliance use cases — archiving, e-discovery, supervision + surveillance, to name a few — that appear naturally suited as Hadoop workloads but haven’t seen wide adoption. In this talk, we’ll discuss common limitations, how Apache Spark helps, and propose some new blueprints as to how to modernize this architecture and disrupt existing solutions. Additionally, we’ll discuss the rising role of Apache Spark in this ecosystem; leveraging machine learning and advanced analytics in a space that has traditionally been restricted to fairly rote reporting.
EMC InfoArchive: a unified enterprise archiving platform that stores related structured data and unstructured content in a single, consolidated repository. This product enables corporations to preserve the value of enterprise information in a single, easily accessible, unified archive.
FLEXIBLE, UNIFIED ARCHIVE
InfoArchive ingests structured data and unstructured content in a single, unified archive, providing a holistic view of related information.
ARCHIVE FOR COMPLIANCE
Gain a long-term, compliant archive, meeting retention requirements and ensuring auditability, defensibility, and easy accessibility when needed.
COST REDUCTIONS
Leverage cost savings in infrastructure, administration, and operations by archiving static and valuable information. Achieve even more significant cost savings by using InfoArchive to decommission legacy applications.
ENTERPRISE SCALABILITY
Archive hundreds of billions of static records including transactions and statements with high-volume, rapid ingestion of data.
Archiving, E-Discovery, and Supervision with Spark and Hadoop with Jordan VolzDatabricks
Today, there are several compliance use cases ‒ archiving, e-discovery, supervision and surveillance, to name a few
‒ that appear naturally suited as Hadoop workloads, but haven’t seen wide adoption. In this session, you’ll learn about common limitations, how Apache Spark helps and some new blueprints for modernizing this architecture and disrupt existing solutions. Additionally, we’ll review the rising role of Apache Spark in this ecosystem, leveraging machine learning and advanced analytics in a space that has traditionally been restricted to fairly rote reporting.
This presentation focuses on exploring the challenges that require adoption of advanced methods and tools that measure environmental sustainability performance.
Multi-Tenancy in Data Lakes are on the rise. When looking at multi-tenancy from the lens of data governance, a lot is changing the landscape, and the way we have been operating with respect to the governance model probably needs a rethink. It is time to think of Governance and its various entities as a first-class citizen in data architecture and bake it as part of the platform. We will look at the various aspects of governance, extending to accommodate the growing compliance and regulatory requirements and suggestive architectural approaches to realize the same.
Implementing access and security controls across your applicationsDave Reik
SmartERP and SafePass discuss the process of establishing access and security policies within PeopleSoft and Oracle E-Business Suite. This session focuses on the areas of Oracle E-Business Suite and PeopleSoft that should be addressed as you implement continuous controls monitoring. We address how to implement Rules and Policies that cover Finance, Supply Chain and HR for a complete approach to securing all of your critical applications.
Get ahead of the cloud or get left behindMatt Mandich
An enterprise cloud computing strategy results in:
Broad consensus on goals and expected results of moving select processes to the cloud
Standardized, consistent approach to evaluating the benefits and challenges of cloud projects
Clear requirements for the negotiation and monitoring of partnerships with cloud service providers
Understanding and consensus on the enabling and managing role IT will play in future cloud initiatives
Goals and a roadmap for transforming internal IT from asset managers to service broker
UiPath Test Automation using UiPath Test Suite series, part 5DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 5. In this session, we will cover CI/CD with devops.
Topics covered:
CI/CD with in UiPath
End-to-end overview of CI/CD pipeline with Azure devops
Speaker:
Lyndsey Byblow, Test Suite Sales Engineer @ UiPath, Inc.
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfMalak Abu Hammad
Discover how MongoDB Atlas and vector search technology can revolutionize your application's search capabilities. This comprehensive presentation covers:
* What is Vector Search?
* Importance and benefits of vector search
* Practical use cases across various industries
* Step-by-step implementation guide
* Live demos with code snippets
* Enhancing LLM capabilities with vector search
* Best practices and optimization strategies
Perfect for developers, AI enthusiasts, and tech leaders. Learn how to leverage MongoDB Atlas to deliver highly relevant, context-aware search results, transforming your data retrieval process. Stay ahead in tech innovation and maximize the potential of your applications.
#MongoDB #VectorSearch #AI #SemanticSearch #TechInnovation #DataScience #LLM #MachineLearning #SearchTechnology
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
Maruthi Prithivirajan, Head of ASEAN & IN Solution Architecture, Neo4j
Get an inside look at the latest Neo4j innovations that enable relationship-driven intelligence at scale. Learn more about the newest cloud integrations and product enhancements that make Neo4j an essential choice for developers building apps with interconnected data and generative AI.
Unlocking Productivity: Leveraging the Potential of Copilot in Microsoft 365, a presentation by Christoforos Vlachos, Senior Solutions Manager – Modern Workplace, Uni Systems
Pushing the limits of ePRTC: 100ns holdover for 100 daysAdtran
At WSTS 2024, Alon Stern explored the topic of parametric holdover and explained how recent research findings can be implemented in real-world PNT networks to achieve 100 nanoseconds of accuracy for up to 100 days.
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIVladimir Iglovikov, Ph.D.
Presented by Vladimir Iglovikov:
- https://www.linkedin.com/in/iglovikov/
- https://x.com/viglovikov
- https://www.instagram.com/ternaus/
This presentation delves into the journey of Albumentations.ai, a highly successful open-source library for data augmentation.
Created out of a necessity for superior performance in Kaggle competitions, Albumentations has grown to become a widely used tool among data scientists and machine learning practitioners.
This case study covers various aspects, including:
People: The contributors and community that have supported Albumentations.
Metrics: The success indicators such as downloads, daily active users, GitHub stars, and financial contributions.
Challenges: The hurdles in monetizing open-source projects and measuring user engagement.
Development Practices: Best practices for creating, maintaining, and scaling open-source libraries, including code hygiene, CI/CD, and fast iteration.
Community Building: Strategies for making adoption easy, iterating quickly, and fostering a vibrant, engaged community.
Marketing: Both online and offline marketing tactics, focusing on real, impactful interactions and collaborations.
Mental Health: Maintaining balance and not feeling pressured by user demands.
Key insights include the importance of automation, making the adoption process seamless, and leveraging offline interactions for marketing. The presentation also emphasizes the need for continuous small improvements and building a friendly, inclusive community that contributes to the project's growth.
Vladimir Iglovikov brings his extensive experience as a Kaggle Grandmaster, ex-Staff ML Engineer at Lyft, sharing valuable lessons and practical advice for anyone looking to enhance the adoption of their open-source projects.
Explore more about Albumentations and join the community at:
GitHub: https://github.com/albumentations-team/albumentations
Website: https://albumentations.ai/
LinkedIn: https://www.linkedin.com/company/100504475
Twitter: https://x.com/albumentations
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...Neo4j
Leonard Jayamohan, Partner & Generative AI Lead, Deloitte
This keynote will reveal how Deloitte leverages Neo4j’s graph power for groundbreaking digital twin solutions, achieving a staggering 100x performance boost. Discover the essential role knowledge graphs play in successful generative AI implementations. Plus, get an exclusive look at an innovative Neo4j + Generative AI solution Deloitte is developing in-house.
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Albert Hoitingh
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview. Including the concepts of Customer Key and Double Key Encryption.
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfPaige Cruz
Monitoring and observability aren’t traditionally found in software curriculums and many of us cobble this knowledge together from whatever vendor or ecosystem we were first introduced to and whatever is a part of your current company’s observability stack.
While the dev and ops silo continues to crumble….many organizations still relegate monitoring & observability as the purview of ops, infra and SRE teams. This is a mistake - achieving a highly observable system requires collaboration up and down the stack.
I, a former op, would like to extend an invitation to all application developers to join the observability party will share these foundational concepts to build on:
In his public lecture, Christian Timmerer provides insights into the fascinating history of video streaming, starting from its humble beginnings before YouTube to the groundbreaking technologies that now dominate platforms like Netflix and ORF ON. Timmerer also presents provocative contributions of his own that have significantly influenced the industry. He concludes by looking at future challenges and invites the audience to join in a discussion.
How to Get CNIC Information System with Paksim Ga.pptxdanishmna97
Pakdata Cf is a groundbreaking system designed to streamline and facilitate access to CNIC information. This innovative platform leverages advanced technology to provide users with efficient and secure access to their CNIC details.
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
Full-RAG: A modern architecture for hyper-personalizationZilliz
Mike Del Balso, CEO & Co-Founder at Tecton, presents "Full RAG," a novel approach to AI recommendation systems, aiming to push beyond the limitations of traditional models through a deep integration of contextual insights and real-time data, leveraging the Retrieval-Augmented Generation architecture. This talk will outline Full RAG's potential to significantly enhance personalization, address engineering challenges such as data management and model training, and introduce data enrichment with reranking as a key solution. Attendees will gain crucial insights into the importance of hyperpersonalization in AI, the capabilities of Full RAG for advanced personalization, and strategies for managing complex data integrations for deploying cutting-edge AI solutions.
Full-RAG: A modern architecture for hyper-personalization
Integrating garp e_discovery
1. Integrating GARP® With Your
eDiscovery Best Practices
Steven C. Markey, MSIS, PMP, CISSP, CIPP, CISM, CISA, STS-EV, CCSK, CompTIA Cloud
Essentials
Principal, nControl, LLC
Adjunct Professor
President, Cloud Security Alliance – Delaware Valley Chapter (CSA-DelVal)
2. Integrating GARP® With eDiscovery
• Presentation Overview
– GARP® Overview
– eDiscovery Overview
– Integrating GARP® With eDiscovery
– Use Case 1
– Use Case 2
– GARP® Supplements
3. Integrating GARP® With eDiscovery
• GARP® Overview
– What is it?
• Information Governance Framework
– Phases
• Accountability
• Transparency
• Integrity
• Protection
• Compliance
• Availability
• Retention
• Disposition
5. Integrating GARP® With eDiscovery
• eDiscovery Overview
– What Is It?
• Electronic Discovery
• Electronically Stored Information (ESI)
– Who Does It Involve?
• People
• Process
• Technology
6. Integrating GARP® With eDiscovery
• eDiscovery Overview
– People
• Internal
– Records & Information Management (RIM)
– Internal Counsel/Legal/Compliance
– IT
• External
– External Counsel
– Consultants/Contractors
18. Integrating GARP® With eDiscovery
• eDiscovery Cloud Solutions
– Software as a Service (SaaS)
– Platform as a Service (PaaS)
– Infrastructure as a Service (IaaS)
21. Integrating GARP® With eDiscovery
• eDiscovery Cloud Solutions
– PaaS
• Various Platform Vendors
– Build e-Discovery Modules Leveraging Existing Platform
» Not Much of a Market / Business Model
» Re-Create the Wheel
– IaaS
• Various Cloud Vendors
– Build eDiscovery Solution on IaaS Instance
» Market / Business Model = All Cloud
» Leverage Existing Licensing
» Analogous to Hosting
22. Integrating GARP® With eDiscovery
• Integrating GARP® With eDiscovery
– People
• RIM, Counsel & IT
– Process
• Legal Holds/Litigation Response
• Protection/Compliance/Retention/Disposition
– Technology
• System of Origination
– ECM/EDM
– WCM
– Collaboration
• eDiscovery System
– Presentation/Collection/Archival
30. Integrating GARP® With eDiscovery
• Integrating GARP® With eDiscovery
– Reality
• “It’s the economy stupid.” – lean budgets, project holds.
• Change is difficult.
• Keep all mentality pervades.
– OR, highest common denominator (retention requirements).
• Departments have different retention schedules.
• Some organizations are more manual than others.
• Some law cases take a LONG time.
– Concurrent investigations/lawsuits affect retention.
• Fads fade.
– Lean Six Sigma in financial services.
– Legacy (“old school”) mentality for leadership.
31. Integrating GARP® With eDiscovery
• Case Study 1
– Background
– Drivers
– Technologies
– Limitations
– Risks
– Lessons Learned
– Next Steps
32. Integrating GARP® With eDiscovery
• Case Study 1
– Background
• CIO Wants to Implement SharePoint – Nix File Shares
• Financial Services SMB
• Staff: IT, 6 FTEs; Compliance, 1 FTE
– Drivers
• Compliance
• Disjointed Processes/Inefficiencies
– Technologies
• Email: Exchange Server 2010
• EDM: SharePoint 2010
• Discovery: Backups, Then Symantec Enterprise Vault 10.0
33. Integrating GARP® With eDiscovery
• Case Study 1
– Limitations
• No Records & Info Mgmt (RIM) Program
– ARMA, GARP®….huh?
• Organizational Behavior/Culture
• Budget
• Skill-sets
• Resources
– Risks
• Stakeholder Buy-in
• CIO Political Capital
• Program Upkeep/Maintenance
• Capital Expenditure Requirements
34. Integrating GARP® With eDiscovery
• Case Study 1
– Lessons Learned
• Stakeholder Buy-in Was Huge
• Don’t Forget the Fiefdoms
• Healthy Dose of Skepticism
– Email Backups
• Those in the Trenches Were the Champions
– Especially Internal Sales
35. Integrating GARP® With eDiscovery
• Case Study 1
– Next Steps
• Iterative Implementation of SharePoint
• Test eDiscovery Functionality
• Implement Document Mgmt Training & Awareness
• Publish Naming Conventions & RIM SOPs
• Scheduled:
– Records Retention Schedule (RRS) Update
– Records Clean-out
– GARP® Self-Assessment
36. Integrating GARP® With eDiscovery
• Case Study 2
– Background
– Drivers
– Technologies
– Limitations
– Risks
– Lessons Learned
– Next Steps
37. Integrating GARP® With eDiscovery
• Case Study 2
– Background
• RIM Program Dealing w/ Multiple Mergers & Acquisitions
• Mid-sized Pharmaceutical (Manufacturing & Sales)
• Staff: RIM, 1 FTE w/ Other Responsibilities
– Drivers
• Resource Limitations
• Limited Domain Knowledge
• Disjointed Processes/Inefficiencies
– Technologies
• Email: Exchange Server 2008
• EDM: SharePoint 2007
• Discovery: Backups, Then Symantec Enterprise Vault 9.0
38. Integrating GARP® With eDiscovery
• Case Study 2
– Limitations
• Currently in Litigation Response
• Program Conflicts:
– Priority
– Budget
– Interest
• Organizational Integration
• Disjointed Processes
– Risks
• Compliance
• Program Upkeep/Maintenance
• Operating Expenditure Requirements
39. Integrating GARP® With eDiscovery
• Case Study 2
– Lessons Learned
• Selling Process Improvement Was Huge
– Process Workflow
– Litigation Response
– Archiving
• Sell the Program Too
– Use by Competitors
– Use by Smaller Organizations
– Maturity Through GARP®
• Don’t Forget the Fiefdoms
– Need Decentralized Support Though
• Healthy Dose of Skepticism
– Verbal Promises
40. Integrating GARP® With eDiscovery
• Case Study 2
– Next Steps
• Deploy Email Policy
• Implement GARP® Training & Awareness
• Scheduled:
– Records Clean-out
– GARP® Self-Assessment
– Integrated Litigation Response Test
» Offsite Archiving Vendor
» Benefits Administrator
» Payroll Administrator
45. Integrating GARP® With eDiscovery
• Presentation Take-Aways
– Know Information Governance (e.g. GARP®)
– Know eDiscovery
– Learn To Integrate The Two Through:
– People
– Processes
– Technologies