MS SQL Server is a database server product of Microsoft that enables users to write and execute SQL queries and statements. It consists of several features like Query Analyzer, Profiler, and Service Manager. Profiler is a monitoring tool used for performance tuning. Service Manager helps manage SQL Server instances. Multiple instances can run on a single machine with each having independent users, databases, and settings. BCP is a command line utility that bulk copies data. Query Analyzer allows writing and executing SQL queries.
SQL Server consists of several features including Query Analyzer, Profiler, and Service Manager. Profiler is a monitoring tool used for performance tuning that uses traces. Service Manager helps manage SQL Server instances. Each instance is hidden from others and has its own users, databases, and settings. BCP is a utility for bulk data transfer. Query Analyzer allows writing and executing queries. SQL Server databases contain objects like tables, views, stored procedures. System databases include master, model, msdb, and tempdb. Databases are created in master and contain data and log files. Select statements retrieve data using conditions and options. Data is inserted, updated, and deleted using statements. Joins combine data from multiple tables. Views store
The document provides an overview of MS SQL Server including its key features like Query Analyzer, Profiler, Service Manager, and Bulk Copy Program. It discusses instances, databases, database objects, joins, views, functions and sequences. The summary focuses on the high-level topics covered in the document.
This document provides an overview of Oracle database and relational database management systems (RDBMS). It defines key terms like data, database, DBMS, RDBMS and describes table structure with rows and columns. Popular RDBMS like Oracle, SQL Server, DB2, Teradata and open-source options like PostgreSQL and MySQL are listed. Core SQL commands, joins, constraints, transactions and other relational database concepts are described at a high level.
This document discusses various topics related to enterprise resource planning (ERP) systems and technologies. It defines ERP as business process management software that integrates applications to manage business functions. It describes the typical lifecycle of an ERP implementation project, including pre-evaluation, evaluation, project planning, gap analysis, reengineering, training, testing, and post-implementation. It also discusses ERP-related technologies like business intelligence, supply chain management, and customer relationship management.
A database is a collection of information organized in a way that allows a computer program to select desired data quickly. A traditional database is organized into fields, records, and files. A field contains a single piece of information, a record contains one set of fields, and a file contains records.
A database management system (DBMS) is a collection of programs that allows users to enter, organize, and select data in a database. It performs functions like user management, data creation/modification/access, and database maintenance. Popular DBMS include Microsoft Access, Oracle, MySQL, SQL Server, and others.
Good database systems have ACID properties - Atomicity, Consistency, Isolation, and Durability.
The document provides an overview of database concepts and features in Oracle, including fundamentals like data grouping and relationships, as well as operations on tables like insert, update, delete. It also covers queries with filters, joins, and aggregations, as well as other objects like views, sequences, indexes, triggers, and stored procedures. The document is intended as training material for the Oracle database.
This document provides an overview of in-memory databases, summarizing different types including row stores, column stores, compressed column stores, and how specific databases like SQLite, Excel, Tableau, Qlik, MonetDB, SQL Server, Oracle, SAP Hana, MemSQL, and others approach in-memory storage. It also discusses hardware considerations like GPUs, FPGAs, and new memory technologies that could enhance in-memory database performance.
This chapter discusses advanced SQL features including relational set operators like UNION and INTERSECT, different types of joins, subqueries, functions, views, triggers, stored procedures, cursors, and embedded SQL. It covers topics like using subqueries in the SELECT, WHERE, HAVING and FROM clauses, correlated subqueries, date/string/numeric functions, updatable views, procedural language features in PL/SQL including triggers and stored procedures, and static versus dynamic embedded SQL.
SQL Server consists of several features including Query Analyzer, Profiler, and Service Manager. Profiler is a monitoring tool used for performance tuning that uses traces. Service Manager helps manage SQL Server instances. Each instance is hidden from others and has its own users, databases, and settings. BCP is a utility for bulk data transfer. Query Analyzer allows writing and executing queries. SQL Server databases contain objects like tables, views, stored procedures. System databases include master, model, msdb, and tempdb. Databases are created in master and contain data and log files. Select statements retrieve data using conditions and options. Data is inserted, updated, and deleted using statements. Joins combine data from multiple tables. Views store
The document provides an overview of MS SQL Server including its key features like Query Analyzer, Profiler, Service Manager, and Bulk Copy Program. It discusses instances, databases, database objects, joins, views, functions and sequences. The summary focuses on the high-level topics covered in the document.
This document provides an overview of Oracle database and relational database management systems (RDBMS). It defines key terms like data, database, DBMS, RDBMS and describes table structure with rows and columns. Popular RDBMS like Oracle, SQL Server, DB2, Teradata and open-source options like PostgreSQL and MySQL are listed. Core SQL commands, joins, constraints, transactions and other relational database concepts are described at a high level.
This document discusses various topics related to enterprise resource planning (ERP) systems and technologies. It defines ERP as business process management software that integrates applications to manage business functions. It describes the typical lifecycle of an ERP implementation project, including pre-evaluation, evaluation, project planning, gap analysis, reengineering, training, testing, and post-implementation. It also discusses ERP-related technologies like business intelligence, supply chain management, and customer relationship management.
A database is a collection of information organized in a way that allows a computer program to select desired data quickly. A traditional database is organized into fields, records, and files. A field contains a single piece of information, a record contains one set of fields, and a file contains records.
A database management system (DBMS) is a collection of programs that allows users to enter, organize, and select data in a database. It performs functions like user management, data creation/modification/access, and database maintenance. Popular DBMS include Microsoft Access, Oracle, MySQL, SQL Server, and others.
Good database systems have ACID properties - Atomicity, Consistency, Isolation, and Durability.
The document provides an overview of database concepts and features in Oracle, including fundamentals like data grouping and relationships, as well as operations on tables like insert, update, delete. It also covers queries with filters, joins, and aggregations, as well as other objects like views, sequences, indexes, triggers, and stored procedures. The document is intended as training material for the Oracle database.
This document provides an overview of in-memory databases, summarizing different types including row stores, column stores, compressed column stores, and how specific databases like SQLite, Excel, Tableau, Qlik, MonetDB, SQL Server, Oracle, SAP Hana, MemSQL, and others approach in-memory storage. It also discusses hardware considerations like GPUs, FPGAs, and new memory technologies that could enhance in-memory database performance.
This chapter discusses advanced SQL features including relational set operators like UNION and INTERSECT, different types of joins, subqueries, functions, views, triggers, stored procedures, cursors, and embedded SQL. It covers topics like using subqueries in the SELECT, WHERE, HAVING and FROM clauses, correlated subqueries, date/string/numeric functions, updatable views, procedural language features in PL/SQL including triggers and stored procedures, and static versus dynamic embedded SQL.
Database is a collection of organized data that allows for easy updating and modification of stored data. Data is stored permanently in tables which organize data into rows and columns. SQL is the language used to access and modify database data using statements. DDL statements are used to define database schema through commands like Create, Alter, and Drop. DML statements manipulate data through Insert, Update, Delete, and Select commands. JDBC provides an API for connecting Java programs to databases to perform operations like executing statements and queries, and retrieving and modifying data.
This document provides an outline of a SQL Lab tutorial covering MySQL. It introduces SQL and connecting to MySQL. It then covers various MySQL commands including administration commands, data definition language commands to create/drop databases and tables, data manipulation language commands to insert, retrieve, update and delete records, and more advanced queries using concepts like joins, aggregation, and pattern matching. SQL is introduced as a standard language for accessing and manipulating database systems and working with different database programs.
Redshift is Amazon's cloud data warehousing service that allows users to interact with S3 storage and EC2 compute. It uses a columnar data structure and zone maps to optimize analytic queries. Data is distributed across nodes using either an even or keyed approach. Sort keys and queries are optimized using statistics from ANALYZE operations while VACUUM reclaims space. Security, monitoring, and backups are managed natively with Redshift.
ds 1 Introduction to Data Structures.pptAlliVinay1
This document provides an introduction and overview of data structures. It begins by defining key terms like data, information, and entities. It then discusses how data structures represent logical relationships between data elements and how they should be easy to process and represent relationships. The document classifies common data structures as linear, non-linear, homogeneous, non-homogeneous, dynamic, and static. It also provides examples of basic notations, algorithms, control structures, and applications of different data structure types like arrays, stacks, queues, linked lists, trees, and graphs. Finally, it discusses complexity analysis and the tradeoff between time and space.
This document provides an introduction to Excel, Word, and PowerPoint. It discusses the basics of spreadsheets in Excel including creating and formatting worksheets, calculations with formulas, and copying data to other programs. It also covers creating and formatting presentations in PowerPoint including adding slides, text, images, and charts. Finally, it discusses opening and viewing documents in Word and resources for learning more about Microsoft Office applications.
The document provides information about various SQL concepts like views, triggers, functions, indexes, and joins. It defines views as virtual tables created by queries on other tables. Triggers are blocks of code that execute due to data modification language statements on tables. Functions allow reusable code and improve clarity. Indexes speed up searches by allowing fast data retrieval. Joins combine data from two or more tables based on relationships between columns. Stored procedures are SQL statements with an assigned name that are stored for shared use.
The document provides an overview of data structures and algorithms. It discusses key topics like:
1) Different types of data structures including primitive, linear, non-linear, and arrays.
2) The importance of algorithms and how to write them using steps, comments, variables, and control structures.
3) Common operations on data structures like insertion, deletion, searching, and sorting.
4) Different looping and selection statements that can be used in algorithms like for, while, and if-then-else.
5) How arrays can be used as a structure to store multiple values in a contiguous block of memory.
This document provides an introduction and overview of PostgreSQL, including its history, features, installation, usage and SQL capabilities. It describes how to create and manipulate databases, tables, views, and how to insert, query, update and delete data. It also covers transaction management, functions, constraints and other advanced topics.
This document provides an overview of SQL concepts including:
- Data types like char, varchar, and null
- Core SQL commands like select, update, delete, truncate, and alter
- Joins like inner, left, and cross joins
- Transaction control with commit, rollback, and savepoints
- Exception handling and the ACID properties of transactions
- Other objects like views, sequences, indexes, and stored procedures
The document discusses various Python libraries used for data science tasks. It describes NumPy for numerical computing, SciPy for algorithms, Pandas for data structures and analysis, Scikit-Learn for machine learning, Matplotlib for visualization, and Seaborn which builds on Matplotlib. It also provides examples of loading data frames in Pandas, exploring and manipulating data, grouping and aggregating data, filtering, sorting, and handling missing values.
The document discusses Structured Query Language (SQL) and its basic statements. It covers:
- SQL is used to request and retrieve data from databases. The DBMS processes SQL queries and returns results.
- SQL statements are divided into DDL (data definition language) for managing schema, DML (data manipulation language) for data queries/modification, and DCL (data control language) for managing transactions and access control.
- The document provides examples of using SQL commands like CREATE TABLE, ALTER TABLE, DROP TABLE, INSERT, UPDATE, DELETE, SELECT and indexes. It also covers data types, constraints and operators used in SQL queries.
Introduction to MySQL Query Tuning for Dev[Op]sSveta Smirnova
To get data, we query the database. MySQL does its best to return requested bytes as fast as possible. However, it needs human help to identify what is important and should be accessed in the first place.
Queries, written smartly, can significantly outperform automatically generated ones. Indexes and Optimizer statistics, not limited to the Histograms only, help to increase the speed of the query a lot.
In this session, I will demonstrate by examples of how MySQL query performance can be improved. I will focus on techniques, accessible by Developers and DevOps rather on those which are usually used by Database Administrators. In the end, I will present troubleshooting tools which will help you to identify why your queries do not perform. Then you could use the knowledge from the beginning of the session to improve them.
It consists of 3 major parts:
1) Cassandra architecture / how reads and writes work - It is almost in alignment with official C* book by DataStax (pictures are from there) - It can be useful for those who either never used Cassandra or has some questions. During my presentation on-site I found that it makes sense to listen to this even for those who already read it sometime ago
2) Data Modeling on CQL3 - it can be helpful for those who never used Cassandra to learn CQL3 a little - as well as for those who worked with pre-CQL3 approach to understand what happens under the sweet CQL3 structures
3) Remaining things like DataStax Java Driver, C* known bugs
Learn best practices for taking advantage of Amazon Redshift's columnar technology and parallel processing capabilities to improve your data warehouse performance.
This document provides an introduction to data models and SQL. It discusses the relational data model where data is stored in tables/relations with rows and columns. It describes keys such as primary and foreign keys. The document then introduces SQL commands for creating tables, inserting, updating, deleting and querying data. It provides examples of using SQL with the SQLite database and discusses physical data independence.
Queries allow users to extract specific information from one or more database tables. There are different ways to create queries, including using design view, a wizard, or SQL view. Queries can include calculations, formatting, parameters, and summaries to provide flexible reporting of essential data.
This document provides an overview of an introductory training session on SQLite, a popular database for Internet of Things (IoT) applications. The agenda covers installing and configuring SQLite, basic commands like .tables and .schema, accessing databases using ATTACH and DETACH, data types, operators, and SQL statements like SELECT, INSERT, UPDATE, and DELETE. The session teaches the basics of using SQLite through examples of commands, queries, and making changes to databases.
SQL stands for Structured Query Language
SQL lets you access and manipulate databases
SQL became a standard of the American National Standards Institute (ANSI) in 1986, and of the International Organization for Standardization (ISO) in 1987
This document provides an introduction to PHP basics. It outlines how to set up a PHP development environment using XAMPP on Windows or Ubuntu. It explains how to clone a GitHub repository for learning PHP. The document then covers PHP syntax, variables, data types, operators, conditional statements, loops, functions, and superglobals. It concludes by providing some basic exercises to practice the covered PHP concepts.
Database is a collection of organized data that allows for easy updating and modification of stored data. Data is stored permanently in tables which organize data into rows and columns. SQL is the language used to access and modify database data using statements. DDL statements are used to define database schema through commands like Create, Alter, and Drop. DML statements manipulate data through Insert, Update, Delete, and Select commands. JDBC provides an API for connecting Java programs to databases to perform operations like executing statements and queries, and retrieving and modifying data.
This document provides an outline of a SQL Lab tutorial covering MySQL. It introduces SQL and connecting to MySQL. It then covers various MySQL commands including administration commands, data definition language commands to create/drop databases and tables, data manipulation language commands to insert, retrieve, update and delete records, and more advanced queries using concepts like joins, aggregation, and pattern matching. SQL is introduced as a standard language for accessing and manipulating database systems and working with different database programs.
Redshift is Amazon's cloud data warehousing service that allows users to interact with S3 storage and EC2 compute. It uses a columnar data structure and zone maps to optimize analytic queries. Data is distributed across nodes using either an even or keyed approach. Sort keys and queries are optimized using statistics from ANALYZE operations while VACUUM reclaims space. Security, monitoring, and backups are managed natively with Redshift.
ds 1 Introduction to Data Structures.pptAlliVinay1
This document provides an introduction and overview of data structures. It begins by defining key terms like data, information, and entities. It then discusses how data structures represent logical relationships between data elements and how they should be easy to process and represent relationships. The document classifies common data structures as linear, non-linear, homogeneous, non-homogeneous, dynamic, and static. It also provides examples of basic notations, algorithms, control structures, and applications of different data structure types like arrays, stacks, queues, linked lists, trees, and graphs. Finally, it discusses complexity analysis and the tradeoff between time and space.
This document provides an introduction to Excel, Word, and PowerPoint. It discusses the basics of spreadsheets in Excel including creating and formatting worksheets, calculations with formulas, and copying data to other programs. It also covers creating and formatting presentations in PowerPoint including adding slides, text, images, and charts. Finally, it discusses opening and viewing documents in Word and resources for learning more about Microsoft Office applications.
The document provides information about various SQL concepts like views, triggers, functions, indexes, and joins. It defines views as virtual tables created by queries on other tables. Triggers are blocks of code that execute due to data modification language statements on tables. Functions allow reusable code and improve clarity. Indexes speed up searches by allowing fast data retrieval. Joins combine data from two or more tables based on relationships between columns. Stored procedures are SQL statements with an assigned name that are stored for shared use.
The document provides an overview of data structures and algorithms. It discusses key topics like:
1) Different types of data structures including primitive, linear, non-linear, and arrays.
2) The importance of algorithms and how to write them using steps, comments, variables, and control structures.
3) Common operations on data structures like insertion, deletion, searching, and sorting.
4) Different looping and selection statements that can be used in algorithms like for, while, and if-then-else.
5) How arrays can be used as a structure to store multiple values in a contiguous block of memory.
This document provides an introduction and overview of PostgreSQL, including its history, features, installation, usage and SQL capabilities. It describes how to create and manipulate databases, tables, views, and how to insert, query, update and delete data. It also covers transaction management, functions, constraints and other advanced topics.
This document provides an overview of SQL concepts including:
- Data types like char, varchar, and null
- Core SQL commands like select, update, delete, truncate, and alter
- Joins like inner, left, and cross joins
- Transaction control with commit, rollback, and savepoints
- Exception handling and the ACID properties of transactions
- Other objects like views, sequences, indexes, and stored procedures
The document discusses various Python libraries used for data science tasks. It describes NumPy for numerical computing, SciPy for algorithms, Pandas for data structures and analysis, Scikit-Learn for machine learning, Matplotlib for visualization, and Seaborn which builds on Matplotlib. It also provides examples of loading data frames in Pandas, exploring and manipulating data, grouping and aggregating data, filtering, sorting, and handling missing values.
The document discusses Structured Query Language (SQL) and its basic statements. It covers:
- SQL is used to request and retrieve data from databases. The DBMS processes SQL queries and returns results.
- SQL statements are divided into DDL (data definition language) for managing schema, DML (data manipulation language) for data queries/modification, and DCL (data control language) for managing transactions and access control.
- The document provides examples of using SQL commands like CREATE TABLE, ALTER TABLE, DROP TABLE, INSERT, UPDATE, DELETE, SELECT and indexes. It also covers data types, constraints and operators used in SQL queries.
Introduction to MySQL Query Tuning for Dev[Op]sSveta Smirnova
To get data, we query the database. MySQL does its best to return requested bytes as fast as possible. However, it needs human help to identify what is important and should be accessed in the first place.
Queries, written smartly, can significantly outperform automatically generated ones. Indexes and Optimizer statistics, not limited to the Histograms only, help to increase the speed of the query a lot.
In this session, I will demonstrate by examples of how MySQL query performance can be improved. I will focus on techniques, accessible by Developers and DevOps rather on those which are usually used by Database Administrators. In the end, I will present troubleshooting tools which will help you to identify why your queries do not perform. Then you could use the knowledge from the beginning of the session to improve them.
It consists of 3 major parts:
1) Cassandra architecture / how reads and writes work - It is almost in alignment with official C* book by DataStax (pictures are from there) - It can be useful for those who either never used Cassandra or has some questions. During my presentation on-site I found that it makes sense to listen to this even for those who already read it sometime ago
2) Data Modeling on CQL3 - it can be helpful for those who never used Cassandra to learn CQL3 a little - as well as for those who worked with pre-CQL3 approach to understand what happens under the sweet CQL3 structures
3) Remaining things like DataStax Java Driver, C* known bugs
Learn best practices for taking advantage of Amazon Redshift's columnar technology and parallel processing capabilities to improve your data warehouse performance.
This document provides an introduction to data models and SQL. It discusses the relational data model where data is stored in tables/relations with rows and columns. It describes keys such as primary and foreign keys. The document then introduces SQL commands for creating tables, inserting, updating, deleting and querying data. It provides examples of using SQL with the SQLite database and discusses physical data independence.
Queries allow users to extract specific information from one or more database tables. There are different ways to create queries, including using design view, a wizard, or SQL view. Queries can include calculations, formatting, parameters, and summaries to provide flexible reporting of essential data.
This document provides an overview of an introductory training session on SQLite, a popular database for Internet of Things (IoT) applications. The agenda covers installing and configuring SQLite, basic commands like .tables and .schema, accessing databases using ATTACH and DETACH, data types, operators, and SQL statements like SELECT, INSERT, UPDATE, and DELETE. The session teaches the basics of using SQLite through examples of commands, queries, and making changes to databases.
SQL stands for Structured Query Language
SQL lets you access and manipulate databases
SQL became a standard of the American National Standards Institute (ANSI) in 1986, and of the International Organization for Standardization (ISO) in 1987
This document provides an introduction to PHP basics. It outlines how to set up a PHP development environment using XAMPP on Windows or Ubuntu. It explains how to clone a GitHub repository for learning PHP. The document then covers PHP syntax, variables, data types, operators, conditional statements, loops, functions, and superglobals. It concludes by providing some basic exercises to practice the covered PHP concepts.
This document summarizes recent advances in using deep learning to analyze magnetic resonance imaging (MRI) data from glioma patients. It discusses:
1) How MRI is used to diagnose gliomas and evaluate key biomarkers like tumor grade, molecular characteristics, and treatment response.
2) Publicly available datasets like the Multimodal Brain Tumor Segmentation Challenge that have helped develop new computational tools.
3) The current state-of-the-art in glioma image segmentation using deep learning models developed through challenges like BraTS.
The document discusses the evolution of SQL Server from 2000 to 2014, highlighting new features over time like XML, compression, and AlwaysOn availability groups. It focuses on the new in-memory capabilities in SQL Server 2014 like an in-memory optimized database engine and columnstore indexing that provide up to 10x performance improvements. Resources are provided for learning more about SQL Server 2014 and related products like Power BI, HDInsight, and Windows Azure.
This document summarizes new features in SQL Server 2019 including intelligent query processing, data classification and auditing, accelerated database recovery, data virtualization, SQL Server replication in one command, additional capabilities and migration tools, and a modern platform with Linux, containers, and machine learning services. It provides examples of how these features can help solve modern data challenges and gain performance without changing applications.
SQL Server is a relational database management system developed by Microsoft. It supports the SQL language and Microsoft's proprietary T-SQL language. Microsoft and Sybase originally released SQL Server 1.0 in 1989. Key editions include Enterprise, Standard, Web, Developer, and Express. Running multiple SQL Server instances on the same machine provides advantages like installation of different versions, maintenance of separate environments, and reducing temporary database problems. The document discusses how to create, alter, drop, backup, and restore SQL Server databases and tables. It also covers SQL Server data types and how to perform data manipulation operations like insert, update, and delete. Additionally, it explains how to define primary keys, foreign keys, unique constraints, check constraints, and
SQL Server 2016 provides a consistent platform for hybrid cloud environments with built-in in-memory capabilities, high performance, and enterprise-grade security and availability features. New capabilities in SQL Server 2016 include enhanced AlwaysOn availability groups for increased scalability, manageability and failover support. The document discusses SQL Server 2016's position as a leader in key industry analyses and outlines new features in high availability, in-memory technologies, and mobile and hybrid cloud capabilities.
This document provides an introduction to a course on Transact-SQL (T-SQL) taught by Graeme Malcolm from Microsoft and Geoff Allix from Content Master. The course covers topics such as querying tables, joins, functions, and error handling. It will use online video, labs, and the AdventureWorksLT sample database hosted on Azure SQL Database. The document outlines the course instructors, topics, expectations, lab environment, and resources for further learning SQL Server and certification.
Learn SQL from basic queries to Advance queriesmanishkhaire30
Dive into the world of data analysis with our comprehensive guide on mastering SQL! This presentation offers a practical approach to learning SQL, focusing on real-world applications and hands-on practice. Whether you're a beginner or looking to sharpen your skills, this guide provides the tools you need to extract, analyze, and interpret data effectively.
Key Highlights:
Foundations of SQL: Understand the basics of SQL, including data retrieval, filtering, and aggregation.
Advanced Queries: Learn to craft complex queries to uncover deep insights from your data.
Data Trends and Patterns: Discover how to identify and interpret trends and patterns in your datasets.
Practical Examples: Follow step-by-step examples to apply SQL techniques in real-world scenarios.
Actionable Insights: Gain the skills to derive actionable insights that drive informed decision-making.
Join us on this journey to enhance your data analysis capabilities and unlock the full potential of SQL. Perfect for data enthusiasts, analysts, and anyone eager to harness the power of data!
#DataAnalysis #SQL #LearningSQL #DataInsights #DataScience #Analytics
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Discussion on Vector Databases, Unstructured Data and AI
https://www.meetup.com/unstructured-data-meetup-new-york/
This meetup is for people working in unstructured data. Speakers will come present about related topics such as vector databases, LLMs, and managing data at scale. The intended audience of this group includes roles like machine learning engineers, data scientists, data engineers, software engineers, and PMs.This meetup was formerly Milvus Meetup, and is sponsored by Zilliz maintainers of Milvus.
State of Artificial intelligence Report 2023kuntobimo2016
Artificial intelligence (AI) is a multidisciplinary field of science and engineering whose goal is to create intelligent machines.
We believe that AI will be a force multiplier on technological progress in our increasingly digital, data-driven world. This is because everything around us today, ranging from culture to consumer products, is a product of intelligence.
The State of AI Report is now in its sixth year. Consider this report as a compilation of the most interesting things we’ve seen with a goal of triggering an informed conversation about the state of AI and its implication for the future.
We consider the following key dimensions in our report:
Research: Technology breakthroughs and their capabilities.
Industry: Areas of commercial application for AI and its business impact.
Politics: Regulation of AI, its economic implications and the evolving geopolitics of AI.
Safety: Identifying and mitigating catastrophic risks that highly-capable future AI systems could pose to us.
Predictions: What we believe will happen in the next 12 months and a 2022 performance review to keep us honest.
Global Situational Awareness of A.I. and where its headedvikram sood
You can see the future first in San Francisco.
Over the past year, the talk of the town has shifted from $10 billion compute clusters to $100 billion clusters to trillion-dollar clusters. Every six months another zero is added to the boardroom plans. Behind the scenes, there’s a fierce scramble to secure every power contract still available for the rest of the decade, every voltage transformer that can possibly be procured. American big business is gearing up to pour trillions of dollars into a long-unseen mobilization of American industrial might. By the end of the decade, American electricity production will have grown tens of percent; from the shale fields of Pennsylvania to the solar farms of Nevada, hundreds of millions of GPUs will hum.
The AGI race has begun. We are building machines that can think and reason. By 2025/26, these machines will outpace college graduates. By the end of the decade, they will be smarter than you or I; we will have superintelligence, in the true sense of the word. Along the way, national security forces not seen in half a century will be un-leashed, and before long, The Project will be on. If we’re lucky, we’ll be in an all-out race with the CCP; if we’re unlucky, an all-out war.
Everyone is now talking about AI, but few have the faintest glimmer of what is about to hit them. Nvidia analysts still think 2024 might be close to the peak. Mainstream pundits are stuck on the wilful blindness of “it’s just predicting the next word”. They see only hype and business-as-usual; at most they entertain another internet-scale technological change.
Before long, the world will wake up. But right now, there are perhaps a few hundred people, most of them in San Francisco and the AI labs, that have situational awareness. Through whatever peculiar forces of fate, I have found myself amongst them. A few years ago, these people were derided as crazy—but they trusted the trendlines, which allowed them to correctly predict the AI advances of the past few years. Whether these people are also right about the next few years remains to be seen. But these are very smart people—the smartest people I have ever met—and they are the ones building this technology. Perhaps they will be an odd footnote in history, or perhaps they will go down in history like Szilard and Oppenheimer and Teller. If they are seeing the future even close to correctly, we are in for a wild ride.
Let me tell you what we see.
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeWalaa Eldin Moustafa
Dynamic policy enforcement is becoming an increasingly important topic in today’s world where data privacy and compliance is a top priority for companies, individuals, and regulators alike. In these slides, we discuss how LinkedIn implements a powerful dynamic policy enforcement engine, called ViewShift, and integrates it within its data lake. We show the query engine architecture and how catalog implementations can automatically route table resolutions to compliance-enforcing SQL views. Such views have a set of very interesting properties: (1) They are auto-generated from declarative data annotations. (2) They respect user-level consent and preferences (3) They are context-aware, encoding a different set of transformations for different use cases (4) They are portable; while the SQL logic is only implemented in one SQL dialect, it is accessible in all engines.
#SQL #Views #Privacy #Compliance #DataLake
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Aggregage
This webinar will explore cutting-edge, less familiar but powerful experimentation methodologies which address well-known limitations of standard A/B Testing. Designed for data and product leaders, this session aims to inspire the embrace of innovative approaches and provide insights into the frontiers of experimentation!
2. Introduction
• MS SQL Server is a database server
• Product of Microsoft
• Enables user to write queries and other
SQL statements and execute them
• Consists of several features. A few are:
– Query Analyzer
– Profiler
– Service Manager
– Bulk Copy Program (BCP)
3. Profiler
• Monitoring tool
• Used for performance tuning
• Uses traces – an event monitoring
protocol
• Event may be a query or a transaction like
logins etc
4. Service Manager
• Helps us to manage services
• More than one instance of SQL server can
be installed in a machine
• First Instance is called as default instance
• Rest of the instances (16 max) are called
as named instances
• Service manager helps in starting or
stopping the instances individually
5. Instances
• Each instance is hidden from another instance
• Enhances security
• Every instance has its own set of Users, Admins,
Databases, Collations
• Advantage of having multiple instance is
– Multi company support (Each company can have its
own instance and create databases on the same
server, independent on each other)
– Server consolidation (Can host up to 10 server
applications on a single machine)
6. BCP
• Bulk Copy Program
• A powerful command line utility that
enables us to transfer large number of
records from a file to database
• Time taken for copying to and from
database is very less
• Helps in back up and restoration
7. Query Analyzer
• Allows us to write queries and SQL
statements
• Checks syntax of the SQL statement
written
• Executes the statements
• Store and reload statements
• Save the results in file
• View reports (either as grid or as a text)
8. SQL Database Objects
• A SQL Server database has lot of objects like
– Tables
– Views
– Stored Procedures
– Functions
– Rules
– Defaults
– Cursors
– Triggers
9. System Databases
• By default SQL server has 4 databases
– Master : System defined stored procedures,
login details, configuration settings etc
– Model : Template for creating a database
– Tempdb : Stores temporary tables. This db is
created when the server starts and dropped
when the server shuts down
– Msdb : Has tables that have details with
respect to alerts, jobs. Deals with SQL Server
Agent Service
10. Creating a database
• We need to use Master database for
creating a database
• By default the size of a database is 1 MB
• A database consists of
– Master Data File (.mdf)
– Primary Log File (.ldf)
11. Database operations
• Changing a database
Use <dbname>
• Creating a database
Create database <dbname>
• Dropping a database
Drop database <dbname>
12. SQL Server Data types
• Integer : Stores whole number
• Float : Stores real numbers
• Text: Stores characters
• Decimal: Stores real numbers
• Money : Stores monetary data. Supports 4 places
after decimal
• Date : Stores date and time
• Binary : Stores images and other large objects
• Miscellaneous : Different types special to SQL Server.
(Refer to notes for more info)
14. Select Statements
• To execute a statement in MS SQL, Select the statement and
Click on the Execute button in the query analyser or press F5
• This is used to retrive records from a table
• Eg. Select * from table1;
– This will fetch all rows and all columns from table1
• Eg. Select col1,col2 from table1
– This will fetch col1 and col2 from table1 for all rows
• Eg. Select * from table1 where <<condn>>
– This will fetch all rows from table1 that satisfies a condition
• Eg. Select col1,col2 from table1 where <<condn>>
– This will fetch col1 and col2 of rows from table1 that satisfies a
condition
15. Select Options
• Aggregate functions
– Sum(col1): sum of data in the column col1
– Max(col1): data with maximum value in col1
– Min(col1): data with minimum value in col1
– Avg(col1): Average of data in col1
– Count(col1): Number of not null records in table
• Grouping – Group by col1 : Groups data by col1
• Ordering – Order by col1 : Orders the result in
ascending order (default order) of col1
• Filtering – Where <<condn>> and Having
<<condn>>
16. Table management
Create table tablename
(
col1 data type,
col2 data type
);
- Creates a table with two columns
Drop table tablename;
- Drops the table structure
17. Insert statements
• Inserting data to all columns
– Insert into tablename(col1,col2) values(v1,v2)
– Insert into tablename values(v1,v2)
• Inserting data to selected columns
– Insert into tablename(col1) values (v1)
– Insert into tablename(col2) values (v2)
18. Update statement
Update table tablename
Set colname=value
- This updates all rows with colname set to value
Update table tablename
Set colname=value
Where <<condition>>
- This updates selected rows with colname as
value only if the row satisfies the condition
19. Delete statements
Delete from table1;
Deletes all rows in table1
Delete from table1 where <<condition>>
Deletes few rows from table1 if they satisfy
the condition
20. Truncate statement
• Truncate table tablename
• Removes all rows in a table
• Resets the table.
• Truncate does the following, where as
delete statement does not
– Releases the memory used
– Resets the identity value
– Does not invoke delete trigger
21. Alter statements
• Used to modify table structure
– Add new column
– Change data type of existing column
– Delete a column
– Add or remove constraints like foreign key,
primary key
22. More table commands
• Viewing tables in a data base:
– Exec sp_tables “a%”
– This gives all tables in the current database
that starts with “a”
• Viewing table strucure:
– Exec sp_columns <<tablename>>
– Exec sp_columns student;
23. Joins
• Cross Join
– Cartesian product. Simply merges two tables.
• Inner Join
– Cross join with a condition. Used to find matching
records in the two tables
• Outer Join
– Used to find un matched rows in the two tables
• Self Join
– Joining a table with itself
24. Cross Join
There are two tables A and B
A has a column Id and data (1,2,3)
B has a column Id and data (A,B)
If I put
Select A.Id, B.Id from A,B
This generates output as
A 1
B 1
C 1
A 2
B 2
C 2
25. Self Join
There is a table called Emp with the following structure:
empid ename mgrid
1 A null
2 B 1
3 C 1
4 D 2
If I want to print all managers using self join, I should write quey as:
select e1.ename from
emp e1,emp e2
where e1.mgrid = e2.empid
26. Inner Join
I have 2 tables Student(sid,Name) and Marks(Sid,Subject,Score)
If I want to print the marks of all students in the following format,
Name Subject Score
Select Name,Subject,Score from
Student s join Marks m
On s.sid = m.sid
27. Outer Join
• Right outer Join
– Print all the records in the second table with null
values for missing records in the first table
• Left outer Join
– Print all the records in the first table with null values
for missing records in the second table
• Full outer Join
– Prints all records in both the table with null values for
missing records in both the table
28. Left Outer Join
I have a table Employee (Eid, Ename, Mid) and
a table Machine (Mid,ManufacturerName)
Employee
Eid EName Mid
1 ABC 1
2 DEF 3
Machine
Mid ManufacturerName
1 Zenith
2 HP
29. Left Outer Join
I want to print the employee name and machine name.
If I write a query using inner join, then the second employee will
not be displayed as the mid in his record is not avilable with the second
table.
So I go for left outer join. The query is as shown below:
Select Ename, ManufacturerName from Employee e left outer join
Machine m on e.Mid = m.Mid
30. Right outer Join
Assume data in the tables like this:
Employee
Eid EName Mid
1 ABC 1
2 DEF
Machine
Mid ManufacturerName
1 Zenith
2 HP
31. Right Outer Join
If I want to find which machine is unallocated, I can use right outer join.
The query is as follows:
Select Ename, ManufacturerName from Employee e right outer join
Machine m on e.Mid = m.Mid
This yields a result
ABC Zenith
HP
32. Full Outer Join
Assume data in the tables like this:
Employee
Eid EName Mid
1 ABC 1
2 DEF
3 GHI 2
Machine
Mid ManufacturerName
1 Zenith
2 HP
3 Compaq
33. Full Outer Join
If I want to find people who have been un allocated with a system and
machines that are been un allocated, I can go for full outer join.
Query is like this:
Select Ename, ManufacturerName from Employee e full outer join
Machine m on e.Mid = m.Mid
This yields a result
ABC Zenith
DEF
GHI HP
Compaq
34. Views
• Views are logical tables
• They are pre compiled objects
• We can select few columns or rows from a
table and put the data set in a view and
can use view in the same way as we use
tables
35. Views
• Create views:
Create view viewname as select stmt
Create view view_emp as select empid,
empname from employee;
• Select from views:
Select * from viewname
Select empid,empname view_emp;
• Drop views:
Drop view viewname
Drop view view_emp;
36. String Functions
• Substring(string,start,length) – Will fetch
characters starting at a specific index extending
to length specified.
• Left(string,length) – Fetches number of
characters specified by length from left of the
string
• Right(string,length) – Fetches number of
characters specified by length from right of the
string
• Len(string) – Returns the length of a string
37. String Functions
• Ltrim(string) – Removes leading spaces in
a string
• Rtrim(string) – Removes trailing spaces in
a string
• Lower(string) – Converts the characters in
a string to lower case
• Upper(string) – Converts the characters in
a string to upper case
38. Numeric Functions
• ABS(Number) – Fetches the modulo value
(Positive value) of a number
• CEILING(Number) – Fetches the closest
integer greater than the number
• FLOOR(Number) – Fetches the closest
integer smaller than the number
• EXP(Number) – Fetches the exponent of a
number
39. Numeric Functions
• POWER(x,y) – Fetches x raised to the
power of y
• LOG(Number) – Fetches the natural
logarithmic value of the number
• LOG10(Number) – Fetches log to the base
10 of a number
• SQRT(Number) – Fetches the square root
of a number
40. Indexes
• Indexes make search and retrieve fast in a
database
• This is for optimizing the select statement
• Types of index
– Unique
– Non unique
– Clustered
– Non clustered
41. Index
Create index indexname on
tablename(columnname)
This creates a non clustered index on a table
Create unique clustered index index_name on
Student(sname);
This creates a unique and clustered index on the
Column Sname.
42. Sequences
• This creates an auto increment for a
column
• If a table has a column with sequence or
auto increment, the user need not insert
data explicitly for the column
• Sequence is implemented using the
concept of Identity
43. Identity
• Identity has
– A seed
– An increment
• Seed is the initial value
• Increment is the value by which we need
to skip to fetch the nextvalue
• Identity(1,2) will generate sequence
numbers 1,3,5,7…
44. Sample
Create table table1
(
Id integer identity(1,1),
Name varchar(10)
)
It is enough if we insert like this:
Insert into table1(name) values(‘Ram’);
Ram will automatically assigned value 1 for id