SQL Data Services: Your Database in the Cloud

This will really make sharing of data in the cloud so much easier

One thing in the Microsoft cloud I find really interesting is SQL Data Services (SDS), together with Huron/Data Hub, the SQL cloud sync service. It is one of the “cloud” offerings I believe has a lot of potential, and it should make sharing data in the cloud much easier.

I had the pleasure of sitting down to talk about this subject with Liam Cavanagh, Sr. Program Manager on the SDS/Huron team at Microsoft, and got some insight into the current state and the future of this new technology. In this article I’ll talk about SQL Data Services, and I’ll follow up with one about Data Hub/Huron.

SQL Data Services is, at its core, nothing more than a (Microsoft SQL) database-as-a-service offering from Microsoft, part of the Azure Services Platform. The first thing you’ll find about SQL Data Services is that it “is just SQL” (at least that’s how Microsoft is advertising it). And it is. You can change your connection string from your local database to your cloud database and access the “cloud” SQL. You can use SQL Server Management Studio to run queries, create tables, and do (almost) everything you do locally. The first version of SQL Data Services will support tables, indexes, views, stored procedures, triggers, constraints, table variables, session temp tables, and so on. It will not support distributed transactions or queries, CLR, Service Broker, Spatial, or physical server or catalog DDL and views. Reporting services and Business Intelligence services will be available sometime in the future. So far there’s no information on when the features left out of the first version will become available.
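To make the “just change the connection string” point concrete, here is a minimal sketch in Python. It only assembles ODBC-style connection strings and does not connect to anything. The host names, database name, and credentials are hypothetical placeholders, not values published by Microsoft, and the helper `build_connection_string` is my own illustration.

```python
def build_connection_string(server: str, database: str, user: str,
                            password: str, driver: str = "SQL Server") -> str:
    """Assemble an ODBC-style connection string from its parts."""
    parts = {
        "Driver": "{" + driver + "}",
        "Server": server,
        "Database": database,
        "Uid": user,
        "Pwd": password,
    }
    return ";".join(f"{key}={value}" for key, value in parts.items()) + ";"


# Hypothetical endpoints: neither host name comes from Microsoft documentation.
LOCAL_SERVER = "localhost"
CLOUD_SERVER = "myserver.example-sds-host.net"

local = build_connection_string(LOCAL_SERVER, "orders", "app_user", "secret")
cloud = build_connection_string(CLOUD_SERVER, "orders", "app_user", "secret")

print(local)
print(cloud)
```

The two strings differ only in the `Server` value, which is the whole idea: the rest of the application code stays as it is. A real application would also need to escape special characters in the password, which this sketch skips.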

The initial commercial release will limit database size, most likely to around 10 GB. The limit might be lifted in future releases, but for now it is there to stay. It exists mainly because Microsoft feels this is a size it can easily manage in the background: backups, moving a database from one server to another, data recovery, etc. You can have as many databases as you want and, let’s be honest, 10 GB is a lot of data to store.
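If your data does outgrow a single database, the arithmetic for how many databases you would need is simple. The sketch below assumes the cap really ends up at 10 GB (the figure is still “most likely”) and that the data can be split evenly across databases, which real schemas rarely allow; the function name is my own.

```python
import math

SIZE_CAP_GB = 10  # assumed per-database cap; the final figure is not confirmed


def databases_needed(total_gb: float, cap_gb: float = SIZE_CAP_GB) -> int:
    """Return the minimum number of databases needed to hold total_gb of data."""
    if total_gb < 0:
        raise ValueError("total_gb must not be negative")
    # Even an empty data set needs one database to live in.
    return max(1, math.ceil(total_gb / cap_gb))


print(databases_needed(35))
```

With 35 GB of data and a 10 GB cap, the result is four databases.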

Other limitations have to do with the duration of transactions and the resource load on the server hosting your data. Keep in mind that your data will live on servers in Microsoft’s data centers, alongside data from other customers. Microsoft makes sure your data is secure (I’m sure we’ll see some guarantees in the SLA), but to maintain good multi-tenant practices it will have to throttle, or otherwise make sure that all the databases on a server get enough resources to function properly. One of the techniques used is moving more active databases from a loaded server to an idle one.
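On the client side, throttling and databases being moved between servers suggest that an application should be prepared to retry a failed operation. Microsoft has not described how SDS reports these conditions, so the sketch below is an assumption on my part: `TransientDatabaseError` is a hypothetical stand-in for whatever error a real driver raises, and the retry policy (exponential backoff) is a common general-purpose choice, not an SDS recommendation.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class TransientDatabaseError(Exception):
    """Hypothetical stand-in for a throttling or dropped-connection error."""


def run_with_retry(operation: Callable[[], T], attempts: int = 4,
                   base_delay: float = 0.5,
                   sleep: Callable[[float], None] = time.sleep) -> T:
    """Run operation, retrying transient failures with exponential backoff."""
    for attempt in range(attempts):
        try:
            return operation()
        except TransientDatabaseError:
            if attempt == attempts - 1:
                raise  # out of attempts: let the caller see the error
            sleep(base_delay * 2 ** attempt)  # 0.5 s, 1 s, 2 s, ...
    raise ValueError("attempts must be at least 1")


# Simulated query that fails twice before succeeding.
calls = {"count": 0}


def flaky_query() -> str:
    calls["count"] += 1
    if calls["count"] < 3:
        raise TransientDatabaseError("server busy")
    return "ok"


delays = []  # record the waits instead of actually sleeping
result = run_with_retry(flaky_query, sleep=delays.append)
print(result, delays)
```

Passing `delays.append` as the `sleep` function keeps the demonstration instant; in real code you would leave the default `time.sleep` in place.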

As with any other database, data corruption can happen in the cloud database too. Microsoft has mechanisms in place to recover from it (mainly by keeping database replicas on multiple servers); however, it doesn’t provide any user-level backup of the database (at least in the first version). As we’ve seen in some of the PDC 2008 presentations, in the future we will probably see database backup/restore and geo-replication, both synchronous (a replica set spans datacenters) and asynchronous (independent replica sets in different datacenters).

There’s no surprise in how concurrency is handled in the cloud database: SDS uses the same mechanisms as any SQL Server. SQL Server supports optimistic (timestamps or value comparisons) and pessimistic concurrency models, and the presence of the “cloud” doesn’t change the model at all. If you’re really curious about the subject, look up the information on SQL Server 2008 concurrency, which essentially deals with how SQL Server handles locking.
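For readers who haven’t used optimistic concurrency, here is a minimal sketch of the value-comparison approach. It uses Python’s built-in `sqlite3` module purely as a runnable stand-in, since I can’t assume you have a SQL Server or SDS instance at hand; the table, the integer `version` column, and the helper functions are all my own illustration. On SQL Server the same pattern is usually built on a `rowversion` (timestamp) column.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE account (id INTEGER PRIMARY KEY, balance INTEGER, version INTEGER)"
)
conn.execute("INSERT INTO account VALUES (1, 100, 1)")
conn.commit()


def read_account(conn, account_id):
    """Return (balance, version) for the given account."""
    return conn.execute(
        "SELECT balance, version FROM account WHERE id = ?", (account_id,)
    ).fetchone()


def update_balance(conn, account_id, new_balance, expected_version):
    """Update only if the row still has the version we read; report success."""
    cursor = conn.execute(
        "UPDATE account SET balance = ?, version = version + 1 "
        "WHERE id = ? AND version = ?",
        (new_balance, account_id, expected_version),
    )
    conn.commit()
    return cursor.rowcount == 1


# Two clients read the same row before either one writes.
balance_a, version_a = read_account(conn, 1)
balance_b, version_b = read_account(conn, 1)

first = update_balance(conn, 1, balance_a - 30, version_a)   # succeeds
second = update_balance(conn, 1, balance_b + 50, version_b)  # stale version, rejected

print(first, second, read_account(conn, 1))
```

No lock is held between the read and the write. The second writer simply discovers that the row changed underneath it and has to re-read and try again, which is the trade-off against the pessimistic model.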

With the database in the cloud, there is going to be latency when accessing it from your premises. Microsoft recommends running the applications that use the cloud database on the Azure platform, so that latency is minimal. When you deploy an application on Windows Azure and provision an SDS server, the two will be co-located to provide low latency between the application and the data.
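A rough back-of-the-envelope calculation shows why co-location matters for chatty applications. The latency figures below are illustrative assumptions of mine, not measurements of SDS or Windows Azure, and the model ignores query execution time entirely.

```python
def total_network_wait_ms(round_trips: int, latency_ms: float) -> float:
    """Time spent waiting on the network if each query costs one round trip."""
    return round_trips * latency_ms


QUERIES_PER_PAGE = 200       # assumed: a chatty page that issues many small queries
ON_PREMISES_LATENCY_MS = 40  # assumed round trip from your office to the data center
CO_LOCATED_LATENCY_MS = 1    # assumed round trip inside the same data center

remote = total_network_wait_ms(QUERIES_PER_PAGE, ON_PREMISES_LATENCY_MS)
co_located = total_network_wait_ms(QUERIES_PER_PAGE, CO_LOCATED_LATENCY_MS)
print(remote, co_located)
```

Under these assumptions the same page spends 8 seconds waiting on the network from on-premises versus 0.2 seconds when co-located. Batching queries reduces the number of round trips and helps in either case.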

You will find out rather quickly that there’s no web-based administration tool for managing your database in the cloud, but some kind of web admin tool (from Microsoft or a third party) will most probably be available in the near future.

The exact billing model is not yet available. However, we know from Nigel Ellis (the person responsible for the design, development, and release of SQL Data Services) that customers will be charged for the physical database size, including all data and all indexes defined.

What does SDS offer beyond other SQL hosting services? High availability: your data is guaranteed to be available all the time. If you host SQL Server yourself, high availability probably requires two (mirrored) servers, so that if one goes down the other can take over. The SDS solution also seems cost-effective, since you pay only for what you use.

Initially SDS was built to use the SOAP and REST protocols to access the data. With the switch to being a full relational database in the cloud, SDS now uses the Tabular Data Stream (TDS) protocol, an application-layer protocol used to transfer data between a database server and a client. It was initially developed by Sybase Inc. in 1984 for its Sybase SQL Server relational database engine, and was later adopted by Microsoft for Microsoft SQL Server. Many drivers already implement this protocol: ODBC, OLE DB, ADO.NET, and the ODBC driver for the PHP stack. You can also access it from Ruby, and from Linux using an open-source TDS driver such as FreeTDS.

Of course, it will take some time for the platform to mature. The goal of this first version is to address the needs of 95% or more of web and departmental applications.

The SQL Data Services Community Technology Preview (CTP) will be available soon. You can join the mailing list to receive an e-mail notification when it becomes available.

More Stories By Alin Irimie

Alin Irimie is a software engineer (architect, designer, and developer) with over 10 years of experience in various languages and technologies. Currently he is Messaging Security Manager at Sunbelt Software, a security company. He is also the CTO of RADSense Software, a software consulting company. He has expertise in Microsoft technologies such as .NET Framework, ASP.NET, AJAX, SQL Server, C#, and C++, as well as Ruby on Rails and cloud computing (Amazon and Windows Azure), and he also blogs about cloud technologies.
