SQL Server - Optimizing Performance of a Large Dataset

N73k

6/4/23, 4:53 PM

I'm using SQL Server running on an Azure VM with 8 SSDs. The SSDs are grouped together in Storage Spaces as 1 disk - in order to increase the capacity and also to combine the IOPS/Throughput. But the "combine the IOPS" part just doesn't seem to be working as far as I can tell by all of my tests/benchmarks (the "combine the throughput" part is working though). In fact, it looks like the SSD performance (IOPS) are better on 1 single disk than the whole 8-physical-disk virtual disk. So, I'm thinking about just forgetting about Storage Spaces and splitting up my data across 8 disks.

But what would be the best way to do that? (I don't have much experience with mulitple files, or filegroups, or partitioning tables, and that sort of thing.)

Just make 8 mdf files (1 on each disk) and let SQL Server redistribute the data across all of these files? If so, I would like to know how SQL Server knows which disk a given record is on. Would doing this speed things up?

And maybe split up the ldf files too?

What about multiple filegroups? I really don't know what the practical difference is between multiple files and filegroups.

What about splitting up the big tables somehow by using a partitioning function? Would that help, since now, maybe, SQL Server would "have a better idea" of where (in which file) a given record would be - since that is defined by a partition function?

0 + 0

hard-drive

sql-server

azure

mfinni

6/4/23, 5:46 PM

What problem are you experiencing, that you're trying to solve?

Ben Thul

6/4/23, 9:50 PM

I'd be curious about your Storage Spaces setup. While it was on AWS, I had a good experience with a RAID-0 setup across multiple EBS volumes.

SQLpro

6/22/23, 9:09 AM

Are you sure that the problem is a disk throughput one ? What metrics did you take? And with what tool?

N73k

6/22/23, 3:13 PM

I used ATTO Disk Benchmark. The problem is with the IOPS, not the throughput. Thanks.

Elon Musk

I sit in a Tesla and translated this thread with Ai:

EN: SQL Server - Optimizing Performance of a Large Dataset

TH: SQL Server - การเพิ่มประสิทธิภาพของชุดข้อมูลขนาดใหญ่

RO: SQL Server - Optimizarea performanței unui set de date mare

RU: SQL Server — оптимизация производительности большого набора данных

VI: SQL Server - Tối ưu hóa hiệu suất của một tập dữ liệu lớn

Post an answer

Most people don’t grasp that asking a lot of questions unlocks learning and improves interpersonal bonding. In Alison’s studies, for example, though people could accurately recall how many questions had been asked in their conversations, they didn’t intuit the link between questions and liking. Across four studies, in which participants were engaged in conversations themselves or read transcripts of others’ conversations, people tended not to realize that question asking would influence—or had influenced—the level of amity between the conversationalists.