Dr. James McCaffrey of Microsoft Research presents a full-code, step-by-step tutorial on an implementation of the technique that emphasizes simplicity and ease of modification over robustness and performance.
- By James McCaffrey
- 02/01/2024
Transforming a dataset into one with fewer columns is more complicated than it might seem, explains Dr. James McCaffrey of Microsoft Research in this full-code, step-by-step machine learning tutorial.
- By James McCaffrey
- 01/17/2024
You've carefully crafted a concurrent application taking advantage of all the latest and greatest language features. It scales magnificently and it's the jewel of the cloud. But what about the database? What happens when simultaneous queries are executed against the same data?
Dr. James McCaffrey of Microsoft Research guides you through a full-code, step-by-step tutorial on "one of the most important operations in machine learning."
- By James McCaffrey
- 01/03/2024
Spectral clustering is quite complex, but it can reveal patterns in data that aren't revealed by other clustering techniques.
- By James McCaffrey
- 12/18/2023
K-means is comparatively simple and works well with large datasets, but it assumes clusters are circular/spherical in shape, so it can only find simple cluster geometries.
- By James McCaffrey
- 12/01/2023
Microsoft shipped ML.NET 3.0, enhancing deep learning and data processing scenarios in the company's machine learning framework that lets devs create AI-infused apps completely within the .NET ecosystem.
Compared to other clustering techniques, DBSCAN does not require you to explicitly specify how many data clusters to use, explains Dr. James McCaffrey of Microsoft Research in this full-code, step-by-step machine learning tutorial.
- By James McCaffrey
- 11/15/2023
Dr. James McCaffrey of Microsoft Research explains GMM clustering in a full-code, step-by-step tutorial, noting that his data scientist colleagues have different opinions about the complicated technique.
- By James McCaffrey
- 11/01/2023
Jim Wooley takes 20 minutes to discuss Table Splitting, Owned Types, Query Filters, DbContext pooling, GroupBy, String Interpolation, Lazy loading, Value Converters, Data Seeding, Query types, Optimized correlated subqueries, Spatial support, Query Tags and more.
The main advantages of KNNR are simplicity and interpretability, says Dr. James McCaffrey of Microsoft Research in presenting this full-code, step-by-step tutorial.
- By James McCaffrey
- 09/05/2023
Integrating the data science darling programming language with the super-popular spreadsheet program means analysts will no longer have to jump through so many workaround hoops or use third-party add-ins.
KRR is especially useful when there is limited training data, says Dr. James McCaffrey of Microsoft Research in this full-code, step-by-step tutorial.
- By James McCaffrey
- 08/15/2023
The open source project lets developers and data pros use its Semantic Kernel SDK to experiment with and test the abilities of LLMs to generate SQL queries based on natural language expressions.
Implementing LRR from scratch is harder than using a library like scikit-learn, but it helps you customize your code, makes it easier to integrate with other systems, and gives you a complete understanding of how LRR works.
- By James McCaffrey
- 08/01/2023