In this episode, we take a fresh look at Data Science for Logistics. In earlier episodes, we talked about related topics such as Forecasting and crafting a Picking Algorithm using Data Science. With the guests of this episode, we dive into two specific challenges in the warehousing domain: cost-based selection and put-away, with simulation as the common thread. During the recording, we found out that our guests have a big dream: realizing the Digital Twin of the bol.com fulfilment centre. But…
Spaces Summit is our annual internal bol.com conference for IT, by IT, and friends: two fun days of inspiration, knowledge sharing, bragging, and community. This year we had our first digital version of the summit. My name is Asparuh Hristov, and I work as a Data Science Craft Lead in the Forecasting team. I keep encountering people who believe Machine Learning can automate all existing jobs, including Data Science itself. I hope that this talk brings a bit more understanding of what Data Science is, which steps can be automated…
The topic of this show is exactly why we like to be the hosts of this podcast. We learn a lot about all areas in our business. In this show, we dive into the marketing domain. We will learn about models like multi-touch, lift studies, time decay, and more. We talk about Data Science for Marketing Attribution. Put simply, marketing attribution is the analytical science of determining which marketing tactics are contributing to sales or conversions. And then there is a Data Science twist to this. Guests Ernst Kuiper; Data…
Let’s look at another programming language we use at bol.com: Python. We use it for tools, Data Science and more. It has been around since 1991 and was created by a Dutch programmer, Guido van Rossum. Its design philosophy emphasizes code readability. The language’s core philosophy is summarized in the document The Zen of Python (PEP 20), which includes aphorisms such as: Beautiful is better than ugly. Explicit is better than implicit. Simple is better than complex. Complex is better than complicated. Readability counts. All well, but what…
Cloud as an enabler to become more data-driven. The adjective data-driven means that progress in an activity is compelled by data, rather than by intuition, personal experience or a gut feeling. Googling data-driven gives you over 637 million results, so it is time to add one more with this episode. But we wanted to make it more specific. We moved our big data into the cloud and gained some nice insights that we wanted to share. We asked our two guests of this episode to do so. It turned…
In this episode, we talk with Daniel and Emiel, software engineer and product owner in the customer support domain. In this domain, the focus is on helping our customers in the best way possible. But what if we could prevent customers from feeling the need to contact bol.com in the first place, they asked themselves. They realized this could be possible by analysing the various customer interactions we have via the chatbot “Billie”, live chat, phone and email. For these analyses, they introduced techniques from the Data…
For as long as retail has existed, people have tried to predict the future. An accurate forecast makes it much easier to buy the correct amount of products from suppliers, to know what you need to keep in stock, and even to know how sales will respond to specific promotions. Over the last couple of years, this domain has changed dramatically because of the introduction of Data Science and Artificial Intelligence. In this episode, we chat about this changing playing field to share our experiences with you. Guests: Harmen Prins, Erick Webbe. Hosts: Peter Brouwers…
In many systems at bol.com, response speed is very important. This blog is about the data structures and algorithms we used to make a specific analysis step a lot faster: finding the longest matching string prefix.
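To give a feel for the problem, here is a minimal sketch of longest-prefix matching with a trie. The excerpt does not say which data structure the post settles on, so the trie, the `PrefixTrie` class and its method names are assumptions chosen purely for illustration.

```python
# Hypothetical illustration: a character trie for longest-prefix lookup.
# This is one common technique, not necessarily the one used in the post.

_END = ""  # sentinel key; no single character equals the empty string


class PrefixTrie:
    def __init__(self, prefixes=()):
        self._root = {}
        for prefix in prefixes:
            self.add(prefix)

    def add(self, prefix):
        """Store a prefix, one nested dict level per character."""
        node = self._root
        for ch in prefix:
            node = node.setdefault(ch, {})
        node[_END] = prefix  # mark that a stored prefix ends here

    def longest_prefix(self, text):
        """Return the longest stored prefix of text, or None if none matches."""
        node = self._root
        best = node.get(_END)
        for ch in text:
            node = node.get(ch)
            if node is None:
                break
            if _END in node:
                best = node[_END]
        return best


trie = PrefixTrie(["10", "1012", "2"])
match = trie.longest_prefix("101234")
```

A lookup walks at most one trie level per input character, so its cost depends on the length of the input string rather than on the number of stored prefixes.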
Data Science and Machine Learning are becoming more integrated into current businesses. Especially in e-commerce, there is huge potential for predictive modeling. It is therefore no surprise that bol.com is putting extra focus on significantly expanding its Data Science efforts in the coming year. That’s not to say that there aren’t already some interesting Data Science projects running. In this blog post we will take a look at one of the projects I am currently working on with fellow data scientist Joep Janssen: the chunk project.
Ever since I started working for a WebAnalytics company in 2005, I’ve been working on problems related to making sense of web data. One of the most difficult elements in this type of analysis is making sense of the user agent.
Very often the raw web data I work with is stored in Apache HTTPD access log files that have been compressed using gzip.
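As a small illustration of working with such files, the sketch below streams lines from a gzip-compressed log using Python's standard `gzip` module. The function name `iter_log_lines` and the choice of UTF-8 with replacement of undecodable bytes are assumptions made for this example; the excerpt does not describe the tooling the post itself uses.

```python
import gzip


def iter_log_lines(path):
    """Yield lines from a gzip-compressed text file, one at a time.

    Hypothetical helper: decoding as UTF-8 with errors="replace" is an
    assumption, chosen so that odd bytes in a log do not abort the read.
    """
    with gzip.open(path, "rt", encoding="utf-8", errors="replace") as handle:
        for line in handle:
            yield line.rstrip("\n")
```

Because the file is decompressed as it is read, memory use stays flat regardless of the size of the log.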