I’m having a very strange issue with a 3850 right now. We use 3850’s as building routers on campus. Each building gets its own 3850 and then there is a variety of 4500, 2960X and 9300 switches deployed for layer 2.
On each 3850 we have the traffic separated into 2 vrfs: let’s call them Campus and Dist. I was recently asked to look at multicast configurations. We had a history of pim-sparse mode configs that no one had ever really examined. Long story short, it seems that pim-sparse was put on interfaces but no rp address was ever configured nor was there are any thought to passing multicast from one vrf to the other–if ever required.
Changes that I made:
Added a RP for each vrf using the loopback interface address for that vrf.
Created a standard ACL called MCAST-GRP and added the address I was testing with: 239.1.1.5
Added a config line to steer the RPF process so as to not drop traffic coming from one vrf to the other:
ip multicast vrf Dist rpf select vrf Campus group-list MCAST-GRP
Confirmed that multicast-routing was enabled for each vrf:
ip multicast-routing vrf Campus
ip multicast-routing vrf Dist
I then used a command line utility to test (multicast-tester). This is a simple python based multicast testing tool that I use on Macs. On Windows or Linux, the CisTech NetSpanner is a fantastic tool for testing multicast, but alas there is no Mac version.
I ran a test and it worked fine. I was able to start a multicast stream on a host in vrf Campus and join that group on a host in vrf Dist. After that I was looking at some show commands and ran:
sh ip pim vrf Campus rp mapping
This immediately hung my console and I had to open another session. There was no high cpu or mem usage to explain the hung console. After that, my rpf select mulitcast tests fail. I removed all of the vrf based multicast config that I had added and went home. I came back this morning and laid the commands back in and tested. Again, it worked fine. Ok, good. I am, in fact, not crazy. I then ran that same show command and again the same result. My console hung (but at least this time it did show the mapping), and my multicast tests fail again.
I can still ssh in from a new session and the router is fine. But this seems like pretty strange behaviour. And I just tested again and multicast is still broken from one vrf to the other. My console session did eventually come back. It took a few minutes. Very odd.
Anyone have any potential insight here?